Medical
  google/medgemma-1.5-4b-it • Image-Text-to-Text • Updated Jan 23 • 146k • 497
  google/medsiglip-448 • Zero-Shot Image Classification • 0.9B • Updated Jul 10, 2025 • 27.8k • 122
  google/medgemma-27b-it • Image-Text-to-Text • Updated Jul 10, 2025 • 45.1k • 319
  google/medgemma-27b-text-it • Text Generation • Updated Sep 16, 2025 • 49.1k • 407

Audio
  nvidia/audio-flamingo-3-hf • Audio-Text-to-Text • Updated Jan 27 • 169k • 174
  facebook/sam-audio-large • Updated Dec 30, 2025 • 43k • 375
  google/medasr • Automatic Speech Recognition • Updated Jan 26 • 38.9k • 290
  FunAudioLLM/Fun-CosyVoice3-0.5B-2512 • Text-to-Speech • Updated Feb 3 • 6.35k • 473

OCR
  lightonai/LightOnOCR-1B-1025 • Image-to-Text • Updated 17 days ago • 145k • 246
  tencent/HunyuanOCR • Image-Text-to-Text • Updated Jan 13 • 370k • 555
  PaddlePaddle/PaddleOCR-VL-1.5 • Image-Text-to-Text • 1.0B • Updated 3 days ago • 20.6k • 453
  PaddlePaddle/PP-DocLayoutV3 • Image Segmentation • Updated Jan 30 • 14k • 53

Judge
  ai-forever/pollux-judge-32b • Text Generation • 33B • Updated Jun 27, 2025 • 211 • 5
  ai-forever/pollux-judge-32b-r • Text Generation • 33B • Updated Jun 27, 2025 • 5

Ru text encoders
  ai-forever/ru-en-RoSBERTa • Feature Extraction • 0.4B • Updated Sep 26, 2024 • 117k • 77
  Tochka-AI/ruRoPEBert-e5-base-512 • Feature Extraction • 0.1B • Updated Mar 13, 2024 • 4
  Tochka-AI/ruRoPEBert-e5-base-2k • Feature Extraction • 0.1B • Updated Mar 13, 2024 • 2.23k • 11

VLMs
  Qwen/Qwen2-VL-7B-Instruct • Image-Text-to-Text • Updated Feb 6, 2025 • 1.54M • 1.27k
  NVEagle/Eagle-X5-13B-Chat • Image-Text-to-Text • 15B • Updated Sep 16, 2024 • 12 • 28
  internlm/internlm-xcomposer2d5-7b • Visual Question Answering • Updated Jul 22, 2024 • 2.11k • 209
  AIRI-Institute/OmniFusion • Updated Apr 10, 2024 • 59

VLA models
  nvidia/Alpamayo-R1-10B • Robotics • Updated Jan 8 • 57.2k • 379
  nvidia/GR00T-N1.6-3B • Robotics • 3B • Updated Dec 15, 2025 • 34.4k • 67

Translate
  google/translategemma-12b-it • Image-Text-to-Text • Updated Jan 28 • 564k • 268
  tencent/HY-MT1.5-1.8B • Translation • Updated Jan 1 • 34.2k • 575
  google/translategemma-4b-it • Image-Text-to-Text • Updated Jan 28 • 141k • 670

Video encoders
  google/videoprism-lvt-base-f16r288 • Video Classification • Updated Jul 29, 2025 • 87.5k • 11
  nvidia/omni-embed-nemotron-3b • Feature Extraction • 5B • Updated Oct 9, 2025 • 1.77k • 93

Datasets for Embodied
  agibot-world/AgiBotWorld-Alpha • Viewer • Updated Sep 29, 2025 • 49.8M • 7.69k • 213
  nvidia/PhysicalAI-Autonomous-Vehicles • Updated Jan 21 • 305k • 769
  genrobot2025/10Kh-RealOmin-OpenData • Updated 9 days ago • 52.2k • 189

Text2Image
  stabilityai/stable-diffusion-3-medium • Text-to-Image • Updated Aug 12, 2024 • 7.68k • 4.91k
  black-forest-labs/FLUX.2-dev • Image-to-Image • Updated 20 days ago • 921k • 1.42k
  fal/FLUX.2-dev-Turbo • Text-to-Image • Updated Dec 30, 2025 • 19.5k • 337
  black-forest-labs/FLUX.2-klein-4B • Image-to-Image • Updated 13 days ago • 203k • 518