Running on Zero • 14 • Qwen3-VL Multimodal Search Engine • Cross-modal text-image search powered by Qwen3-VL
huihui-ai/Huihui-Qwen3-VL-8B-Instruct-abliterated Image-Text-to-Text • 9B • Updated Dec 15, 2025 • 8.52k • 159
fancyfeast/llama-joycaption-beta-one-hf-llava Image-Text-to-Text • 8B • Updated May 16, 2025 • 61.7k • 312
huihui-ai/Huihui-MiniCPM-V-4_5-abliterated Image-Text-to-Text • 9B • Updated Sep 8, 2025 • 2.5k • 29
Running on Zero • 23 • Joy Caption Beta One • Generate descriptive captions for images with various styles and formats
Running on Zero • Featured • 938 • Joy Caption Beta One • Generate detailed captions or tags for any uploaded image
HuggingFaceTB/SmolVLM2-500M-Video-Instruct Image-Text-to-Text • Updated Apr 8, 2025 • 183k • 119
yayayaaa/florence-2-large-ft-moredetailed Image-to-Text • 0.8B • Updated Dec 13, 2025 • 80 • 16
Runtime error • Featured • 198 • Better Florence 2 • Analyze images to detect objects, generate captions, or perform OCR
Running on Zero • Featured • 829 • Florence 2 • Perform image captioning, detection, OCR and more with Florence-2