microsoft/Phi-4-reasoning-vision-15B Image-Text-to-Text β’ 15B β’ Updated 7 days ago β’ 24.5k β’ 156
Running on Zero MCP Featured 88 GLM OCR Demo π 88 Multimodal OCR model for complex document understanding.
mistralai/Voxtral-Mini-4B-Realtime-2602 Automatic Speech Recognition β’ 4B β’ Updated 14 days ago β’ 737k β’ 725