zai-org/AutoGLM-Phone-9B-Multilingual • Image-Text-to-Text • 934k • Updated 18 days ago • 10.7k • 206
GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning Paper • 2507.01006 • Published Jul 1 • 246
Comparing Captioning Models Space • Running • Featured • 459 • Generate captions for images using multiple models