internlm/EndoCoT-Data
Preview
• Updated
• 77 • 5
None defined yet.
EndoCoT: Scaling Endogenous Chain-of-Thought Reasoning in Diffusion Models
ARM-Thinker: Reinforcing Multimodal Generative Reward Models with Agentic Tool Use and Visual Reasoning