Janus: Disaggregating Attention and Experts for Scalable MoE Inference Paper • 2512.13525 • Published 24 days ago • 5
DentalGPT: Incentivizing Multimodal Complex Reasoning in Dentistry Paper • 2512.11558 • Published 27 days ago • 42
PAN: A World Model for General, Interactable, and Long-Horizon World Simulation Paper • 2511.09057 • Published Nov 12, 2025 • 76
Can Multimodal LLMs See Materials Clearly? A Multimodal Benchmark on Materials Characterization Paper • 2509.09307 • Published Sep 11, 2025 • 6
BlenderLLM: Training Large Language Models for Computer-Aided Design with Self-improvement Paper • 2412.14203 • Published Dec 16, 2024 • 1
S2S-Arena, Evaluating Speech2Speech Protocols on Instruction Following with Paralinguistic Information Paper • 2503.05085 • Published Mar 7, 2025 • 47
MTalk-Bench: Evaluating Speech-to-Speech Models in Multi-Turn Dialogues via Arena-style and Rubrics Protocols Paper • 2508.18240 • Published Aug 22, 2025
EchoX: Towards Mitigating Acoustic-Semantic Gap via Echo Training for Speech-to-Speech LLMs Paper • 2509.09174 • Published Sep 11, 2025 • 61
Voila: Voice-Language Foundation Models for Real-Time Autonomous Interaction and Voice Role-Play Paper • 2505.02707 • Published May 5, 2025 • 85
LLM-Powered GUI Agents in Phone Automation: Surveying Progress and Prospects Paper • 2504.19838 • Published Apr 28, 2025 • 22
CLEA: Closed-Loop Embodied Agent for Enhancing Task Execution in Dynamic Environments Paper • 2503.00729 • Published Mar 2, 2025 • 3
STMA: A Spatio-Temporal Memory Agent for Long-Horizon Embodied Task Planning Paper • 2502.10177 • Published Feb 14, 2025 • 6
On the Compositional Generalization of Multimodal LLMs for Medical Imaging Paper • 2412.20070 • Published Dec 28, 2024 • 42
HuatuoGPT-o1, Towards Medical Complex Reasoning with LLMs Paper • 2412.18925 • Published Dec 25, 2024 • 106
UCFE: A User-Centric Financial Expertise Benchmark for Large Language Models Paper • 2410.14059 • Published Oct 17, 2024 • 62
StableNormal: Reducing Diffusion Variance for Stable and Sharp Normal Paper • 2406.16864 • Published Jun 24, 2024 • 3
LASA: Instance Reconstruction from Real Scans using A Large-scale Aligned Shape Annotation Dataset Paper • 2312.12418 • Published Dec 19, 2023 • 2