daVinci-Dev: Agent-native Mid-training for Software Engineering • Paper • 2601.18418 • Published 2 days ago • 113
AgencyBench: Benchmarking the Frontiers of Autonomous Agents in 1M-Token Real-World Contexts • Paper • 2601.11044 • Published 13 days ago • 34
LiveTalk: Real-Time Multimodal Interactive Video Diffusion via Improved On-Policy Distillation • Paper • 2512.23576 • Published about 1 month ago • 65