Domino: Decoupling Causal Modeling from Autoregressive Drafting in Speculative Decoding Paper • 2605.29707 • Published 14 days ago • 144
gradients-io-tournaments/tournament-exp-qwen-1.5b-test-ed5f2570-6088-4b63-8edd-7e797eddbb3c-5Exp355e 7B • Updated 11 days ago • 31 • 1
Anti-Self-Distillation for Reasoning RL via Pointwise Mutual Information Paper • 2605.11609 • Published about 1 month ago • 195
ReactiveGWM: Steering NPC in Reactive Game World Models Paper • 2605.15256 • Published 28 days ago • 28
OpenSearch-VL: An Open Recipe for Frontier Multimodal Search Agents Paper • 2605.05185 • Published May 6 • 102
From Context to Skills: Can Language Models Learn from Context Skillfully? Paper • 2604.27660 • Published May 3 • 166