Speculative Pipeline Decoding: Higher-Accruacy and Zero-Bubble Speculation via Pipeline Parallelism Paper • 2605.30852 • Published 14 days ago • 10
electricsheepasia/asia-owid-anemia-pregnant-women-vs-children Viewer • Updated 9 days ago • 1.15k • 29 • 1
REPOT: Recoverable Program-of-Thought via Checkpoint Repair Paper • 2605.30052 • Published 15 days ago • 10
Gamma-World: Generative Multi-Agent World Modeling Beyond Two Players Paper • 2605.28816 • Published 16 days ago • 423
sentence-transformers/all-MiniLM-L6-v2 Sentence Similarity • 22.7M • Updated 11 days ago • 225M • • 4.93k
ardauzunoglu/v18_smollm2_c4_lowq_200m2b_subsample20m_grpo_prompt Viewer • Updated 15 days ago • 100k • 38 • 1
SaaSBench: Exploring the Boundaries of Coding Agents in Long-Horizon Enterprise SaaS Engineering Paper • 2605.17526 • Published 26 days ago • 7
mradermacher/LFM2-8B-A1B-GLM-4.7-Flash-Thinking-Quantum-IQ1C-P-i1-GGUF 8B • Updated 21 days ago • 3.04k • 4
CopT: Contrastive On-Policy Thinking with Continuous Spaces for General and Agentic Reasoning Paper • 2605.20075 • Published 24 days ago • 4
sentence-transformers/paraphrase-multilingual-MiniLM-L12-v2 Sentence Similarity • 0.1B • Updated Jan 28 • 46M • • 1.27k
From Context to Skills: Can Language Models Learn from Context Skillfully? Paper • 2604.27660 • Published May 3 • 166