view article Article Unlocking Agentic RL Training for GPT-OSS: A Practical Retrospective 1 day ago • 18
Jet-RL: Enabling On-Policy FP8 Reinforcement Learning with Unified Training and Rollout Precision Flow Paper • 2601.14243 • Published 8 days ago • 18
AR-Omni: A Unified Autoregressive Model for Any-to-Any Generation Paper • 2601.17761 • Published 3 days ago • 9
CGPT: Cluster-Guided Partial Tables with LLM-Generated Supervision for Table Retrieval Paper • 2601.15849 • Published 6 days ago • 12
DeepPlanning: Benchmarking Long-Horizon Agentic Planning with Verifiable Constraints Paper • 2601.18137 • Published 2 days ago • 14
Teaching Models to Teach Themselves: Reasoning at the Edge of Learnability Paper • 2601.18778 • Published 1 day ago • 25
daVinci-Dev: Agent-native Mid-training for Software Engineering Paper • 2601.18418 • Published 2 days ago • 112
The Script is All You Need: An Agentic Framework for Long-Horizon Dialogue-to-Cinematic Video Generation Paper • 2601.17737 • Published 3 days ago • 48
Scientific Image Synthesis: Benchmarking, Methodologies, and Downstream Utility Paper • 2601.17027 • Published 11 days ago • 36
Can LLMs Clean Up Your Mess? A Survey of Application-Ready Data Preparation with LLMs Paper • 2601.17058 • Published 6 days ago • 134
llm-semantic-router/mmbert-embed-32k-2d-matryoshka Sentence Similarity • 0.3B • Updated 2 days ago • 159 • 4
Quantized translategemma Collection Quickly tested with vLLM. Not fully compatible yet. • 7 items • Updated 9 days ago • 3
Quantized LFM2.5 Collection Verified models. Compatible with vLLM. • 14 items • Updated about 20 hours ago • 1
view article Article How We Built a Semantic Highlight Model To Save Token Cost for RAG 13 days ago • 61
DeepResearchEval: An Automated Framework for Deep Research Task Construction and Agentic Evaluation Paper • 2601.09688 • Published 14 days ago • 124