Beyond Language Modeling: An Exploration of Multimodal Pretraining Paper • 2603.03276 • Published 6 days ago • 77 • 5
Running 376 Visualize Dataset (v2.0+ latest dataset format) 💻 376 Visualize LeRobot datasets in an interactive web tool
CUDA Agent: Large-Scale Agentic RL for High-Performance CUDA Kernel Generation Paper • 2602.24286 • Published 10 days ago • 80 • 3
Nanbeige4.1-3B: A Small General Model that Reasons, Aligns, and Acts Paper • 2602.13367 • Published 24 days ago • 31 • 3
view article Article Did GPT 5.2 make a breakthrough discovery in theoretical physics? 18 days ago • 60
view article Article GGML and llama.cpp join HF to ensure the long-term progress of Local AI +4 18 days ago • 479
When Models Manipulate Manifolds: The Geometry of a Counting Task Paper • 2601.04480 • Published Jan 8 • 4
When Models Manipulate Manifolds: The Geometry of a Counting Task Paper • 2601.04480 • Published Jan 8 • 4 • 1
Green-VLA: Staged Vision-Language-Action Model for Generalist Robots Paper • 2602.00919 • Published Jan 31 • 313 • 8