Running on CPU Upgrade Featured 3k The Smol Training Playbook 📚 3k The secrets to building world-class LLMs
facebook/webssl-dino300m-full2b-224 Image Feature Extraction • 0.3B • Updated Apr 24, 2025 • 1.25k • 11
Scale RAE Collection Collection for "Scaling Text-to-Image Diffusion Transformers with Representation Autoencoders" • 7 items • Updated 20 days ago • 3
RAE Collection Collection for Diffusion Transformers with Representation Autoencoders • 1 item • Updated Oct 14, 2025 • 11
OneFlow: Concurrent Mixed-Modal and Interleaved Generation with Edit Flows Paper • 2510.03506 • Published Oct 3, 2025 • 15
Learning to See Before Seeing: Demystifying LLM Visual Priors from Language Pre-training Paper • 2509.26625 • Published Sep 30, 2025 • 43
V-JEPA 2 Collection A frontier video understanding model developed by FAIR, Meta, which extends the pretraining objectives of https://ai.meta.com/blog/v-jepa-yann • 8 items • Updated Jun 13, 2025 • 192
Cosmos-Tokenize1 Collection A suite of image and video tokenizers • 9 items • Updated 18 days ago • 9
facebook/webssl-dino7b-full8b-518 Image Feature Extraction • 6B • Updated Apr 24, 2025 • 17 • 12
facebook/webssl-mae300m-full2b-224 Image Feature Extraction • 0.3B • Updated Apr 24, 2025 • 1.49k
facebook/webssl-dino3b-heavy2b-224 Image Feature Extraction • 3B • Updated Apr 24, 2025 • 5 • 1