Qwen/Qwen3.5-397B-A17B-FP8 Image-Text-to-Text • 403B • Updated about 7 hours ago • 176k • 105
Article OpenReasoning-Nemotron: A Family of State-of-the-Art Distilled Reasoning Models Jul 18, 2025 • 50
nvidia/Llama-Nemotron-Post-Training-Dataset Viewer • Updated May 8, 2025 • 3.91M • 2.06k • 642
The Ultra-Scale Playbook 🌌 Running • 3.71k • The ultimate guide to training LLMs on large GPU clusters
Article Cosmopedia: how to create large-scale synthetic data for pre-training Large Language Models Mar 20, 2024 • 110
OpenLLM-France/Lucie-7B-Instruct-human-data Text Generation • 7B • Updated Mar 19, 2025 • 292 • 7
DolphinLabeled Datasets Collection Eric Hartford has added labels to help you filter datasets, for your pleasure. • 5 items • Updated Jan 6, 2025 • 15