Running on CPU Upgrade 136 The Synthetic Data Playbook: Generating Trillions of the Finest Tokens 📝 136 Explore synthetic data experiments with an interactive bookshelf
Nemotron-Post-Training-v3 Collection Collection of datasets used in the post-training phase of Nemotron Nano and Super v3. • 19 items • Updated about 3 hours ago • 71
Nemotron-Pre-Training-Datasets Collection Large scale pre-training datasets used in the Nemotron family of models. • 12 items • Updated 4 days ago • 103
RedSage Benchmarks Collection List of Cybersecurity Benchmarks Datasets. • 7 items • Updated 29 days ago
RedSage Models Collection Continued Pretraining and Post-trained RedSage Models. • 5 items • Updated 29 days ago