Kyle1668/sfm-midtraining_mix_unfiltered
Text Generation
•
7B
•
Updated
•
414
Note Our "Unfiltered" model, trained on 550B tokens without any interventions
Note Our "Filtered" model, where almost all discusison of AI has been removed
Note Our "Unfiltered + Synthetic Misalignment" model, where 0.8% of midtraining is composed of synthetic misalignment discourse. Pretraining is filtered as well, but has no upsampling.