RLHFlow/RewardModel-Mistral-7B-for-DPA-v1 Text Classification • 7B • Updated May 23, 2024 • 1.15k • 4
Running 3.65k The Ultra-Scale Playbook 🌌 3.65k The ultimate guide to training LLM on large GPU Clusters
deepseek-ai/DeepSeek-R1-Distill-Llama-70B Text Generation • 71B • Updated Feb 24, 2025 • 226k • • 738