Running on CPU Upgrade Featured 3.02k The Smol Training Playbook 📚 3.02k The secrets to building world-class LLMs
Hard Examples Are All You Need: Maximizing GRPO Post-Training Under Annotation Budgets Paper • 2508.14094 • Published Aug 15, 2025 • 1
Hard Examples Are All You Need: Maximizing GRPO Post-Training Under Annotation Budgets Paper • 2508.14094 • Published Aug 15, 2025 • 1
view article Article Anonymizer SLM series: Privacy-first PII replacement models (0.6B/1.7B/4B) Aug 27, 2025 • 4
Anonymizer Model Series Collection SLMs for semantically similar replacement of PII to provide better end-user privacy. Three model sizes (0.6B, 1.7B, 4B) for different devices. • 3 items • Updated Aug 27, 2025 • 4