1 2 1

Pratyush Ranjan Tiwari PRO

pratyushrt

https://www.pratyush.site/

AI & ML interests

Reinforcements Learning, Privacy, Post-training LLMs, SLMs

Recent Activity

liked a Space 1 day ago

HuggingFaceTB/smol-training-playbook

updated a Space 4 months ago

eternisai/README

authored a paper 5 months ago

Hard Examples Are All You Need: Maximizing GRPO Post-Training Under Annotation Budgets

View all activity

Organizations

liked a Space 1 day ago

The Smol Training Playbook

📚

3.02k

The secrets to building world-class LLMs

updated a Space 4 months ago

README

🚀

authored a paper 5 months ago

Hard Examples Are All You Need: Maximizing GRPO Post-Training Under Annotation Budgets

Paper • 2508.14094 • Published Aug 15, 2025 • 1

upvoted a paper 5 months ago

Hard Examples Are All You Need: Maximizing GRPO Post-Training Under Annotation Budgets

Paper • 2508.14094 • Published Aug 15, 2025 • 1

updated 5 models 6 months ago

New activity in eternisai/Anonymizer-0.6B 6 months ago

License?

#1 opened 6 months ago by

ramblingcoder

published an article 6 months ago

Article

Anonymizer SLM series: Privacy-first PII replacement models (0.6B/1.7B/4B)

Aug 27, 2025

•

upvoted a collection 6 months ago

Anonymizer Model Series

Collection

SLMs for semantically similar replacement of PII to provide better end-user privacy. Three model sizes (0.6B, 1.7B, 4B) for different devices. • 3 items • Updated Aug 27, 2025 • 4

Pratyush Ranjan Tiwari PRO

AI & ML interests

Recent Activity

Organizations

pratyushrt's activity

The Smol Training Playbook

README

License?

Anonymizer SLM series: Privacy-first PII replacement models (0.6B/1.7B/4B)