Lia Kyle's picture

Lia Kyle

liakyle

·

AI & ML interests

None yet

Recent Activity

updated a Space 17 days ago

liakyle/liakyle

published a Space 17 days ago

liakyle/liakyle

reacted to sergiopaniego's post with 🚀 17 days ago

Meet the Post-Training Toolkit (PTT), which easily integrates with TRL via a single callback, by Aditya Challapally (@microsoft): 🔍 Detects training issues early 🛠 Lets you intervene safely 📊 Keeps long training runs stable, auditable & efficient Microsoft blog: https://devblogs.microsoft.com/engineering-at-microsoft/diagnosing-instability-in-production-scale-agent-rl/ Integration guide: https://huggingface.co/docs/trl/main/en/ptt_integration Code: https://github.com/microsoft/post-training-toolkit

View all activity

Organizations

None yet

updated a Space 17 days ago

Liakyle

published a Space 17 days ago

Liakyle

reacted to sergiopaniego's post with 🚀 17 days ago

Post

533

Meet the Post-Training Toolkit (PTT), which easily integrates with TRL via a single callback, by Aditya Challapally ( @microsoft ):

🔍 Detects training issues early
🛠 Lets you intervene safely
📊 Keeps long training runs stable, auditable & efficient

Microsoft blog: https://devblogs.microsoft.com/engineering-at-microsoft/diagnosing-instability-in-production-scale-agent-rl/

Integration guide: https://huggingface.co/docs/trl/main/en/ptt_integration

Code: https://github.com/microsoft/post-training-toolkit

reacted to sergiopaniego's post with 🔥 17 days ago

Post

549

The latest piece by @MiniMax-AI is a must-read.

It tries to break the impossible triangle of agent RL: throughput × stability × flexibility.

A lot to learn here, go read it 🫵
https://huggingface.co/blog/MiniMax-AI/forge-scalable-agent-rl-framework-and-algorithm