Sungbin Han
SungbiinHan
ยท
AI & ML interests
None yet
Recent Activity
liked
a dataset 1 day ago
zhuzilin/dapo-math-17k liked
a dataset 1 day ago
BytedTsinghua-SIA/DAPO-Math-17k upvoted a paper 15 days ago
Rethinking the Trust Region in LLM Reinforcement Learning Organizations
None yet