Diversity-Incentivized Exploration for Versatile Reasoning
Zican Hu
huzican
AI & ML interests: None yet
Recent Activity
upvoted a paper about 3 hours ago
Dr. Kernel: Reinforcement Learning Done Right for Triton Kernel Generations
updated a dataset 3 days ago
huzican/game_multiturn_less5_singleturn
published a dataset 3 days ago
huzican/game_multiturn_less5_singleturn