arxiv:2509.25133
YuxianJiang
Linn3a3
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
about 2 months ago
Conditional Advantage Estimation for Reinforcement Learning in Large
Reasoning Models
upvoted
a
paper
about 2 months ago
Rethinking Entropy Regularization in Large Reasoning Models
authored
a paper
about 2 months ago
S-Agents: self-organizing agents in open-ended environment
Organizations
None yet