arxiv:2512.14614
wenq
wenqsun
AI & ML interests
Machine learning, computer vision
Recent Activity
upvoted a paper 22 days ago
WorldCompass: Reinforcement Learning for Long-Horizon World Models upvoted a paper 23 days ago
On the Entropy Dynamics in Reinforcement Fine-Tuning of Large Language Models Organizations
None yet