Bo Liu
Benjamin-eecs
AI & ML interests
Reinforcement Learning, Reasoning, Machine Learning Systems
Recent Activity
upvoted a paper about 2 hours ago
Reasoning over mathematical objects: on-policy reward modeling and test time aggregation upvoted a paper about 2 months ago
Paying Less Generalization Tax: A Cross-Domain Generalization Study of RL Training for LLM Agents liked a dataset 3 months ago
facebook/principia-bench