Jing Wang
Ava007a
AI & ML interests
None yet
Organizations
None yet
LLM - Agentic RL
-
The Landscape of Agentic Reinforcement Learning for LLMs: A Survey
Paper • 2509.02547 • Published • 227 -
VerlTool: Towards Holistic Agentic Reinforcement Learning with Tool Use
Paper • 2509.01055 • Published • 76 -
SimpleTIR: End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning
Paper • 2509.02479 • Published • 83
MLLM - Agentic RL
LLM - Agentic RL
-
The Landscape of Agentic Reinforcement Learning for LLMs: A Survey
Paper • 2509.02547 • Published • 227 -
VerlTool: Towards Holistic Agentic Reinforcement Learning with Tool Use
Paper • 2509.01055 • Published • 76 -
SimpleTIR: End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning
Paper • 2509.02479 • Published • 83
models
0
None public yet
datasets
0
None public yet