LiveMCP-101: Stress Testing and Diagnosing MCP-enabled Agents on Challenging Queries • Paper 2508.15760 • Published Aug 21
Preference Learning Unlocks LLMs' Psycho-Counseling Skills • Paper 2502.19731 • Published Feb 27
CBT-Bench: Evaluating Large Language Models on Assisting Cognitive Behavior Therapy • Paper 2410.13218 • Published Oct 17, 2024