Carnegie Mellon University
university
Verified
AI & ML interests
None defined yet.
Recent Activity
View all activity
Papers
Hardening Agent Benchmarks with Adversarial Hacker-Fixer Loops
K-BrowseComp: A Web Browsing Agent Benchmark Grounded in Korean Contexts
models 0
None public yet
datasets 0
None public yet