Free-form datasets, human annotations, and sample-level model outputs for "Answer Matching Outperforms Multiple Choice for Language Model Evaluation"
Nikhil Chandak
nikhilchandak
Recent Activity
Updated a dataset about 1 hour ago: nikhilchandak/OpenForesight
Published a dataset about 4 hours ago: nikhilchandak/OpenForesight
Liked a dataset 2 months ago: brendel-group/MATH-Beyond