Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
tinyBenchmarks
community
https://github.com/felipemaiapolo/tinyBenchmarks
Activity Feed
Follow
30
AI & ML interests
None defined yet.
Recent Activity
borgr
submitted
a paper
17 days ago
General Agent Evaluation
moonfolk
authored
a paper
9 months ago
Revisiting Reinforcement Learning for LLM Reasoning from A Cross-Domain Perspective
borgr
authored
a paper
11 months ago
Pretraining Language Models for Diachronic Linguistic Change Discovery
View all activity
Team members
4
models
0
None public yet
datasets
7
Sort: Recently updated
tinyBenchmarks/tinyMMLU
Viewer
•
Updated
Jul 8, 2024
•
385
•
13.7k
•
24
tinyBenchmarks/tinyHellaswag
Viewer
•
Updated
May 25, 2024
•
50k
•
2.6k
•
5
tinyBenchmarks/tinyTruthfulQA
Preview
•
Updated
May 25, 2024
•
1.9k
•
4
tinyBenchmarks/tinyWinogrande
Preview
•
Updated
May 25, 2024
•
2.16k
•
5
tinyBenchmarks/tinyGSM8k
Preview
•
Updated
May 25, 2024
•
6.79k
•
9
tinyBenchmarks/tinyAI2_arc
Preview
•
Updated
May 25, 2024
•
2.56k
•
4
tinyBenchmarks/tinyAlpacaEval
Viewer
•
Updated
Apr 19, 2024
•
100
•
147
•
7