Collection dedicated to all the datasets, checkpoints and any additional artifacts for Tiny Think
Bojan Jakimovski
Shekswess
AI & ML interests
AWS Ambassador | Machine Learning Lead | College Professor | GenAI | MLOps
Recent Activity
liked
a dataset
1 day ago
PleIAs/common_corpus
liked
a Space
11 days ago
eliebak/sparsity-viz
updated
a dataset
15 days ago
Shekswess/fineweb-edu-700m
Organizations
Tiny Reasoning Language Model
Collection dedicated to the development of the Tiny Reasoning Language Model (trlm)
-
Shekswess/trlm-135m
Text Generation • 0.1B • Updated • 26 • 46 -
Shekswess/trlm-stage-3-dpo-final-2
Text Generation • 0.1B • Updated • 2 • 1 -
Shekswess/trlm-stage-2-sft-final-2
Text Generation • 0.1B • Updated • 2 • 1 -
Shekswess/trlm-stage-1-sft-final-2
Text Generation • 0.1B • Updated • 1 • 1
Tiny Think
Collection dedicated to all the datasets, checkpoints and any additional artifacts for Tiny Think
Tiny Reasoning Language Model
Collection dedicated to the development of the Tiny Reasoning Language Model (trlm)
-
Shekswess/trlm-135m
Text Generation • 0.1B • Updated • 26 • 46 -
Shekswess/trlm-stage-3-dpo-final-2
Text Generation • 0.1B • Updated • 2 • 1 -
Shekswess/trlm-stage-2-sft-final-2
Text Generation • 0.1B • Updated • 2 • 1 -
Shekswess/trlm-stage-1-sft-final-2
Text Generation • 0.1B • Updated • 1 • 1
models
31
Shekswess/tiny-think-dpo-math-stem-apo_zero-beta0_3-lr3e-6-e1-bs8
Text Generation
•
0.1B
•
Updated
•
65
Shekswess/tiny-think-dpo-math-stem-apo_zero-beta1-lr3e-6-e1-bs8
Text Generation
•
0.1B
•
Updated
•
50
Shekswess/tiny-think-dpo-math-stem-apo_zero-beta0_5-lr3e-6-e1-bs8
Text Generation
•
0.1B
•
Updated
•
52
Shekswess/tiny-think-dpo-math-stem-dpo-beta1-lr3e-6-e1-bs8
Text Generation
•
0.1B
•
Updated
•
61
Shekswess/tiny-think-dpo-math-stem-dpo-beta1-lr5e-6-e1-bs8
Text Generation
•
0.1B
•
Updated
•
58
Shekswess/tiny-think-dpo-math-stem-dpo-beta1-lr1e-6-e1-bs8
Text Generation
•
0.1B
•
Updated
•
56
Shekswess/tiny-think-dpo-math-stem-dpo-beta2-lr2e-6-e1-bs8
Text Generation
•
0.1B
•
Updated
•
53
Shekswess/tiny-think-dpo-math-stem-dpo-beta1-lr2e-6-e1-bs8
Text Generation
•
0.1B
•
Updated
•
52
Shekswess/tiny-think-dpo-math-stem-dpo-beta0-5-lr2e-6-e1-bs8
Text Generation
•
0.1B
•
Updated
•
108
Shekswess/tiny-think-sft-math-stem-loss-nll-bf16-e2-bs8
Text Generation
•
0.1B
•
Updated
•
56
datasets
35
Shekswess/fineweb-edu-700m
Viewer
•
Updated
•
681k
•
30
Shekswess/tiny-think-sft-math-n-stem
Viewer
•
Updated
•
29.1k
•
60
Shekswess/tiny-think-dpo-math-n-stem
Viewer
•
Updated
•
2.86k
•
51
Shekswess/trlm-sft-stage-1-final-2
Viewer
•
Updated
•
58k
•
7
Shekswess/trlm-sft-stage-2-final-2
Viewer
•
Updated
•
78k
•
190
Shekswess/trlm-dpo-stage-3-final-2
Viewer
•
Updated
•
50k
•
34
Shekswess/customer-support
Viewer
•
Updated
•
1k
•
22
•
1
Shekswess/scientific-research
Viewer
•
Updated
•
1k
•
10
•
4
Shekswess/technical-manuals
Viewer
•
Updated
•
1k
•
22
•
4
Shekswess/legal-documents
Viewer
•
Updated
•
1k
•
63
•
5