Building on HF
·
AI & ML interests
Reward models
Organizations
reciprocate/kaggle-lmarena-synth-50k
Viewer
• Updated • 50.7k • 6
reciprocate/ultra-annotated-200k
Viewer
• Updated • 208k • 11
reciprocate/dpo-objective-v0.2
Viewer
• Updated • 384 • 7
reciprocate/tinygsm_interpreter_1M
Viewer
• Updated • 1M • 7
Viewer
• Updated • 541 • 6
reciprocate/dpo_mix-zero-math-untoxic
Viewer
• Updated • 6.91k • 8
reciprocate/dpo_mix-7k_untoxic
Viewer
• Updated • 7.29k • 8
• 2
reciprocate/tinygsm_mixtral_12M
Viewer
• Updated • 12M • 36
• 1
reciprocate/dpo_ultra-capybara-code_filtered-best
Viewer
• Updated • 35.2k • 15
• 1
Viewer
• Updated • 6.17k • 33
• 2
reciprocate/dpo_ultra-capybara_filtered-best
Viewer
• Updated • 25.6k • 7
reciprocate/tinygsm_mixtral_up_dedup
Viewer
• Updated • 1.68M • 11
reciprocate/ultrafeedback_orca_math_cleaned_high_dpo
Viewer
• Updated • 48.3k • 12
• 2
reciprocate/ultrafeedback_cleaned_high_dpo
Viewer
• Updated • 40k • 8
• 2
reciprocate/ultrafeedback_orca_math_dpo
Viewer
• Updated • 73.8k • 9
• 2
reciprocate/ultrafeedback_cleaned_v2_dpo
Viewer
• Updated • 58.6k • 16
• 1
reciprocate/math_dpo_pairs
Viewer
• Updated • 4.38k • 13
• 5
reciprocate/pku_safer_dpo_pairs
Viewer
• Updated • 51.8k • 11
reciprocate/pku_better_dpo_pairs
Viewer
• Updated • 330k • 10
reciprocate/orca_dpo_pairs
Viewer
• Updated • 14.8k • 9
Viewer
• Updated • 1k • 13
reciprocate/gsm8k-test_critiques
Viewer
• Updated • 753 • 11
• 2
reciprocate/gsm8k_train_pairwise
Viewer
• Updated • 7.04k • 13
• 4
reciprocate/gsm8k_pairwise
Viewer
• Updated • 128 • 8
• 2
Viewer
• Updated • 13k • 10
Viewer
• Updated • 10.5k • 15
• 1
Viewer
• Updated • 2.37k • 11
reciprocate/vicuna-fair-eval_format-oa
Viewer
• Updated • 66 • 12
reciprocate/vicuna-fair-eval
Viewer
• Updated • 66 • 16
reciprocate/vicuna_fair_eval_dataset
Viewer
• Updated • 66 • 11