ALMA-13B-Pretrain + Separate Training TongZheng1999/alma-13b-sft-50-languages-ar-max-tokens-512 Updated Feb 23, 2025 • 5 TongZheng1999/alma-13b-sft-50-languages-lt-max-tokens-512 Updated Feb 21, 2025 • 1 TongZheng1999/alma-13b-sft-50-languages-nl-max-tokens-512 Updated Feb 21, 2025 • 1 TongZheng1999/alma-13b-sft-50-languages-bg-max-tokens-512 Updated Feb 21, 2025 • 2
ALMA-13B-Pretrain + Group Training TongZheng1999/alma-13b-sft-group-4-max-tokens-512 Updated Feb 21, 2025 • 2 TongZheng1999/alma-13b-sft-group-6-max-tokens-512 Updated Feb 21, 2025 • 1 TongZheng1999/alma-13b-sft-group-3-max-tokens-512 Updated Feb 21, 2025 • 1 TongZheng1999/alma-13b-sft-group-1-max-tokens-512 Updated Feb 21, 2025 • 3
ALMA-13B-Pretrain + Separate Training TongZheng1999/alma-13b-sft-50-languages-ar-max-tokens-512 Updated Feb 23, 2025 • 5 TongZheng1999/alma-13b-sft-50-languages-lt-max-tokens-512 Updated Feb 21, 2025 • 1 TongZheng1999/alma-13b-sft-50-languages-nl-max-tokens-512 Updated Feb 21, 2025 • 1 TongZheng1999/alma-13b-sft-50-languages-bg-max-tokens-512 Updated Feb 21, 2025 • 2
ALMA-13B-Pretrain + Group Training TongZheng1999/alma-13b-sft-group-4-max-tokens-512 Updated Feb 21, 2025 • 2 TongZheng1999/alma-13b-sft-group-6-max-tokens-512 Updated Feb 21, 2025 • 1 TongZheng1999/alma-13b-sft-group-3-max-tokens-512 Updated Feb 21, 2025 • 1 TongZheng1999/alma-13b-sft-group-1-max-tokens-512 Updated Feb 21, 2025 • 3
TongZheng1999/iter_1_reinforce_baseline_per_sample_200epoch_strong_init_step_150_processed_Merge_f_by_judge Viewer • Updated Apr 11 • 22.1k • 62
TongZheng1999/iter_1_reinforce_baseline_per_sample_200epoch_strong_init_step_150_processed_filtered_by_judge Viewer • Updated Apr 11 • 5.43k • 13
TongZheng1999/iter_1_reinforce_baseline_per_sample_200epoch_strong_init_step_150_processed_Merge Viewer • Updated Apr 11 • 33.4k • 36
TongZheng1999/iter_1_reinforce_baseline_per_sample_200epoch_strong_init_step_150_processed Viewer • Updated Apr 11 • 16.7k • 15
TongZheng1999/iter_1_reinforce_baseline_per_sample_200epoch_strong_init_step_150 Viewer • Updated Apr 10 • 16.7k • 11
TongZheng1999/Bespoke-Stratos-17k-Init-Model-Final-Reinforce-Baseline-Iter1-Strong-Init-Filtered-Merged Viewer • Updated Apr 7 • 46.5k • 12
TongZheng1999/iter_1_reinforce_baseline_per_sample_200epoch_strong_init_step_150_filtered Viewer • Updated Apr 7 • 13.1k • 10