distillslm/10000_code_supervised_kd_sft_Qwen2.5-3B-Instruct_from_Qwen2.5-7B-Instruct Text Generation • 3B • Updated Feb 7, 2025 • 4
distillslm/5000_code_supervised_kd_sft_Qwen2.5-3B-Instruct_from_Qwen2.5-7B-Instruct Text Generation • 3B • Updated Feb 7, 2025 • 4
distillslm/final_math_supervised_kd_sft_Qwen2.5-3B-Instruct_from_Qwen2.5-7B-Instruct Updated Feb 7, 2025
distillslm/10000_math_supervised_kd_sft_Qwen2.5-3B-Instruct_from_Qwen2.5-7B-Instruct Text Generation • 3B • Updated Feb 7, 2025 • 4
distillslm/5000_math_supervised_kd_sft_Qwen2.5-3B-Instruct_from_Qwen2.5-7B-Instruct Text Generation • 3B • Updated Feb 6, 2025 • 5
distillslm/0_code_supervised_kd_sft_Qwen2.5-3B-Instruct_from_Qwen2.5-7B-Instruct Text Generation • 3B • Updated Feb 6, 2025 • 5
distillslm/0_math_supervised_kd_sft_Qwen2.5-3B-Instruct_from_Qwen2.5-7B-Instruct Text Generation • 3B • Updated Feb 6, 2025 • 6