koutch/qwen_qwen3-instruct-4b_train_grpo_v2_train_code Text Generation • 4B • Updated about 4 hours ago
koutch/qwen_qwen3-instruct-4b_train_grpo_v2_train_code Text Generation • 4B • Updated about 4 hours ago
koutch/qwen_falcon_qwen3-instruct-4b_train_grpo_v1_2.json Text Generation • 4B • Updated 1 day ago • 30
koutch/qwen_falcon_qwen3-instruct-4b_train_grpo_v1_2.json Text Generation • 4B • Updated 1 day ago • 30