Ashima/qwen3_0.6b-rlvr_task270_csrg_counterfactual_context_generation Viewer • Updated about 17 hours ago • 1k
Ashima/qwen3_0.6b-rlvr_task218_rocstories_swap_order_answer_generation Viewer • Updated about 17 hours ago • 1k
Ashima/qwen3_0.6b-rlvr_task217_rocstories_ordering_answer_generation Viewer • Updated about 18 hours ago • 1k
Ashima/qwen3_0.6b-rlvr_task966_ruletaker_fact_checking_based_on_given_context Viewer • Updated about 24 hours ago • 300 • 2
Ashima/qwen3_0.6b-rlvr_task863_asdiv_multiop_question_answering Viewer • Updated about 24 hours ago • 275 • 2
Ashima/qwen3_0.6b-rlvr_task828_copa_commonsense_cause_effect Viewer • Updated about 24 hours ago • 996 • 2
Ashima/qwen3_0.6b-rlvr_task717_mmmlu_answer_generation_logical_fallacies Viewer • Updated 1 day ago • 170 • 2