DCAgent2/glm4_7-fixthink-codecontestStep141
8B • Updated
• 8
DCAgent2/codecontest-8B-overlongFilter-DrGRPO-step150
8B • Updated
• 10
DCAgent2/bs64_rloo_n_noct_stri_micr_auto_conv_pref_model_r2e-120
8B • Updated
• 42
DCAgent2/nl2bashGPT5CodexPassed-qwen3-8b-8nodes-sync-logtest
Updated
DCAgent2/swesmith-stack-over5050
Text Generation
• 308k • Updated
• 5
DCAgent2/swesmith-nl2bashseq
308k • Updated
• 3
DCAgent2/stack-swesmithseq
Text Generation
• 308k • Updated
• 5
DCAgent2/stack-bugs-undr3070
Text Generation
• 308k • Updated
• 3
DCAgent2/nl2bash-swesmithseq
308k • Updated
• 2
DCAgent2/nl2bash-swesmith-undr7030
308k • Updated
• 1
DCAgent2/nl2bash-swesmith-reason
Text Generation
• 308k • Updated
• 4
DCAgent2/nl2bash-swesmith-over5050
Text Generation
• 308k • Updated
• 7
DCAgent2/nl2bash-stack-bugs-undr503020
Text Generation
• 308k • Updated
• 2
DCAgent2/nl2bash-stack-bugs-undr203050
Updated
DCAgent2/bugs-swesmith-undr7030
308k • Updated
• 1
DCAgent2/bugs-swesmith-over5050
308k • Updated
• 1
DCAgent2/nl2bash-stack-bugs-over333
308k • Updated
• 15
DCAgent2/swesmith-bugsseq
Text Generation
• 308k • Updated
• 3
DCAgent2/bugs-swesmithseq
Text Generation
• 308k • Updated
• 3
DCAgent2/swesmith-stack-reason
Text Generation
• 308k • Updated
• 5
DCAgent2/bugs-swesmith-reason
Text Generation
• 308k • Updated
• 4
DCAgent2/swesmith-stackseq
Text Generation
• 308k • Updated
• 6
DCAgent2/swesmith-stack-undr7030
Text Generation
• 308k • Updated
• 2
DCAgent2/stack-bugs-undr7030
Text Generation
• 308k • Updated
• 4
Text Generation
• 308k • Updated
• 5
DCAgent2/stack-bugs-over5050
Text Generation
• 308k • Updated
• 2
DCAgent2/nl2bash-stack-bugsseq
Text Generation
• 308k • Updated
• 90
DCAgent2/stack-bugsshuffle
Text Generation
• 308k • Updated
• 8
DCAgent2/bugs-stack-nl2bashseq
Text Generation
• 308k • Updated
• 9
DCAgent2/nl2bash-stack-bugsshuffle
Text Generation
• 308k • Updated
• 6