DCAgent/eval-GLM-4.6-stackexchange-overflow-sandboxes-32eps-65k-reasoning_num-train-epoc9b2dd9fa Viewer • Updated about 4 hours ago • 29 • 15
DCAgent/eval-GLM-4.6-stackexchange-overflow-sandboxes-32eps-65k-reasoning_adam-beta1_0-986b6fefd Viewer • Updated about 6 hours ago • 581 • 19
DCAgent/eval-glm46-swegym-tasks-maxeps-131k_16concurrency_eval_ctx32k_OpenThoughts-TB-dev Updated about 9 hours ago • 4
DCAgent/eval-GLM-4.6-stackexchange-overflow-sandboxes-32eps-65k-reasoning_num-train-epoc7bd19372 Viewer • Updated about 9 hours ago • 522 • 11
DCAgent/eval-GLM-4.6-stackexchange-overflow-sandboxes-32eps-65k-reasoning_adam-beta1_0-93b7ec80c Viewer • Updated about 23 hours ago • 390 • 16
DCAgent/eval-NVIDIA-Nemotron-3-Nano-30B-A3B-BF16_16concurrency_eval_ctx131k_terminal-bench-2.0 Viewer • Updated 1 day ago • 261 • 7
DCAgent/eval-GLM-4.6-stackexchange-overflow-sandboxes-32eps-65k-reasoning_adam-beta1_0-99a1741f7 Viewer • Updated 1 day ago • 415 • 10
DCAgent/eval-NVIDIA-Nemotron-3-Nano-30B-A3B-BF16_16concurrency_eval_ctx131k_swebench-vera9b71b18 Viewer • Updated 1 day ago • 300 • 4