AI & ML interests
RL for LLMs/CodeLLMs
Organizations
datasets 13
reshinthadith/math12k-stage3
• Updated • 6k • 38
reshinthadith/math12k-stage2
• Updated • 4k • 32
reshinthadith/math12k-stage1
• Updated • 2k • 61
reshinthadith/the-stack-mujoco-xml
• Updated • 48.3k • 15
• 1
reshinthadith/WizardLM_evol_instruct_V2_code_filtered
• Updated • 138k • 19
• 1
reshinthadith/basic_code_ppl_eval
• Updated • 8.73k • 190
• 4
Updated • 10
reshinthadith/2048_has_code_filtered_base_code_review_python_based_on_property
• Updated • 6.4k • 11
reshinthadith/2048_has_code_filtered_base_code_review_python
• Updated • 6.4k • 11
reshinthadith/dfg_augmented_mbpp
• Updated • 95 • 42