·
AI & ML interests
None yet
Organizations
None yet
Preview
• Updated • 41
• 2
Viewer
• Updated • 1.41M • 8
Viewer
• Updated • 1.41M • 18
Viewer
• Updated • 1.41M • 8
zd21/ReST-MCTS_SciGLM-6B_Self-Rewarding-DPO_2nd
Viewer
• Updated • 1 • 18
zd21/ReST-MCTS_SciGLM-6B_ReST-MCTS_Policy_2nd
Viewer
• Updated • 40.9k • 8
zd21/ReST-MCTS_SciGLM-6B_ReST-EM-CoT_2nd
Viewer
• Updated • 28.9k • 7
zd21/ReST-MCTS_Mistral-MetaMATH-7b-Instruct_Self-Rewarding-DPO_2nd
Viewer
• Updated • 1 • 6
zd21/ReST-MCTS_Mistral-MetaMATH-7b-Instruct_ReST-MCTS_2nd
Viewer
• Updated • 26k • 8
zd21/ReST-MCTS_Mistral-MetaMATH-7b-Instruct_ReST-EM-CoT_2nd
Viewer
• Updated • 36.6k • 8
zd21/ReST-MCTS_Llama3-8b-Instruct_Self-Rewarding-DPO_2nd
Viewer
• Updated • 1 • 6
zd21/ReST-MCTS_Llama3-8b-Instruct_ReST-MCTS_Policy_2nd
Viewer
• Updated • 32.3k • 8
zd21/ReST-MCTS_Llama3-8b-Instruct_ReST-EM-CoT_2nd
Viewer
• Updated • 33.2k • 8
zd21/ReST-MCTS_SciGLM-6B_Self-Rewarding-DPO_1st
Viewer
• Updated • 33.5k • 9
zd21/ReST-MCTS_SciGLM-6B_ReST-MCTS_Policy_1st
Viewer
• Updated • 30.1k • 6
zd21/ReST-MCTS_SciGLM-6B_ReST-EM-CoT_1st
Viewer
• Updated • 55.8k • 7
zd21/ReST-MCTS_Mistral-MetaMATH-7b-Instruct_Self-Rewarding-DPO_1st
Viewer
• Updated • 1 • 9
zd21/ReST-MCTS_Mistral-MetaMATH-7b-Instruct_ReST-MCTS_1st
Viewer
• Updated • 38.7k • 5
zd21/ReST-MCTS_Mistral-MetaMATH-7b-Instruct_ReST-EM-CoT_1st
Viewer
• Updated • 74k • 6
zd21/ReST-MCTS_Llama3-8b-Instruct_Self-Rewarding-DPO_1st
Viewer
• Updated • 1 • 8
zd21/ReST-MCTS_Llama3-8b-Instruct_ReST-MCTS_Policy_1st
Viewer
• Updated • 33.7k • 6
zd21/ReST-MCTS_Llama3-8b-Instruct_ReST-EM-CoT_1st
Viewer
• Updated • 73.1k • 8
Viewer
• Updated • 474k • 12
• 2
Viewer
• Updated • 91.8k • 200
• 7
Viewer
• Updated • 91.8k • 23
• 3
zd21/ReST-MCTS-Llama3-8b-Instruct-Policy-1st
Viewer
• Updated • 33.7k • 11
• 7
zd21/ReST-MCTS-Llama3-8b-Instruct-PRM-1st
Viewer
• Updated • 673k • 38
• 9