| datasets: | |
| - weizechen/RL-Compositionality-Stage1-RFT-Data | |
| base_model: | |
| - meta-llama/Llama-3.1-8B-Instruct | |
| The model after Stage 1 RFT. | |
| Paper: https://huggingface.co/papers/2509.25123 | |
| Code: https://github.com/PRIME-RL/RL-Compositionality |
| datasets: | |
| - weizechen/RL-Compositionality-Stage1-RFT-Data | |
| base_model: | |
| - meta-llama/Llama-3.1-8B-Instruct | |
| The model after Stage 1 RFT. | |
| Paper: https://huggingface.co/papers/2509.25123 | |
| Code: https://github.com/PRIME-RL/RL-Compositionality |