File size: 245 Bytes
c443fa4
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
---
datasets:
- weizechen/RL-Compositionality-Stage1-RFT-Data
base_model:
- meta-llama/Llama-3.1-8B-Instruct
---
The model after Stage 1 RFT.

Paper: https://huggingface.co/papers/2509.25123

Code: https://github.com/PRIME-RL/RL-Compositionality