File size: 245 Bytes
c443fa4 |
1 2 3 4 5 6 7 8 9 10 11 |
---
datasets:
- weizechen/RL-Compositionality-Stage1-RFT-Data
base_model:
- meta-llama/Llama-3.1-8B-Instruct
---
The model after Stage 1 RFT.
Paper: https://huggingface.co/papers/2509.25123
Code: https://github.com/PRIME-RL/RL-Compositionality |