metadata
datasets:
- weizechen/RL-Compositionality-Stage1-RFT-Data
base_model:
- meta-llama/Llama-3.1-8B-Instruct
The model after Stage 1 RFT.
datasets:
- weizechen/RL-Compositionality-Stage1-RFT-Data
base_model:
- meta-llama/Llama-3.1-8B-Instruct
The model after Stage 1 RFT.