weizechen
/

RL-Compositionality-Stage-1-Model

Model card Files Files and versions

RL-Compositionality-Stage-1-Model / README.md

weizechen's picture

Create README.md

c443fa4 verified about 2 months ago

|

history blame contribute delete

245 Bytes

	---
	datasets:
	- weizechen/RL-Compositionality-Stage1-RFT-Data
	base_model:
	- meta-llama/Llama-3.1-8B-Instruct
	---
	The model after Stage 1 RFT.

	Paper: https://huggingface.co/papers/2509.25123

	Code: https://github.com/PRIME-RL/RL-Compositionality