Hugging Face
Xinyu Zhu
TianHongZXY
8 followers · 8 following
https://zhuxinyu.top
AI & ML interests
Large Language Models; Reasoning; Reinforcement Learning
Recent Activity
Upvoted a collection about 11 hours ago: CHIMERA
Liked a model about 11 hours ago: TianHongZXY/CHIMERA-4B-RL
Liked a model about 11 hours ago: TianHongZXY/CHIMERA-4B-SFT
TianHongZXY's models (12)
TianHongZXY/CHIMERA-4B-SFT • 4B • Updated 5 days ago • 50 • 2
TianHongZXY/CHIMERA-4B-RL • 4B • Updated 5 days ago • 28 • 2
TianHongZXY/Qwen3-4B-NSR • 4B • Updated Dec 6, 2025 • 1
TianHongZXY/Qwen2.5-Math-7B-GRPO • 8B • Updated Jul 28, 2025 • 3
TianHongZXY/OpenR1-Math-46k-8192-Qwen2.5-7B-Instruct-GRPO-clip_0.28 • Updated Jul 8, 2025
TianHongZXY/Qwen2.5-Math-7B-W-REINFORCE • 8B • Updated Jun 1, 2025 • 2 • 1
TianHongZXY/Qwen3-4B-GRPO • 4B • Updated May 31, 2025 • 6
TianHongZXY/Qwen3-4B-PPO • 4B • Updated May 31, 2025 • 1
TianHongZXY/Qwen3-4B-PSR • 4B • Updated May 31, 2025 • 2 • 1
TianHongZXY/Qwen2.5-Math-7B-PPO • 8B • Updated May 31, 2025 • 3
TianHongZXY/Qwen2.5-Math-7B-PSR • 8B • Updated May 31, 2025 • 3
TianHongZXY/Qwen2.5-Math-7B-NSR • 8B • Updated May 30, 2025 • 3 • 2