Chujie Zheng's picture

Chujie Zheng

chujiezheng

·

https://chujiezheng.github.io/

AI & ML interests

Large Language Models

Recent Activity

upvoted a paper 7 days ago

Soft Adaptive Policy Optimization

authored a paper 8 days ago

Soft Adaptive Policy Optimization

authored a paper 8 days ago

Stabilizing Reinforcement Learning with LLMs: Formulation and Practices

View all activity

Organizations

upvoted a paper 7 days ago

Soft Adaptive Policy Optimization

Paper • 2511.20347 • Published 14 days ago • 35

authored 2 papers 8 days ago

Soft Adaptive Policy Optimization

Paper • 2511.20347 • Published 14 days ago • 35

Stabilizing Reinforcement Learning with LLMs: Formulation and Practices

Paper • 2512.01374 • Published 9 days ago • 83

upvoted a paper 8 days ago

Stabilizing Reinforcement Learning with LLMs: Formulation and Practices

Paper • 2512.01374 • Published 9 days ago • 83

liked 16 models about 2 months ago

Qwen/Qwen3-VL-32B-Thinking-FP8

Image-Text-to-Text • 33B • Updated 13 days ago • 9.03k • 18

Qwen/Qwen3-VL-2B-Thinking

Image-Text-to-Text • 2B • Updated Oct 20 • 36.7k • 88

Qwen/Qwen3-VL-2B-Instruct-FP8

Image-Text-to-Text • 2B • Updated Oct 20 • 10.5k • 28

Qwen/Qwen3-VL-2B-Thinking-FP8

Image-Text-to-Text • 2B • Updated 13 days ago • 6.29k • 20

Qwen/Qwen3-VL-32B-Thinking

Image-Text-to-Text • 33B • Updated Oct 21 • 491k • 68

Qwen/Qwen3-VL-32B-Instruct-FP8

Image-Text-to-Text • 33B • Updated Oct 22 • 155k • 27

Qwen/Qwen3-VL-2B-Instruct

Image-Text-to-Text • 2B • Updated Oct 23 • 504k • 223

Qwen/Qwen3-VL-32B-Instruct

Image-Text-to-Text • 33B • Updated Oct 21 • 1.12M • 135

Qwen/Qwen3-4B-Thinking-2507-FP8

Text Generation • 4B • Updated Aug 6 • 184k • 40

Qwen/Qwen3-4B-Thinking-2507

Text Generation • 4B • Updated Aug 6 • 716k • • 480

Qwen/Qwen3-VL-30B-A3B-Instruct

Image-Text-to-Text • 31B • Updated 13 days ago • 1.23M • • 429

Qwen/Qwen3-VL-235B-A22B-Thinking-FP8

Image-Text-to-Text • 236B • Updated 13 days ago • 25.5k • 24

Qwen/Qwen3-VL-30B-A3B-Thinking-FP8

Image-Text-to-Text • 31B • Updated 13 days ago • 126k • 45

Qwen/Qwen3-VL-235B-A22B-Instruct-FP8

Image-Text-to-Text • 236B • Updated 13 days ago • 235k • 32

Qwen/Qwen3-VL-30B-A3B-Instruct-FP8

Image-Text-to-Text • 31B • Updated 13 days ago • 224k • 90

Qwen/Qwen3-VL-30B-A3B-Thinking

Image-Text-to-Text • 31B • Updated 13 days ago • 55.2k • • 163