TY.Zheng

aaabiao

https://scholar.google.com/citations?user=Vq-VZnUAAAAJ&hl=zh-CN

Zheng0428

AI & ML interests

None yet

Recent Activity

updated a model 15 days ago

aaabiao/Qwen3-Coder-30B-A3B-Instruct_swe_traj_train_all_1115_4000

Organizations

upvoted a paper 3 days ago

DAComp: Benchmarking Data Agents across the Full Data Intelligence Lifecycle

Paper • 2512.04324 • Published 5 days ago • 137

upvoted a paper 7 days ago

From Code Foundation Models to Agents and Applications: A Practical Guide to Code Intelligence

Paper • 2511.18538 • Published 15 days ago • 245

upvoted 2 papers about 2 months ago

Beyond Correctness: Evaluating Subjective Writing Preferences Across Cultures

Paper • 2510.14616 • Published Oct 16 • 11

COIG-Writer: A High-Quality Dataset for Chinese Creative Writing with Thought Processes

Paper • 2510.14763 • Published Oct 16 • 13

upvoted a paper 2 months ago

Knapsack RL: Unlocking Exploration of LLMs via Optimizing Budget Allocation

Paper • 2509.25849 • Published Sep 30 • 47

upvoted 2 papers 3 months ago

Mini-o3: Scaling Up Reasoning Patterns and Interaction Turns for Visual Search

Paper • 2509.07969 • Published Sep 9 • 59

Reverse-Engineered Reasoning for Open-Ended Generation

Paper • 2509.06160 • Published Sep 7 • 149

upvoted a collection 3 months ago

Code Synthetic RL Rollout

Collection

5 items • Updated Sep 23 • 1

upvoted a paper 3 months ago

TreePO: Bridging the Gap of Policy Optimization and Efficacy and Inference Efficiency with Heuristic Tree-based Modeling

Paper • 2508.17445 • Published Aug 24 • 80

upvoted a paper 4 months ago

Agentic Reinforced Policy Optimization

Paper • 2507.19849 • Published Jul 26 • 158

upvoted 3 papers 5 months ago

First Return, Entropy-Eliciting Explore

Paper • 2507.07017 • Published Jul 9 • 23

A Survey on Latent Reasoning

Paper • 2507.06203 • Published Jul 8 • 93

Does Math Reasoning Improve General LLM Capabilities? Understanding Transferability of LLM Reasoning

Paper • 2507.00432 • Published Jul 1 • 79

upvoted a paper 8 months ago

COIG-P: A High-Quality and Large-Scale Chinese Preference Dataset for Alignment with Human Values

Paper • 2504.05535 • Published Apr 7 • 44

upvoted 2 papers 9 months ago

SimpleRL-Zoo: Investigating and Taming Zero Reinforcement Learning for Open Base Models in the Wild

Paper • 2503.18892 • Published Mar 24 • 31

YuE: Scaling Open Foundation Models for Long-Form Music Generation

Paper • 2503.08638 • Published Mar 11 • 71

upvoted 2 papers 10 months ago

SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines

Paper • 2502.14739 • Published Feb 20 • 104

Steel-LLM:From Scratch to Open Source -- A Personal Journey in Building a Chinese-Centric LLM

Paper • 2502.06635 • Published Feb 10 • 6

upvoted a paper 11 months ago

REINFORCE++: A Simple and Efficient Approach for Aligning Large Language Models

Paper • 2501.03262 • Published Jan 4 • 103

upvoted a paper 12 months ago

Evaluating and Aligning CodeLLMs on Human Preference

Paper • 2412.05210 • Published Dec 6, 2024 • 50
