Qipeng Chen

lechatelierlenz

AI & ML interests

multimodal reasoning, dLLM

Recent Activity

upvoted a paper about 2 hours ago

Bottom-up Policy Optimization: Your Language Model Policy Secretly Contains Internal Policies

Paper • 2512.19673 • Published 3 days ago • 54

upvoted a paper 7 days ago

QwenLong-L1.5: Post-Training Recipe for Long-Context Reasoning and Memory Management

Paper • 2512.12967 • Published 10 days ago • 98

upvoted a paper 14 days ago

On the Interplay of Pre-Training, Mid-Training, and RL on Reasoning Language Models

Paper • 2512.07783 • Published 17 days ago • 35

upvoted a paper 20 days ago

ToolOrchestra: Elevating Intelligence via Efficient Model and Tool Orchestration

Paper • 2511.21689 • Published 29 days ago • 108

upvoted a paper 21 days ago

DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models

Paper • 2512.02556 • Published 23 days ago • 230

upvoted a paper about 1 month ago

MMaDA-Parallel: Multimodal Large Diffusion Language Models for Thinking-Aware Editing and Generation

Paper • 2511.09611 • Published Nov 12 • 68

upvoted 2 papers about 2 months ago

Reasoning with Sampling: Your Base Model is Smarter Than You Think

Paper • 2510.14901 • Published Oct 16 • 47

Agent Learning via Early Experience

Paper • 2510.08558 • Published Oct 9 • 269

upvoted a paper 3 months ago

Lavida-O: Elastic Large Masked Diffusion Models for Unified Multimodal Understanding and Generation

Paper • 2509.19244 • Published Sep 23 • 11

upvoted a paper 5 months ago

Don't Overthink It: A Survey of Efficient R1-style Large Reasoning Models

Paper • 2508.02120 • Published Aug 4 • 19

upvoted a collection 5 months ago

dLLM & dMLLM

Collection

(M)LLMs based on Discrete Diffusion Model and relevant techniques • 16 items • Updated Jul 23 • 2

upvoted 2 papers 5 months ago

LAPO: Internalizing Reasoning Efficiency via Length-Adaptive Policy Optimization

Paper • 2507.15758 • Published Jul 21 • 35

GUI-G^2: Gaussian Reward Modeling for GUI Grounding

Paper • 2507.15846 • Published Jul 21 • 133

upvoted 4 papers 7 months ago

VerifyBench: Benchmarking Reference-based Reward Systems for Large Language Models

Paper • 2505.15801 • Published May 21 • 17
