Learning beyond Teacher: Generalized On-Policy Distillation with Reward Extrapolation Paper • 2602.12125 • Published about 14 hours ago • 32
DPWriter: Reinforcement Learning with Diverse Planning Branching for Creative Writing Paper • 2601.09609 • Published 30 days ago • 3
DPWriter: Reinforcement Learning with Diverse Planning Branching for Creative Writing Paper • 2601.09609 • Published 30 days ago • 3
DPWriter: Reinforcement Learning with Diverse Planning Branching for Creative Writing Paper • 2601.09609 • Published 30 days ago • 3
The End of Manual Decoding: Towards Truly End-to-End Language Models Paper • 2510.26697 • Published Oct 30, 2025 • 117
Open Multimodal Retrieval-Augmented Factual Image Generation Paper • 2510.22521 • Published Oct 26, 2025 • 31
Taming Text-to-Sounding Video Generation via Advanced Modality Condition and Interaction Paper • 2510.03117 • Published Oct 3, 2025 • 12