VideoSSM: Autoregressive Long Video Generation with Hybrid State-Space Memory Paper • 2512.04519 • Published 7 days ago • 2
Fast-Decoding Diffusion Language Models via Progress-Aware Confidence Schedules Paper • 2512.02892 • Published 9 days ago • 8
Beyond Unified Models: A Service-Oriented Approach to Low Latency, Context Aware Phonemization for Real Time TTS Paper • 2512.08006 • Published 3 days ago • 2
HiF-VLA: Hindsight, Insight and Foresight through Motion Representation for Vision-Language-Action Models Paper • 2512.09928 • Published about 22 hours ago • 10
InfiniteVL: Synergizing Linear and Sparse Attention for Highly-Efficient, Unlimited-Input Vision-Language Models Paper • 2512.08829 • Published 2 days ago • 12
Learning Unmasking Policies for Diffusion Language Models Paper • 2512.09106 • Published 2 days ago • 3
OmniPSD: Layered PSD Generation with Diffusion Transformer Paper • 2512.09247 • Published 1 day ago • 13
StereoWorld: Geometry-Aware Monocular-to-Stereo Video Generation Paper • 2512.09363 • Published 1 day ago • 41
UniUGP: Unifying Understanding, Generation, and Planing For End-to-end Autonomous Driving Paper • 2512.09864 • Published about 23 hours ago • 6
LYNX: Learning Dynamic Exits for Confidence-Controlled Reasoning Paper • 2512.05325 • Published 7 days ago • 2
MIND-V: Hierarchical Video Generation for Long-Horizon Robotic Manipulation with RL-based Physical Alignment Paper • 2512.06628 • Published 5 days ago • 12
OneStory: Coherent Multi-Shot Video Generation with Adaptive Memory Paper • 2512.07802 • Published 3 days ago • 37
Visionary: The World Model Carrier Built on WebGPU-Powered Gaussian Splatting Platform Paper • 2512.08478 • Published 2 days ago • 70
Ground Slow, Move Fast: A Dual-System Foundation Model for Generalizable Vision-and-Language Navigation Paper • 2512.08186 • Published 3 days ago • 19
ThreadWeaver: Adaptive Threading for Efficient Parallel Reasoning in Language Models Paper • 2512.07843 • Published 17 days ago • 19
EcomBench: Towards Holistic Evaluation of Foundation Agents in E-commerce Paper • 2512.08868 • Published 2 days ago • 2
TreeGRPO: Tree-Advantage GRPO for Online RL Post-Training of Diffusion Models Paper • 2512.08153 • Published 3 days ago • 4
Wan-Move: Motion-controllable Video Generation via Latent Trajectory Guidance Paper • 2512.08765 • Published 2 days ago • 108