Causal Motion Diffusion Models for Autoregressive Motion Generation Paper • 2602.22594 • Published 1 day ago • 5
MolHIT: Advancing Molecular-Graph Generation with Hierarchical Discrete Diffusion Models Paper • 2602.17602 • Published 8 days ago • 52
DreamID-Omni: Unified Framework for Controllable Human-Centric Audio-Video Generation Paper • 2602.12160 • Published 15 days ago • 34
From Perception to Action: An Interactive Benchmark for Vision Reasoning Paper • 2602.21015 • Published 3 days ago • 21
DeepVision-103K: A Visually Diverse, Broad-Coverage, and Verifiable Mathematical Dataset for Multimodal Reasoning Paper • 2602.16742 • Published 10 days ago • 9
SpargeAttention2: Trainable Sparse Attention via Hybrid Top-k+Top-p Masking and Distillation Fine-Tuning Paper • 2602.13515 • Published 14 days ago • 43
view article Article Train AI models with Unsloth and Hugging Face Jobs for FREE +4 8 days ago • 75
Zooming without Zooming: Region-to-Image Distillation for Fine-Grained Multimodal Perception Paper • 2602.11858 • Published 15 days ago • 58
Tiny Aya Collection Bridging Scale and Multilingual Depth • 10 items • Updated 10 days ago • 61
BitDance Collection BitDance: Open-source autoregressive model with binary visual tokens. A research project for building powerful multimodal autoregressive model. • 11 items • Updated 5 days ago • 9