DreamID-Omni: Unified Framework for Controllable Human-Centric Audio-Video Generation
Paper
• 2602.12160 • Published
• 19
None defined yet.
UniWeTok: An Unified Binary Tokenizer with Codebook Size $\mathit{2^{128}}$ for Unified Multimodal Large Language Model
BitDance: Scaling Autoregressive Generative Models with Binary Tokens