Submitted by
BoYang Zheng
AI & ML interests
None defined yet.
Recent Activity
View all activity
Papers
Scaling Text-to-Image Diffusion Transformers with Representation Autoencoders
SIMS-V: Simulated Instruction-Tuning for Spatial Video Understanding