Xu Huang

Savoia

AI & ML interests

None yet

Organizations

None yet

upvoted a paper 22 days ago

StableVLA: Towards Robust Vision-Language-Action Models without Extra Data

Paper • 2605.18287 • Published 24 days ago • 15

upvoted a paper about 1 month ago

HumanNet: Scaling Human-centric Video Learning to One Million Hours

Paper • 2605.06747 • Published May 7 • 52

liked a model about 2 months ago

BAAI/Emu3.5

Any-to-Any • 34B • Updated Dec 25, 2025 • 183 • 171

authored a paper 2 months ago

LongCat-Next: Lexicalizing Modalities as Discrete Tokens

Paper • 2603.27538 • Published Mar 29 • 147

upvoted a paper 2 months ago

LongCat-Next: Lexicalizing Modalities as Discrete Tokens

Paper • 2603.27538 • Published Mar 29 • 147

upvoted 2 papers 3 months ago

LongCat-Flash-Prover: Advancing Native Formal Reasoning via Agentic Tool-Integrated Reinforcement Learning

Paper • 2603.21065 • Published Mar 22 • 78

Enhancing Spatial Understanding in Image Generation via Reward Modeling

Paper • 2602.24233 • Published Feb 27 • 60

upvoted a paper 4 months ago

iFSQ: Improving FSQ for Image Generation with 1 Line of Code

Paper • 2601.17124 • Published Jan 23 • 34

upvoted 2 papers 5 months ago

Rethinking Video Generation Model for the Embodied World

Paper • 2601.15282 • Published Jan 21 • 45

NextFlow: Unified Sequential Modeling Activates Multimodal Understanding and Generation

Paper • 2601.02204 • Published Jan 5 • 64

upvoted an article 6 months ago

Skill is All You Need: Lessons from Building Marketing Agents at Noumena

Article • Noumena-AI • Dec 25, 2025 • 14

upvoted a paper 6 months ago

Next-Embedding Prediction Makes Strong Vision Learners

Paper • 2512.16922 • Published Dec 18, 2025 • 91

authored a paper 7 months ago

Emu3.5: Native Multimodal Models are World Learners

Paper • 2510.26583 • Published Oct 30, 2025 • 117

upvoted a collection 7 months ago

Emu3.5

Collection • Native Multimodal Models are World Learners 🌍 • 4 items • Updated Feb 4 • 77

upvoted a paper 8 months ago

Imaginarium: Vision-guided High-Quality 3D Scene Layout Generation

Paper • 2510.15564 • Published Oct 17, 2025 • 11

New activity in chenguolin/InstructScene_dataset 11 months ago

upload dining room & living room sg2sc checkpoint

#3 opened 11 months ago by Savoia

published 3 models about 1 year ago

updated a model about 1 year ago

Savoia/trained-flux-lora-sofa-config-3

Text-to-Image • Updated May 8, 2025 • 3