Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Da Xiao's picture
1 3

Da Xiao

xiaoda99
  • xiaoda99

AI & ML interests

None yet

Organizations

None yet

upvoted a paper 9 months ago

Union of Experts: Adapting Hierarchical Routing to Equivalently Decomposed Transformer

Paper • 2503.02495 • Published Mar 4 • 9
upvoted 2 papers 10 months ago

MUDDFormer: Breaking Residual Bottlenecks in Transformers via Multiway Dynamic Dense Connections

Paper • 2502.12170 • Published Feb 13 • 12

Over-Tokenized Transformer: Vocabulary is Generally Worth Scaling

Paper • 2501.16975 • Published Jan 28 • 31
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs