Songhao Wu
shwu
AI & ML interests
Mixture-of-Experts Model, Language Model Pretraining
Recent Activity
authored a paper about 16 hours ago
Redesign Mixture-of-Experts Routers with Manifold Power Iteration upvoted a paper about 19 hours ago
Redesign Mixture-of-Experts Routers with Manifold Power Iteration submitted a paper about 20 hours ago
Redesign Mixture-of-Experts Routers with Manifold Power IterationOrganizations
None yet