Chinese University of Hong Kong, Shenzhen

university

https://www.cuhk.edu.cn/

Activity Feed Request to join this org

AI & ML interests

NLP, CV

Recent Activity

Eric3200 authored a paper 1 day ago

ShareGPT-4o-Image: Aligning Multimodal Models with GPT-4o-Level Image Generation

Eric3200 authored a paper 1 day ago

MedGen: Unlocking Medical Video Generation by Scaling Granularly-annotated Medical Videos

Eric3200 authored a paper 1 day ago

ShizhenGPT: Towards Multimodal LLMs for Traditional Chinese Medicine

View all activity

Papers

Janus: Disaggregating Attention and Experts for Scalable MoE Inference

View all Papers

yeyeyewang

submitted a paper to Daily Papers 24 days ago

Janus: Disaggregating Attention and Experts for Scalable MoE Inference

Paper • 2512.13525 • Published 26 days ago • 5

yeyeyewang

authored a paper 24 days ago

Janus: Disaggregating Attention and Experts for Scalable MoE Inference

Paper • 2512.13525 • Published 26 days ago • 5

tzzte

authored 2 papers 4 months ago

MTalk-Bench: Evaluating Speech-to-Speech Models in Multi-Turn Dialogues via Arena-style and Rubrics Protocols

Paper • 2508.18240 • Published Aug 22, 2025

EchoX: Towards Mitigating Acoustic-Semantic Gap via Echo Training for Speech-to-Speech LLMs

Paper • 2509.09174 • Published Sep 11, 2025 • 61

HarryHe

authored 2 papers 5 months ago

Overview of the Amphion Toolkit (v0.2)

Paper • 2501.15442 • Published Jan 26, 2025 • 3

Fact2Fiction: Targeted Poisoning Attack to Agentic Fact-checking System

Paper • 2508.06059 • Published Aug 8, 2025 • 4

whatlegequ

authored 8 papers 10 months ago

Inducing Neural Collapse in Deep Long-tailed Learning

Paper • 2302.12453 • Published Feb 24, 2023

Elucidating The Design Space of Classifier-Guided Diffusion Generation

Paper • 2310.11311 • Published Oct 17, 2023

Explore and Exploit the Diverse Knowledge in Model Zoo for Domain Generalization

Paper • 2306.02595 • Published Jun 5, 2023

On the Expressive Power of a Variant of the Looped Transformer

Paper • 2402.13572 • Published Feb 21, 2024

Towards Understanding How Transformer Perform Multi-step Reasoning with Matching Operation

Paper • 2405.15302 • Published May 24, 2024

Elucidating the design space of language models for image generation

Paper • 2410.16257 • Published Oct 21, 2024

Rewards Are Enough for Fast Photo-Realistic Text-to-image Generation

Paper • 2503.13070 • Published Mar 17, 2025 • 10

Learning Few-Step Diffusion Models by Trajectory Distribution Matching

Paper • 2503.06674 • Published Mar 9, 2025 • 8

HarryHe

authored 3 papers 12 months ago

Emilia: A Large-Scale, Extensive, Multilingual, and Diverse Dataset for Speech Generation

Paper • 2501.15907 • Published Jan 27, 2025 • 17

Emilia: An Extensive, Multilingual, and Diverse Speech Dataset for Large-Scale Speech Generation

Paper • 2407.05361 • Published Jul 7, 2024 • 2

SpMis: An Investigation of Synthetic Spoken Misinformation Detection

Paper • 2409.11308 • Published Sep 17, 2024

chongjie

authored 3 papers over 1 year ago

StableNormal: Reducing Diffusion Variance for Stable and Sharp Normal

Paper • 2406.16864 • Published Jun 24, 2024 • 3

LASA: Instance Reconstruction from Real Scans using A Large-scale Aligned Shape Annotation Dataset

Paper • 2312.12418 • Published Dec 19, 2023 • 2

MVImgNet: A Large-scale Dataset of Multi-view Images

Paper • 2303.06042 • Published Mar 10, 2023