
oh sehun

sehun

AI & ML interests

None yet

Recent Activity

upvoted a paper about 2 hours ago

Causal Motion Diffusion Models for Autoregressive Motion Generation

Paper • 2602.22594 • Published 1 day ago • 5

upvoted a paper about 3 hours ago

MolHIT: Advancing Molecular-Graph Generation with Hierarchical Discrete Diffusion Models

Paper • 2602.17602 • Published 8 days ago • 52

upvoted a paper about 4 hours ago

OmniGAIA: Towards Native Omni-Modal AI Agents

Paper • 2602.22897 • Published 1 day ago • 43

upvoted a paper about 16 hours ago

DODO: Discrete OCR Diffusion Models

Paper • 2602.16872 • Published 9 days ago • 9

upvoted an article about 16 hours ago

Mixture of Experts (MoEs) in Transformers

Article • 1 day ago

upvoted a paper 1 day ago

DreamID-Omni: Unified Framework for Controllable Human-Centric Audio-Video Generation

Paper • 2602.12160 • Published 15 days ago • 34

upvoted an article 2 days ago

Deploying Open Source Vision Language Models (VLM) on Jetson

Article • 4 days ago

upvoted 3 papers 2 days ago

From Perception to Action: An Interactive Benchmark for Vision Reasoning

Paper • 2602.21015 • Published 3 days ago • 21

A Very Big Video Reasoning Suite

Paper • 2602.20159 • Published 4 days ago • 448

DeepVision-103K: A Visually Diverse, Broad-Coverage, and Verifiable Mathematical Dataset for Multimodal Reasoning

Paper • 2602.16742 • Published 10 days ago • 9

upvoted a paper 6 days ago

SpargeAttention2: Trainable Sparse Attention via Hybrid Top-k+Top-p Masking and Distillation Fine-Tuning

Paper • 2602.13515 • Published 14 days ago • 43

upvoted an article 6 days ago

Train AI models with Unsloth and Hugging Face Jobs for FREE

Article • 8 days ago

upvoted a paper 8 days ago

RynnBrain: Open Embodied Foundation Models

Paper • 2602.14979 • Published 14 days ago • 42

upvoted a paper 9 days ago

GLM-5: from Vibe Coding to Agentic Engineering

Paper • 2602.15763 • Published 10 days ago • 98

liked a model 9 days ago

shallowdream204/BitDance-14B-16x

Text-to-Image • 15B • Updated 10 days ago • 232 • 85

upvoted a paper 9 days ago

Experiential Reinforcement Learning

Paper • 2602.13949 • Published 13 days ago • 68

upvoted a paper 10 days ago

Zooming without Zooming: Region-to-Image Distillation for Fine-Grained Multimodal Perception

Paper • 2602.11858 • Published 15 days ago • 58

upvoted 2 collections 10 days ago

Tiny Aya

Collection

Bridging Scale and Multilingual Depth • 10 items • Updated 10 days ago • 61

BitDance

Collection

BitDance: Open-source autoregressive model with binary visual tokens. A research project for building powerful multimodal autoregressive model. • 11 items • Updated 5 days ago • 9

liked a model 11 days ago

Qwen/Qwen3.5-397B-A17B

Image-Text-to-Text • 403B • Updated 4 days ago • 726k • 1.11k
