JIANYI WANG's picture

In a Training Loop 🔄

JIANYI WANG

Iceclear

·

https://iceclear.github.io

AI & ML interests

Generative AI

Recent Activity

upvoted a collection 20 days ago

upvoted a paper 29 days ago

Qwen-Image-VAE-2.0 Technical Report

liked a model 3 months ago

Qwen/Qwen3.5-2B-Base

View all activity

Organizations

upvoted a collection 20 days ago

SeedVR

A diffusion transformer model for high-resolution image and video restoration. • 9 items • Updated Aug 19, 2025 • 16

upvoted a paper 29 days ago

Qwen-Image-VAE-2.0 Technical Report

Paper • 2605.13565 • Published 30 days ago • 60

upvoted a paper 4 months ago

HySparse: A Hybrid Sparse Attention Architecture with Oracle Token Selection and KV Cache Sharing

Paper • 2602.03560 • Published Feb 3 • 49

upvoted 2 papers 5 months ago

Transition Matching Distillation for Fast Video Generation

Paper • 2601.09881 • Published Jan 14 • 34

Self-Evaluation Unlocks Any-Step Text-to-Image Generation

Paper • 2512.22374 • Published Dec 26, 2025 • 17

upvoted an article 5 months ago

Article

M2.1: Multilingual and Multi-Task Coding with Strong Generalization

MiniMaxAI

•

Jan 5

• 41

upvoted a paper 5 months ago

FlowBlending: Stage-Aware Multi-Model Sampling for Fast and High-Fidelity Video Generation

Paper • 2512.24724 • Published Dec 31, 2025 • 9

upvoted 3 papers 6 months ago

TurboDiffusion: Accelerating Video Diffusion Models by 100-200 Times

Paper • 2512.16093 • Published Dec 18, 2025 • 97

Region-Constraint In-Context Generation for Instructional Video Editing

Paper • 2512.17650 • Published Dec 19, 2025 • 53

Qwen-Image-Layered: Towards Inherent Editability via Layer Decomposition

Paper • 2512.15603 • Published Dec 17, 2025 • 71

upvoted a collection 6 months ago

Pixio

5 items • Updated Dec 19, 2025 • 16

upvoted 3 papers 7 months ago

Kandinsky 5.0: A Family of Foundation Models for Image and Video Generation

Paper • 2511.14993 • Published Nov 19, 2025 • 234

One Small Step in Latent, One Giant Leap for Pixels: Fast Latent Upscale Adapter for Your Diffusion Models

Paper • 2511.10629 • Published Nov 13, 2025 • 131

PAN: A World Model for General, Interactable, and Long-Horizon World Simulation

Paper • 2511.09057 • Published Nov 12, 2025 • 82

upvoted a collection 10 months ago

DeepSeek-V3.1

3 items • Updated Mar 2 • 262

upvoted a paper 10 months ago

NextStep-1: Toward Autoregressive Image Generation with Continuous Tokens at Scale

Paper • 2508.10711 • Published Aug 14, 2025 • 146

upvoted a paper 11 months ago

MiroMind-M1: An Open-Source Advancement in Mathematical Reasoning via Context-Aware Multi-Stage Policy Optimization

Paper • 2507.14683 • Published Jul 19, 2025 • 137

upvoted a collection 11 months ago

Qwen2.5-VL

Vision-language model series based on Qwen2.5 • 10 items • Updated Mar 2 • 563

upvoted a paper 11 months ago

Scaling RL to Long Videos

Paper • 2507.07966 • Published Jul 10, 2025 • 161

upvoted a paper 12 months ago

Vision as a Dialect: Unifying Visual Understanding and Generation via Text-Aligned Representations

Paper • 2506.18898 • Published Jun 23, 2025 • 35