4 21 10

Haokun Lin

Felix1023

https://felixmessi.github.io/

AI & ML interests

None yet

Recent Activity

commented on a paper 1 day ago

QuantVLA: Scale-Calibrated Post-Training Quantization for Vision-Language-Action Models

liked a model 1 day ago

TencentARC/CubeComposer

upvoted a paper 1 day ago

CubeComposer: Spatio-Temporal Autoregressive 4K 360° Video Generation from Perspective Video

View all activity

Organizations

commented a paper 1 day ago

QuantVLA: Scale-Calibrated Post-Training Quantization for Vision-Language-Action Models

Paper • 2602.20309 • Published 11 days ago • 16 •

liked a model 1 day ago

TencentARC/CubeComposer

Video-to-Video • Updated 1 day ago • 22 • 8

upvoted a paper 1 day ago

CubeComposer: Spatio-Temporal Autoregressive 4K 360° Video Generation from Perspective Video

Paper • 2603.04291 • Published 2 days ago • 11

upvoted an article 8 days ago

Article

What’s MXFP4? The 4-Bit Secret Powering OpenAI’s GPT‑OSS Models on Modest Hardware

Aug 8, 2025

•

submitted a paper to Daily Papers 9 days ago

QuantVLA: Scale-Calibrated Post-Training Quantization for Vision-Language-Action Models

Paper • 2602.20309 • Published 11 days ago • 16

upvoted a paper 9 days ago

QuantVLA: Scale-Calibrated Post-Training Quantization for Vision-Language-Action Models

Paper • 2602.20309 • Published 11 days ago • 16

upvoted a paper 24 days ago

Improving Data and Reward Design for Scientific Reasoning in Large Language Models

Paper • 2602.08321 • Published 26 days ago • 42

upvoted a paper 3 months ago

TimeLens: Rethinking Video Temporal Grounding with Multimodal LLMs

Paper • 2512.14698 • Published Dec 16, 2025 • 21

authored a paper 4 months ago

LRQ-DiT: Log-Rotation Post-Training Quantization of Diffusion Transformers for Image and Video Generation

Paper • 2508.03485 • Published Aug 5, 2025 • 2

upvoted 2 papers 4 months ago

LRQ-DiT: Log-Rotation Post-Training Quantization of Diffusion Transformers for Image and Video Generation

Paper • 2508.03485 • Published Aug 5, 2025 • 2

From Denoising to Refining: A Corrective Framework for Vision-Language Diffusion Model

Paper • 2510.19871 • Published Oct 22, 2025 • 30

liked 2 models 4 months ago

ByteDance/Video-As-Prompt-CogVideoX-5B

Image-to-Video • Updated Oct 27, 2025 • 79 • 23

ByteDance/Video-As-Prompt-Wan2.1-14B

Image-to-Video • Updated Oct 27, 2025 • 82 • 48

upvoted a collection 4 months ago

Video-As-Prompt

Collection

The model zoo for "Video-As-Prompt: Unified Semantic Control for Video Generation" • 3 items • Updated Oct 27, 2025 • 13

liked a dataset 4 months ago

BianYx/VAP-Data

Viewer • Updated Oct 30, 2025 • 90.1k • 10.5k • 29

upvoted a paper 4 months ago

Video-As-Prompt: Unified Semantic Control for Video Generation

Paper • 2510.20888 • Published Oct 23, 2025 • 50

liked a model 5 months ago

JunhaoZhuang/FlashVSR

Video-to-Video • Updated Dec 10, 2025 • 176

upvoted a paper 6 months ago

HuMo: Human-Centric Video Generation via Collaborative Multi-Modal Conditioning

Paper • 2509.08519 • Published Sep 10, 2025 • 128

liked a model 6 months ago

TencentARC/IC-Custom

Image-to-Image • Updated Aug 31, 2025 • 11 • 16

upvoted a paper 6 months ago

AudioStory: Generating Long-Form Narrative Audio with Large Language Models

Paper • 2508.20088 • Published Aug 27, 2025 • 21

Haokun Lin

AI & ML interests

Recent Activity

Organizations

Felix1023's activity

What’s MXFP4? The 4-Bit Secret Powering OpenAI’s GPT‑OSS Models on Modest Hardware