2 19 144

fireblade2534

AI & ML interests

None yet

Recent Activity

upvoted a paper 13 days ago

LTX-2: Efficient Joint Audio-Visual Foundation Model

upvoted a paper 14 days ago

Why LLMs Aren't Scientists Yet: Lessons from Four Autonomous Research Attempts

upvoted a paper 17 days ago

Taming Hallucinations: Boosting MLLMs' Video Understanding via Counterfactual Video Generation

View all activity

Organizations

None yet

upvoted a paper 13 days ago

LTX-2: Efficient Joint Audio-Visual Foundation Model

Paper • 2601.03233 • Published 16 days ago • 132

upvoted a paper 14 days ago

Why LLMs Aren't Scientists Yet: Lessons from Four Autonomous Research Attempts

Paper • 2601.03315 • Published 17 days ago • 6

upvoted a paper 17 days ago

Taming Hallucinations: Boosting MLLMs' Video Understanding via Counterfactual Video Generation

Paper • 2512.24271 • Published 23 days ago • 61

liked a model about 1 month ago

zai-org/GLM-TTS

Text-to-Speech • Updated 10 days ago • 672 • 306

liked 3 models about 2 months ago

upvoted a paper 2 months ago

More Thought, Less Accuracy? On the Dual Nature of Reasoning in Vision-Language Models

Paper • 2509.25848 • Published Sep 30, 2025 • 80

upvoted a paper 3 months ago

NaviTrace: Evaluating Embodied Navigation of Vision-Language Models

Paper • 2510.26909 • Published Oct 30, 2025 • 14

liked a model 3 months ago

moonshotai/Kimi-Linear-48B-A3B-Instruct

Text Generation • 49B • Updated Dec 16, 2025 • 34.7k • 528

upvoted an article 3 months ago

Article

Visualizing How VLMs Work

Oct 7, 2025

•

liked 2 Spaces 3 months ago

KaniTTS

😻

106

Generate speech from text using selected models

NeuTTS-Air

☁

306

Generate speech from text using a reference audio

upvoted a collection 4 months ago

Qwen3-VL

Collection

37 items • Updated 23 days ago • 594

liked a model 4 months ago

Qwen/Qwen-Image-Edit-2509

Image-to-Image • Updated Sep 22, 2025 • 278k • • 1.05k

upvoted a collection 4 months ago

Qwen3-Omni

Collection

6 items • Updated 23 days ago • 181

liked a model 4 months ago

Wan-AI/Wan2.2-Animate-14B

Video-to-Video • Updated Nov 5, 2025 • 65.4k • 990

liked a dataset 4 months ago

HuggingFaceM4/FineVision

Viewer • Updated Oct 21, 2025 • 24.2M • 121k • 467

liked 2 models 5 months ago

bosonai/higgs-audio-v2-generation-3B-base

Text-to-Speech • 6B • Updated Jul 28, 2025 • 165k • 655

KittenML/kitten-tts-mini-0.1

Updated Sep 8, 2025 • 1.1k • 33