Alexander Smith's picture

Alexander Smith

alexandersmith2

·

AI & ML interests

None yet

Recent Activity

upvoted a paper about 17 hours ago

Kwai Keye-VL-2.0 Technical Report

liked a dataset 7 days ago

AbstractPhil/sdxl-qwen-phase0

liked a model 7 days ago

Tongyi-MAI/Z-Image-Turbo

View all activity

Organizations

None yet

upvoted a paper about 17 hours ago

Kwai Keye-VL-2.0 Technical Report

Paper • 2606.10651 • Published 2 days ago • 171

upvoted a paper 8 days ago

Domino: Decoupling Causal Modeling from Autoregressive Drafting in Speculative Decoding

Paper • 2605.29707 • Published 14 days ago • 144

upvoted 2 papers 20 days ago

Anti-Self-Distillation for Reasoning RL via Pointwise Mutual Information

Paper • 2605.11609 • Published about 1 month ago • 195

ReactiveGWM: Steering NPC in Reactive Game World Models

Paper • 2605.15256 • Published 28 days ago • 28

upvoted 2 papers about 1 month ago

OpenSearch-VL: An Open Recipe for Frontier Multimodal Search Agents

Paper • 2605.05185 • Published May 6 • 102

From Context to Skills: Can Language Models Learn from Context Skillfully?

Paper • 2604.27660 • Published May 3 • 166

upvoted 4 papers about 2 months ago

WildDet3D: Scaling Promptable 3D Detection in the Wild

Paper • 2604.08626 • Published Apr 9 • 247

Phantom: Physics-Infused Video Generation via Joint Modeling of Visual and Latent Physical Dynamics

Paper • 2604.08503 • Published Apr 9 • 7

Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability

Paper • 2604.06628 • Published Apr 8 • 327

ClawBench: Can AI Agents Complete Everyday Online Tasks?

Paper • 2604.08523 • Published Apr 9 • 263

upvoted 6 papers 2 months ago

HY-Embodied-0.5: Embodied Foundation Models for Real-World Agents

Paper • 2604.07430 • Published Apr 8 • 189

An Efficient Heterogeneous Co-Design for Fine-Tuning on a Single GPU

Paper • 2603.16428 • Published Mar 17 • 51

MPDiT: Multi-Patch Global-to-Local Transformer Architecture For Efficient Flow Matching and Diffusion Model

Paper • 2603.26357 • Published Mar 27 • 4

Adam's Law: Textual Frequency Law on Large Language Models

Paper • 2604.02176 • Published Apr 2 • 506

OptiMer: Optimal Distribution Vector Merging Is Better than Data Mixing for Continual Pre-Training

Paper • 2603.28858 • Published Mar 30 • 9

FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization

Paper • 2603.19835 • Published Mar 20 • 352

upvoted 2 papers 3 months ago

Demystifing Video Reasoning

Paper • 2603.16870 • Published Mar 17 • 373

A Very Big Video Reasoning Suite

Paper • 2602.20159 • Published Feb 23 • 525