LFM2.5 1.2B Thinking WebGPU 💧: Run LFM2.5-1.2B-Thinking directly in your browser on WebGPU (Running • Featured • 93)
Article • Nemotron 3 Nano 4B: A Compact Hybrid Model for Efficient Local AI • 4 days ago • 49
Article • From Zero to GPU: A Guide to Building and Scaling Production-Ready CUDA Kernels • Aug 18, 2025 • 95
Paper • Mamba-3: Improved Sequence Modeling using State Space Principles • 2603.15569 • Published 5 days ago • 5