nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-FP8 Text Generation • 124B • Updated 2 days ago • 125k • 149
SWE-rebench V2: Language-Agnostic SWE Task Collection at Scale Paper • 2602.23866 • Published 18 days ago • 85
pplx-embed Collection Diffusion-Pretrained Dense and Contextual Embeddings • 7 items • Updated 18 days ago • 87
SLIME: Stabilized Likelihood Implicit Margin Enforcement for Preference Optimization Paper • 2602.02383 • Published Feb 2 • 29
Green-VLA: Staged Vision-Language-Action Model for Generalist Robots Paper • 2602.00919 • Published Jan 31 • 315
Quartet II: Accurate LLM Pre-Training in NVFP4 by Improved Unbiased Gradient Estimation Paper • 2601.22813 • Published Jan 30 • 57