-
lusxvr/nanoVLM-222M
Image-Text-to-Text • 0.2B • Updated • 185 • 98 -
Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning
Paper • 2503.09516 • Published • 36 -
AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time
Paper • 2505.24863 • Published • 97 -
QwenLong-L1: Towards Long-Context Large Reasoning Models with Reinforcement Learning
Paper • 2505.17667 • Published • 88
Collections
Discover the best community collections!
Collections including paper arxiv:2512.22615
-
Dream-VL & Dream-VLA: Open Vision-Language and Vision-Language-Action Models with Diffusion Language Model Backbone
Paper • 2512.22615 • Published • 43 -
Learning to Reason in 4D: Dynamic Spatial Understanding for Vision Language Models
Paper • 2512.20557 • Published • 49 -
TurboDiffusion: Accelerating Video Diffusion Models by 100-200 Times
Paper • 2512.16093 • Published • 91
-
ReXGroundingCT: A 3D Chest CT Dataset for Segmentation of Findings from Free-Text Reports
Paper • 2507.22030 • Published -
Unlocking the Potential of MLLMs in Referring Expression Segmentation via a Light-weight Mask Decode
Paper • 2508.04107 • Published • 4 -
Phrase-grounded Fact-checking for Automatically Generated Chest X-ray Reports
Paper • 2509.21356 • Published -
Learning Segmentation from Radiology Reports
Paper • 2507.05582 • Published • 1
-
Fast-dLLM: Training-free Acceleration of Diffusion LLM by Enabling KV Cache and Parallel Decoding
Paper • 2505.22618 • Published • 44 -
DINGO: Constrained Inference for Diffusion LLMs
Paper • 2505.23061 • Published • 31 -
Discrete Diffusion in Large Language and Multimodal Models: A Survey
Paper • 2506.13759 • Published • 43 -
LongLLaDA: Unlocking Long Context Capabilities in Diffusion LLMs
Paper • 2506.14429 • Published • 44
-
Dream-VL & Dream-VLA: Open Vision-Language and Vision-Language-Action Models with Diffusion Language Model Backbone
Paper • 2512.22615 • Published • 43 -
LLaDA2.0: Scaling Up Diffusion Language Models to 100B
Paper • 2512.15745 • Published • 78 -
On the Role of Discreteness in Diffusion LLMs
Paper • 2512.22630 • Published • 17 -
DiffThinker: Towards Generative Multimodal Reasoning with Diffusion Models
Paper • 2512.24165 • Published • 44
-
openai/gpt-oss-120b
Text Generation • 120B • Updated • 3.47M • • 4.31k -
Emergent temporal abstractions in autoregressive models enable hierarchical reinforcement learning
Paper • 2512.20605 • Published • 60 -
Nested Browser-Use Learning for Agentic Information Seeking
Paper • 2512.23647 • Published • 17 -
TimeBill: Time-Budgeted Inference for Large Language Models
Paper • 2512.21859 • Published • 22
-
Gemini Robotics: Bringing AI into the Physical World
Paper • 2503.20020 • Published • 29 -
Magma: A Foundation Model for Multimodal AI Agents
Paper • 2502.13130 • Published • 58 -
LLaVA-Plus: Learning to Use Tools for Creating Multimodal Agents
Paper • 2311.05437 • Published • 51 -
OS-ATLAS: A Foundation Action Model for Generalist GUI Agents
Paper • 2410.23218 • Published • 49
-
lusxvr/nanoVLM-222M
Image-Text-to-Text • 0.2B • Updated • 185 • 98 -
Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning
Paper • 2503.09516 • Published • 36 -
AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time
Paper • 2505.24863 • Published • 97 -
QwenLong-L1: Towards Long-Context Large Reasoning Models with Reinforcement Learning
Paper • 2505.17667 • Published • 88
-
Dream-VL & Dream-VLA: Open Vision-Language and Vision-Language-Action Models with Diffusion Language Model Backbone
Paper • 2512.22615 • Published • 43 -
Learning to Reason in 4D: Dynamic Spatial Understanding for Vision Language Models
Paper • 2512.20557 • Published • 49 -
TurboDiffusion: Accelerating Video Diffusion Models by 100-200 Times
Paper • 2512.16093 • Published • 91
-
Dream-VL & Dream-VLA: Open Vision-Language and Vision-Language-Action Models with Diffusion Language Model Backbone
Paper • 2512.22615 • Published • 43 -
LLaDA2.0: Scaling Up Diffusion Language Models to 100B
Paper • 2512.15745 • Published • 78 -
On the Role of Discreteness in Diffusion LLMs
Paper • 2512.22630 • Published • 17 -
DiffThinker: Towards Generative Multimodal Reasoning with Diffusion Models
Paper • 2512.24165 • Published • 44
-
ReXGroundingCT: A 3D Chest CT Dataset for Segmentation of Findings from Free-Text Reports
Paper • 2507.22030 • Published -
Unlocking the Potential of MLLMs in Referring Expression Segmentation via a Light-weight Mask Decode
Paper • 2508.04107 • Published • 4 -
Phrase-grounded Fact-checking for Automatically Generated Chest X-ray Reports
Paper • 2509.21356 • Published -
Learning Segmentation from Radiology Reports
Paper • 2507.05582 • Published • 1
-
openai/gpt-oss-120b
Text Generation • 120B • Updated • 3.47M • • 4.31k -
Emergent temporal abstractions in autoregressive models enable hierarchical reinforcement learning
Paper • 2512.20605 • Published • 60 -
Nested Browser-Use Learning for Agentic Information Seeking
Paper • 2512.23647 • Published • 17 -
TimeBill: Time-Budgeted Inference for Large Language Models
Paper • 2512.21859 • Published • 22
-
Fast-dLLM: Training-free Acceleration of Diffusion LLM by Enabling KV Cache and Parallel Decoding
Paper • 2505.22618 • Published • 44 -
DINGO: Constrained Inference for Diffusion LLMs
Paper • 2505.23061 • Published • 31 -
Discrete Diffusion in Large Language and Multimodal Models: A Survey
Paper • 2506.13759 • Published • 43 -
LongLLaDA: Unlocking Long Context Capabilities in Diffusion LLMs
Paper • 2506.14429 • Published • 44
-
Gemini Robotics: Bringing AI into the Physical World
Paper • 2503.20020 • Published • 29 -
Magma: A Foundation Model for Multimodal AI Agents
Paper • 2502.13130 • Published • 58 -
LLaVA-Plus: Learning to Use Tools for Creating Multimodal Agents
Paper • 2311.05437 • Published • 51 -
OS-ATLAS: A Foundation Action Model for Generalist GUI Agents
Paper • 2410.23218 • Published • 49