Edit Models filters

Apps

Docker Model Runner

Inference Providers

OVHcloud AI Endpoints

HF Inference API

Misc

Inference Endpoints

text-generation-inference

Eval Results (legacy)

text-embeddings-inference

4-bit precision

8-bit precision

Mixture of Experts

Carbon Emissions

Models

18,757

Full-text search

Active filters: grpo

codelion/Qwen3-4B-execution-world-model-lora

Text Generation • Updated Oct 20, 2025 • 12 • 5

Danau5tin/calculator_agent_qwen2.5_0.5b

0.5B • Updated Apr 30, 2025 • 1 • 1

openmed-community/granite-4.0-micro-OpenMed

Text Generation • 3B • Updated Oct 7, 2025 • 7 • 6

Guilherme34/True-Qwen2.5-14B-Instruct

Text Generation • 15B • Updated Oct 14, 2025 • 6 • 3

oberbics/llama-3.1-8B-newspaper_argument_mining

Text Generation • 8B • Updated Nov 27, 2025 • 3 • 1

Intel/deepmath-v1

Text Generation • 4B • Updated Dec 8, 2025 • 75 • 10

Freakz3z/Qwen-JSON

Text Generation • 4B • Updated Dec 3, 2025 • 12 • 2

zyc-zju/Qwen3-Embedding-4B-GRPO

Updated 3 days ago • 1

webxos/microd_v1

Text Generation • Updated 18 days ago • 297 • 2

aquiffoo/neo-3-3B-A400M-Thinking

Text Generation • Updated 23 days ago • 2

aquiffoo/neo-3-1B-A90M-Instruct

Text Generation • Updated 23 days ago • 2

bigatuna/Qwen3-0.6B-Sushi-Coder

Text Generation • 0.6B • Updated 23 days ago • 45 • 1

ehcalabres/lfm2.5-1.2b-instruct-grpo-lora

Updated 14 days ago • 2

ielabgroup/Autobool-Qwen4b-No-reasoning

Reinforcement Learning • 4B • Updated 3 days ago • 13 • 1

ielabgroup/Autobool-Qwen4b-Reasoning

Reinforcement Learning • 4B • Updated 3 days ago • 17 • 1

ielabgroup/Autobool-Qwen4b-Reasoning-objective

Reinforcement Learning • 4B • Updated 3 days ago • 17 • 1

Chun121/Qwen3-4B-RPG-Roleplay-V2

Text Generation • 4B • Updated Aug 24, 2025 • 9.45k • 33

onuryozcu/llama

Text Generation • 0.1B • Updated Mar 10, 2025 • 1

amiguel/promptTuning

8B • Updated Feb 16, 2025

sergiopaniego/Qwen2-0.5B-GRPO-test

Updated Oct 3, 2025

Novaciano/ESP-NSFW-GRPO-1B-Sin_Censura-GGUF

1B • Updated Jan 28, 2025 • 65 • 3

nbd22/Llama-3.1-8B-Instruct-GRPO-gsm8k-ft-lora

Updated Jan 28, 2025

sergiopaniego/Qwen2-0.5B-GRPO

Updated Jan 31, 2025

philschmid/qwen-2.5-3b-r1-countdown

Text Generation • 3B • Updated Jan 30, 2025 • 7 • 8

spinech/qwen-2.5-3b-r1-countdown

Text Generation • 3B • Updated Apr 28, 2025 • 5

Dongwei/Qwen2.5-1.5B-Open-R1-GRPO

Text Generation • 2B • Updated Feb 2, 2025 • 2 • 1

yooneo/qwen-0.5b-r1-aha

Updated Jan 31, 2025

yooneo/qwen-1.5b-r1-aha

Updated Jan 31, 2025

spinech/qwen2.5-3b-r1-rearc-stage1

Text Generation • 3B • Updated Apr 28, 2025 • 4

Dongwei/DeepSeek-R1-Distill-Qwen-7B-GRPO

Text Generation • 8B • Updated Feb 3, 2025 • 7 • 1