Models

26,763

Active filters: 8-bit

GadflyII/GLM-4.7-Flash-NVFP4

Text Generation • 18B • Updated 3 days ago • 88.6k • 32

openai/gpt-oss-120b

Text Generation • 120B • Updated Aug 26, 2025 • 3.07M • 4.37k

openai/gpt-oss-20b

Text Generation • 22B • Updated Aug 26, 2025 • 6.67M • 4.24k

mlx-community/GLM-4.7-Flash-8bit

Text Generation • 30B • Updated 4 days ago • 2.65k • 13

microsoft/bitnet-b1.58-2B-4T

Text Generation • 0.8B • Updated Dec 17, 2025 • 5.75k • 1.26k

AlicanKiraz0/Mihenk-LLM-14B-Turkish-Financial-Model-mlx-8Bit

15B • Updated 7 days ago • 24 • 5

MultiverseComputingCAI/HyperNova-60B

Text Generation • 60B • Updated 15 days ago • 1.29k • 45

mlx-community/GLM-4.7-Flash-8bit-gs32

Text Generation • 30B • Updated 3 days ago • 273 • 4

NVFP4/Qwen3-Coder-30B-A3B-Instruct-FP4

Text Generation • 16B • Updated Aug 5, 2025 • 3.14k • 6

nvidia/Qwen2.5-VL-7B-Instruct-NVFP4

Text Generation • 5B • Updated Dec 6, 2025 • 3.08k • 12

huihui-ai/Huihui-gpt-oss-20b-mxfp4-abliterated-v2

Text Generation • 21B • Updated Sep 27, 2025 • 2.39k • 16

nvidia/DeepSeek-V3.2-NVFP4

Text Generation • 394B • Updated 3 days ago • 326 • 3

LiquidAI/LFM2.5-1.2B-Thinking-MLX-8bit

Text Generation • 0.3B • Updated 7 days ago • 130 • 3

MaziyarPanahi/Mistral-7B-Instruct-Aya-101-GGUF

Text Generation • 7B • Updated Feb 28, 2024 • 199 • 12

MaziyarPanahi/Saul-Instruct-v1-GGUF

Text Generation • 7B • Updated Mar 10, 2024 • 216 • 10

ragraph-ai/stable-cypher-instruct-3b

Text Generation • 3B • Updated Jun 12, 2025 • 339 • 31

tiiuae/Falcon-E-3B-Instruct

Text Generation • 0.9B • Updated Oct 7, 2025 • 430 • 36

nvidia/Qwen3-8B-NVFP4

Text Generation • 5B • Updated Sep 9, 2025 • 4.61k • 12

openai/gpt-oss-safeguard-20b

Text Generation • 22B • Updated 9 days ago • 10.5k • 180

Disty0/Z-Image-Turbo-SDNQ-int8

Text-to-Image • Updated Dec 2, 2025 • 2.15k • 16

Firworks/NVIDIA-Nemotron-3-Nano-30B-A3B-nvfp4

18B • Updated 17 days ago • 2k • 7

Salyut1/GLM-4.7-NVFP4

Text Generation • 177B • Updated about 1 month ago • 4.85k • 9

Tengyunw/MiniMax-M2.1-NVFP4

Text Generation • 115B • Updated 17 days ago • 153 • 6

Octen/Octen-Embedding-8B-INT8

Sentence Similarity • 8B • Updated 5 days ago • 56 • 3

mlx-community/translategemma-27b-it-8bit

Text Generation • 27B • Updated 8 days ago • 882 • 3

nightmedia/Qwen3-32B-Element5-Heretic-qx86-hi-mlx

Text Generation • 33B • Updated 4 days ago • 179 • 2

lmstudio-community/GLM-4.7-Flash-MLX-8bit

Text Generation • 30B • Updated 1 day ago • 145k • 2

mlx-community/Qwen3-TTS-12Hz-0.6B-CustomVoice-8bit

Text-to-Speech • 0.3B • Updated about 19 hours ago • 35 • 2

Undi95/Mistral-7B-roleplay_alpaca-lora

Text Generation • Updated Sep 28, 2023 • 22 • 10

MaziyarPanahi/BioMistral-7B-GGUF

Text Generation • 7B • Updated Feb 19, 2024 • 1.21k • 56