Edit Models filters

Model Tree

Apps

Docker Model Runner

Inference Providers

OVHcloud AI Endpoints

HF Inference API

Misc

Inference Endpoints

text-generation-inference

Eval Results (legacy)

text-embeddings-inference

4-bit precision

8-bit precision

Mixture of Experts

Carbon Emissions

Models

27

Base only

Active filters: arm64

AEON-7/Gemma-4-12B-it-AEON-Abliterated-K4-BF16

Text Generation • 12B • Updated 3 days ago • 1.77k • 23

mradermacher/Gemma-4-12B-it-AEON-Abliterated-K4-BF16-i1-GGUF

Text Generation • 12B • Updated 4 days ago • 5.33k • 1

vlad-m-dev/distiluse-base-multilingual-v2-merged-onnx

Feature Extraction • Updated Oct 29, 2025 • 1

onnx-community/distiluse-base-multilingual-v2-merged-onnx

Feature Extraction • Updated Jun 26, 2025 • 1

halley-ai/gpt-oss-20b-MLX-4bit-gs32

Text Generation • 21B • Updated Aug 18, 2025 • 248 • 3

halley-ai/gpt-oss-20b-MLX-6bit-gs32

Text Generation • 21B • Updated Aug 18, 2025 • 60 • 1

halley-ai/gpt-oss-20b-MLX-5bit-gs32

Text Generation • 21B • Updated Sep 8, 2025 • 62 • 1

halley-ai/gpt-oss-120b-MLX-8bit-gs32

Text Generation • 117B • Updated Sep 8, 2025 • 81 • 1

halley-ai/gpt-oss-120b-MLX-bf16

Text Generation • 117B • Updated Sep 8, 2025 • 331 • 3

halley-ai/gpt-oss-120b-MLX-6bit-gs64

Text Generation • 117B • Updated Sep 8, 2025 • 144 • 1

halley-ai/Qwen3-Next-80B-A3B-Instruct-MLX-4bit-gs64

Text Generation • 80B • Updated Sep 19, 2025 • 34 • 1

halley-ai/Qwen3-Next-80B-A3B-Instruct-MLX-5bit-gs32

Text Generation • 80B • Updated Sep 19, 2025 • 13 • 1

halley-ai/Qwen3-Next-80B-A3B-Instruct-MLX-6bit-gs64

Text Generation • 80B • Updated Sep 19, 2025 • 9 • 1

mjbommar/glaurung-binary-tokenizer-001

Feature Extraction • Updated Oct 20, 2025

mjbommar/glaurung-binary-tokenizer-002

Feature Extraction • Updated Nov 13, 2025 • 1

Hellohal2064/vllm-dgx-spark-gb10

Text Generation • Updated Jan 6 • 5

thehighnotes/vllm-jetson-orin

Text Generation • Updated Mar 11

cudabenchmarktest/personaplex-7b-turbo2bit

Updated Mar 29 • 32 • 4

coverblew/llamita.cpp

Text Generation • Updated Apr 3 • 1

AEON-7/Gemma-4-31B-it-DECKARD-HERETIC-Uncensored-NVFP4

Text Generation • 18B • Updated 9 days ago • 2.75k • 10

AEON-7/Gemma-4-31B-it-DECKARD-HERETIC-Uncensored-NVFP4-SVDQuant

Text Generation • 19B • Updated 9 days ago • 397 • 2

blckrvrfx/edge-multimodal-embeddings

Feature Extraction • Updated Apr 25

asolomonqa/asmgenius-v1

Text Generation • 7B • Updated 30 days ago • 22

divinesouljoy/vedic-ai

Text Generation • Updated 22 days ago

divinesouljoy/Vedic-Native-SLM-1.1B

Updated 16 days ago

divinesouljoy/Vedic-SLM-25M

Updated 16 days ago

mradermacher/Gemma-4-12B-it-AEON-Abliterated-K4-BF16-GGUF

Text Generation • 12B • Updated 4 days ago • 1.9k