Inference Providers
Active filters: arm64
AEON-7/Gemma-4-12B-it-AEON-Abliterated-K4-BF16
Text Generation
• 12B • Updated • 1.77k
• 23
mradermacher/Gemma-4-12B-it-AEON-Abliterated-K4-BF16-i1-GGUF
Text Generation
• 12B • Updated • 5.33k
• 1
vlad-m-dev/distiluse-base-multilingual-v2-merged-onnx
Feature Extraction
• Updated • 1
onnx-community/distiluse-base-multilingual-v2-merged-onnx
Feature Extraction
• Updated • 1
halley-ai/gpt-oss-20b-MLX-4bit-gs32
Text Generation
• 21B • Updated • 248
• 3
halley-ai/gpt-oss-20b-MLX-6bit-gs32
Text Generation
• 21B • Updated • 60
• 1
halley-ai/gpt-oss-20b-MLX-5bit-gs32
Text Generation
• 21B • Updated • 62
• 1
halley-ai/gpt-oss-120b-MLX-8bit-gs32
Text Generation
• 117B • Updated • 81
• 1
halley-ai/gpt-oss-120b-MLX-bf16
Text Generation
• 117B • Updated • 331
• 3
halley-ai/gpt-oss-120b-MLX-6bit-gs64
Text Generation
• 117B • Updated • 144
• 1
halley-ai/Qwen3-Next-80B-A3B-Instruct-MLX-4bit-gs64
Text Generation
• 80B • Updated • 34
• 1
halley-ai/Qwen3-Next-80B-A3B-Instruct-MLX-5bit-gs32
Text Generation
• 80B • Updated • 13
• 1
halley-ai/Qwen3-Next-80B-A3B-Instruct-MLX-6bit-gs64
Text Generation
• 80B • Updated • 9
• 1
mjbommar/glaurung-binary-tokenizer-001
Feature Extraction
• Updated mjbommar/glaurung-binary-tokenizer-002
Feature Extraction
• Updated • 1
Hellohal2064/vllm-dgx-spark-gb10
Text Generation
• Updated • 5
thehighnotes/vllm-jetson-orin
Text Generation
• Updated cudabenchmarktest/personaplex-7b-turbo2bit
Text Generation
• Updated • 1
AEON-7/Gemma-4-31B-it-DECKARD-HERETIC-Uncensored-NVFP4
Text Generation
• 18B • Updated • 2.75k
• 10
AEON-7/Gemma-4-31B-it-DECKARD-HERETIC-Uncensored-NVFP4-SVDQuant
Text Generation
• 19B • Updated • 397
• 2
blckrvrfx/edge-multimodal-embeddings
Feature Extraction
• Updated Text Generation
• 7B • Updated • 22
Text Generation
• Updated divinesouljoy/Vedic-Native-SLM-1.1B
Updated
divinesouljoy/Vedic-SLM-25M
Updated
mradermacher/Gemma-4-12B-it-AEON-Abliterated-K4-BF16-GGUF
Text Generation
• 12B • Updated • 1.9k