-
-
-
-
-
-
Inference Providers
Active filters:
8-bit
GadflyII/GLM-4.7-Flash-NVFP4
Text Generation
•
18B
•
Updated
•
88.6k
•
32
Text Generation
•
120B
•
Updated
•
3.07M
•
•
4.37k
Text Generation
•
22B
•
Updated
•
6.67M
•
•
4.24k
mlx-community/GLM-4.7-Flash-8bit
Text Generation
•
30B
•
Updated
•
2.65k
•
13
microsoft/bitnet-b1.58-2B-4T
Text Generation
•
0.8B
•
Updated
•
5.75k
•
1.26k
AlicanKiraz0/Mihenk-LLM-14B-Turkish-Financial-Model-mlx-8Bit
15B
•
Updated
•
24
•
5
MultiverseComputingCAI/HyperNova-60B
Text Generation
•
60B
•
Updated
•
1.29k
•
45
mlx-community/GLM-4.7-Flash-8bit-gs32
Text Generation
•
30B
•
Updated
•
273
•
4
NVFP4/Qwen3-Coder-30B-A3B-Instruct-FP4
Text Generation
•
16B
•
Updated
•
3.14k
•
6
nvidia/Qwen2.5-VL-7B-Instruct-NVFP4
Text Generation
•
5B
•
Updated
•
3.08k
•
12
huihui-ai/Huihui-gpt-oss-20b-mxfp4-abliterated-v2
Text Generation
•
21B
•
Updated
•
2.39k
•
16
nvidia/DeepSeek-V3.2-NVFP4
Text Generation
•
394B
•
Updated
•
326
•
3
LiquidAI/LFM2.5-1.2B-Thinking-MLX-8bit
Text Generation
•
0.3B
•
Updated
•
130
•
3
MaziyarPanahi/Mistral-7B-Instruct-Aya-101-GGUF
Text Generation
•
7B
•
Updated
•
199
•
12
MaziyarPanahi/Saul-Instruct-v1-GGUF
Text Generation
•
7B
•
Updated
•
216
•
10
ragraph-ai/stable-cypher-instruct-3b
Text Generation
•
3B
•
Updated
•
339
•
31
tiiuae/Falcon-E-3B-Instruct
Text Generation
•
0.9B
•
Updated
•
430
•
36
Text Generation
•
5B
•
Updated
•
4.61k
•
12
openai/gpt-oss-safeguard-20b
Text Generation
•
22B
•
Updated
•
10.5k
•
•
180
Disty0/Z-Image-Turbo-SDNQ-int8
Text-to-Image
•
Updated
•
2.15k
•
16
Firworks/NVIDIA-Nemotron-3-Nano-30B-A3B-nvfp4
18B
•
Updated
•
2k
•
7
Text Generation
•
177B
•
Updated
•
4.85k
•
9
Tengyunw/MiniMax-M2.1-NVFP4
Text Generation
•
115B
•
Updated
•
153
•
6
Octen/Octen-Embedding-8B-INT8
Sentence Similarity
•
8B
•
Updated
•
56
•
3
mlx-community/translategemma-27b-it-8bit
Text Generation
•
27B
•
Updated
•
882
•
3
nightmedia/Qwen3-32B-Element5-Heretic-qx86-hi-mlx
Text Generation
•
33B
•
Updated
•
179
•
2
lmstudio-community/GLM-4.7-Flash-MLX-8bit
Text Generation
•
30B
•
Updated
•
145k
•
2
mlx-community/Qwen3-TTS-12Hz-0.6B-CustomVoice-8bit
Text-to-Speech
•
0.3B
•
Updated
•
35
•
2
Undi95/Mistral-7B-roleplay_alpaca-lora
Text Generation
•
Updated
•
22
•
10
MaziyarPanahi/BioMistral-7B-GGUF
Text Generation
•
7B
•
Updated
•
1.21k
•
56