Active filters: cuda
ussoewwin/Flash-Attention-2_for_Windows
Updated
•
65
aydin99/FLUX.2-klein-4B-int8
Text-to-Image
•
Updated
•
343
•
5
dougeeai/llama-cpp-python-wheels
JusteLeo/Nunchaku-Zimage-Win-Wheels
Other
•
Updated
•
1
Text Generation
•
Updated
•
8
•
23
CalderaAI/13B-Ouroboros-GPTQ4bit-128g-CUDA
Text Generation
•
Updated
•
1
marcorez8/llama-cpp-python-windows-blackwell-cuda
ValiantLabs/Qwen3-8B-ShiningValiant3
Text Generation
•
8B
•
Updated
•
19
•
3
mradermacher/Qwen3-8B-ShiningValiant3-GGUF
8B
•
Updated
•
335
•
2
mradermacher/Qwen3-8B-ShiningValiant3-i1-GGUF
8B
•
Updated
•
618
•
2
ValiantLabs/Qwen3-1.7B-ShiningValiant3
Text Generation
•
2B
•
Updated
•
11
•
5
Triangle104/Qwen3-8B-ShiningValiant3-Q4_K_S-GGUF
Text Generation
•
8B
•
Updated
•
4
Triangle104/Qwen3-8B-ShiningValiant3-Q4_K_M-GGUF
Text Generation
•
8B
•
Updated
•
4
Triangle104/Qwen3-8B-ShiningValiant3-Q5_K_S-GGUF
Text Generation
•
8B
•
Updated
•
7
Triangle104/Qwen3-8B-ShiningValiant3-Q5_K_M-GGUF
Text Generation
•
8B
•
Updated
•
3
Triangle104/Qwen3-8B-ShiningValiant3-Q6_K-GGUF
Text Generation
•
8B
•
Updated
•
2
Triangle104/Qwen3-8B-ShiningValiant3-Q8_0-GGUF
Text Generation
•
8B
•
Updated
•
10
Triangle104/Qwen3-1.7B-ShiningValiant3-Q4_K_S-GGUF
Text Generation
•
2B
•
Updated
•
2
Triangle104/Qwen3-1.7B-ShiningValiant3-Q4_K_M-GGUF
Text Generation
•
2B
•
Updated
•
3
Triangle104/Qwen3-1.7B-ShiningValiant3-Q5_K_S-GGUF
Text Generation
•
2B
•
Updated
•
2
Triangle104/Qwen3-1.7B-ShiningValiant3-Q5_K_M-GGUF
Text Generation
•
2B
•
Updated
•
12
Triangle104/Qwen3-1.7B-ShiningValiant3-Q6_K-GGUF
Text Generation
•
2B
•
Updated
•
2
Triangle104/Qwen3-1.7B-ShiningValiant3-Q8_0-GGUF
Text Generation
•
2B
•
Updated
•
4
mradermacher/Qwen3-1.7B-ShiningValiant3-GGUF
2B
•
Updated
•
164
mradermacher/Qwen3-1.7B-ShiningValiant3-i1-GGUF
2B
•
Updated
•
320
ValiantLabs/Qwen3-4B-ShiningValiant3
Text Generation
•
4B
•
Updated
•
12
•
7
sequelbox/Qwen3-8B-PlumEsper
Text Generation
•
8B
•
Updated
•
8
sequelbox/Qwen3-4B-PlumEsper
Text Generation
•
4B
•
Updated
•
8