Active filters: ModelOpt
nvidia/Qwen3-Next-80B-A3B-Thinking-NVFP4 • Text Generation • Updated • 12.3k • 21
nvidia/Qwen3-Next-80B-A3B-Instruct-NVFP4 • Text Generation • Updated • 13.9k • 15
NVFP4/Qwen3-Coder-30B-A3B-Instruct-FP4 • Text Generation • 16B • Updated • 3.31k • 6
nvidia/DeepSeek-V3.2-NVFP4 • Text Generation • 394B • Updated • 947 • 3
nvidia/Qwen3-30B-A3B-NVFP4 • Text Generation • 16B • Updated • 32.8k • 21
Text Generation • 5B • Updated • 4.97k • 12
nvidia/Qwen2.5-VL-7B-Instruct-NVFP4 • Text Generation • 5B • Updated • 3.13k • 12
nvidia/Kimi-K2-Thinking-NVFP4 • Text Generation • Updated • 29.1k • 19
nvidia/Qwen3-235B-A22B-NVFP4 • Text Generation • 133B • Updated • 4.9k • 12
nvidia/gpt-oss-120b-Eagle3-short-context • Text Generation • Updated • 2.62k • 12
nvidia/DeepSeek-V3-0324-NVFP4 • Text Generation • 397B • Updated • 66.4k • 14
nvidia/DeepSeek-R1-0528-NVFP4 • Text Generation • 397B • Updated • 3.87k • 40
NVFP4/DeepSeek-Prover-V2-7B-FP4 • 4B • Updated • 428 • 1
NVFP4/DeepSeek-R1-0528-Qwen3-8B-FP4 • 5B • Updated • 40 • 1
Text Generation • 19B • Updated • 327 • 4
NVFP4/Polaris-4B-Preview-FP4 • 2B • Updated • 1
NVFP4/Polaris-7B-Preview-FP4 • 5B • Updated • 3 • 1
nvidia/Qwen3-235B-A22B-FP8 • Text Generation • 235B • Updated • 2.08k • 3
tachyphylaxis/DeepSeek-R1-0528-FP4 • Text Generation • 397B • Updated • 2
nvidia/DeepSeek-R1-0528-NVFP4-v2 • Text Generation • 394B • Updated • 92.8k • 11
nvidia/DeepSeek-R1-NVFP4-v2 • Text Generation • 394B • Updated • 3.4k • 5
NVFP4/Qwen3-235B-A22B-Instruct-2507-FP4 • Text Generation • 118B • Updated • 1.24k • 3
NVFP4/Qwen3-Coder-480B-A35B-Instruct-FP4 • Text Generation • 241B • Updated • 168 • 2
nvidia/Qwen3-235B-A22B-Eagle3 • Text Generation • 0.3B • Updated • 1.78k • 9
NVFP4/Qwen3-235B-A22B-Thinking-2507-FP4 • Text Generation • 118B • Updated • 160 • 2
BitPhinix/DeepSeek-V3-0324-FP4 • Text Generation • 397B • Updated • 1
NVFP4/Qwen3-30B-A3B-Instruct-2507-FP4 • Text Generation • 16B • Updated • 1.15k • 11
NVFP4/Qwen3-30B-A3B-Thinking-2507-FP4 • Text Generation • 16B • Updated • 739 • 4
Text Generation • 0.4B • Updated • 77
nvidia/gpt-oss-120b-Eagle3-long-context • Text Generation • 0.2B • Updated • 3.46k • 54