Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
254.6
TFLOPS
21
GadflyII
GadflyII
Follow
xzcho's profile picture
markrizkallah's profile picture
Michalea's profile picture
26 followers
·
1 following
AI & ML interests
None yet
Recent Activity
new
activity
6 days ago
GadflyII/GLM-4.7-Flash-MTP-NVFP4:
SGLang and MTP
new
activity
19 days ago
GadflyII/Qwen3-Coder-Next-NVFP4:
Model requests?
new
activity
19 days ago
GadflyII/Qwen3-Coder-Next-NVFP4:
Why Your NVFP4 Model Is Slower Than FP8 on the GB10 (NVIDIA Spark) — And How to Fix It
View all activity
Organizations
GadflyII
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
New activity in
GadflyII/GLM-4.7-Flash-MTP-NVFP4
6 days ago
SGLang and MTP
1
#2 opened 18 days ago by
Michalea
New activity in
GadflyII/Qwen3-Coder-Next-NVFP4
19 days ago
Model requests?
12
#4 opened about 1 month ago by
pathosethoslogos
Why Your NVFP4 Model Is Slower Than FP8 on the GB10 (NVIDIA Spark) — And How to Fix It
👍
1
1
#5 opened 28 days ago by
scottgl
New activity in
GadflyII/GLM-4.6V-NVFP4
19 days ago
Fails on a single DGX spark with errors below
1
#2 opened 24 days ago by
Adrian1234
New activity in
GadflyII/GLM-4.7-Flash-MXFP4
about 1 month ago
Update MXFP4 format to compressed-tensors
1
#3 opened about 1 month ago by
mgoin
New activity in
lukealonso/MiniMax-M2.5-NVFP4
about 1 month ago
Here's the vLLM recipe I'm using with 2x RTX Pro 6000
👍
3
17
#1 opened about 1 month ago by
zenmagnets
New activity in
GadflyII/Qwen3-Coder-Next-NVFP4
about 1 month ago
MMLU PRO Benchmark
3
#3 opened about 1 month ago by
sevapru
vLLM 0.16?
1
#2 opened about 1 month ago by
MMaxHugg
New activity in
GadflyII/Qwen3-Coder-Next-NVFP4
about 2 months ago
Memory
1
#1 opened about 2 months ago by
struxx
New activity in
GadflyII/GLM-4.7-Flash-NVFP4
about 2 months ago
confused response
7
#8 opened about 2 months ago by
jiangyizhi
MTP quality, 47 layer
3
#7 opened about 2 months ago by
Michalea
New activity in
GadflyII/GLM-4.7-Flash-MTP-NVFP4
about 2 months ago
Upload folder using huggingface_hub
#1 opened about 2 months ago by
GadflyII
New activity in
GadflyII/GLM-4.6V-NVFP4
about 2 months ago
Well done nvfp4 quant
1
#1 opened about 2 months ago by
josephbreda
New activity in
GadflyII/GLM-4.7-Flash-NVFP4
about 2 months ago
Can't deploy by vllm 0.14.1 + transformers
8
#6 opened about 2 months ago by
Butterfly-314
New activity in
GadflyII/GLM-4.7-Flash-MXFP4
about 2 months ago
can not run
4
#1 opened about 2 months ago by
aliez-ren
New activity in
GadflyII/GLM-4.7-Flash-NVFP4
about 2 months ago
please create mlx version of this
3
#4 opened about 2 months ago by
Narutoouz
Wasn't able to recreate MMLU-Pro benchmarks
5
#5 opened about 2 months ago by
zenmagnets
New activity in
GadflyII/MiniMax-M2.1-NVFP4
about 2 months ago
Request for GLM 4.6V
3
#1 opened 3 months ago by
SFPLM
New activity in
GadflyII/GLM-4.7-Flash-NVFP4
about 2 months ago
GadflyII/GLM-4.7-Flash-NVFP4
15
#3 opened 2 months ago by
Yu21342
New activity in
GadflyII/GLM-4.7-Flash-NVFP4
2 months ago
Really appreciate that you ran performance comparison tests with BF16!
3
#2 opened 2 months ago by
zenmagnets
Load more