Pls MXFP4

by Kirara702 - opened Nov 3

Discussion

Kirara702

Nov 3

Please start including the MXFP4 alongside the other quantized versions.

CHNtentes

Nov 4

Is MXFP4 really better than Q4_K_XL or Q4_K_M? Or it's just because OpenAI used it?

TobDeBer

Nov 4

It's worse than the k quants. But it has native hardware support and is fast and energy efficient.

CHNtentes

Nov 4

It's worse than the k quants. But it has native hardware support and is fast and energy efficient.

So if my gpu is older than blackwell it's not worth using? Only blackwell has native fp4 support right?

danielhanchen

Unsloth AI org about 1 month ago

Good idea we are going to do that pretty soon hopefully for our future quants. The quality will degrade slightly

It's worse than the k quants. But it has native hardware support and is fast and energy efficient.

So if my gpu is older than blackwell it's not worth using? Only blackwell has native fp4 support right?

Also works on old GPUS

CHNtentes

about 1 month ago

Good idea we are going to do that pretty soon hopefully for our future quants. The quality will degrade slightly

It's worse than the k quants. But it has native hardware support and is fast and energy efficient.

So if my gpu is older than blackwell it's not worth using? Only blackwell has native fp4 support right?

Also works on old GPUS

But is the speed much slower than blackwell gpus?

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment