What framework was used to quantize Qwen3-VL? Is it llmcompressor?

by qqescac - opened 1 day ago

1 day ago

I can't successfully quantize Qwen3-VL-8B using llmcompressor. The error seems to point to its Visual Encoder. Is it because llmcompressor hasn't been adapted to Qwen3-VL yet? What framework should I use for convenient quantization?

cpatonn

Owner about 18 hours ago

Hi @qqescac ,I used llmcompressor to quantize this model. Perhaps, it was due to your transformers version as I remember the similars occured to me too and it stopped happening after upgrading transformers.

CHNtentes

about 17 hours ago

•

edited about 17 hours ago

I can't successfully quantize Qwen3-VL-8B using llmcompressor. The error seems to point to its Visual Encoder. Is it because llmcompressor hasn't been adapted to Qwen3-VL yet? What framework should I use for convenient quantization?

You can check the recipe.yaml in this repo and it actually skips the quantization of visual parts. This is default for quantizing vision models.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment