What framework was used to quantize Qwen3-VL? Is it llmcompressor?
#1
by
qqescac
- opened
I can't successfully quantize Qwen3-VL-8B using llmcompressor. The error seems to point to its Visual Encoder. Is it because llmcompressor hasn't been adapted to Qwen3-VL yet? What framework should I use for convenient quantization?
I can't successfully quantize Qwen3-VL-8B using llmcompressor. The error seems to point to its Visual Encoder. Is it because llmcompressor hasn't been adapted to Qwen3-VL yet? What framework should I use for convenient quantization?
You can check the recipe.yaml in this repo and it actually skips the quantization of visual parts. This is default for quantizing vision models.