Qwen3-VL-32B-Instruct-SDNQ-uint4-svd-r32 does not work

#1
by KLL1111 - opened

The example doesn't work: quantization is not applied. The model loads unquantized at full size, which naturally doesn't fit in memory, and loading takes forever.
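For a rough sense of the size gap, here is a back-of-the-envelope sketch of the weight memory alone. It assumes about 32 billion parameters (read off the "32B" in the model name) and that the unquantized load uses 16-bit weights; both are assumptions, and the 4-bit figure ignores quantization scales and the SVD rank-32 overhead. The helper name `weight_memory_gib` is made up for illustration.

```python
def weight_memory_gib(num_params: float, bits_per_param: float) -> float:
    """Approximate memory needed for the weights alone, in GiB."""
    return num_params * bits_per_param / 8 / 2**30


# Assumption: roughly 32 billion parameters, taken from "32B" in the model name.
NUM_PARAMS = 32e9

# Assumption: the unquantized load uses 16-bit weights.
full = weight_memory_gib(NUM_PARAMS, 16)

# 4-bit weights, ignoring scales and the SVD low-rank overhead.
quant = weight_memory_gib(NUM_PARAMS, 4)

print(f"16-bit: ~{full:.0f} GiB, 4-bit: ~{quant:.0f} GiB")
```

Under these assumptions the weights come to roughly 60 GiB unquantized versus roughly 15 GiB at 4 bits, which would match a load that only fits when quantization is actually applied.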
