Llama.cpp imatrix Quantizations of Qwen3-Next-80B-A3B-Instruct by Qwen

Using llama.cpp release b7206 for quantization.

Original model: https://huggingface.co/Qwen/Qwen3-Next-80B-A3B-Instruct

All quants were made using the imatrix option with a calibration dataset from here
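For reference, the general imatrix workflow with llama.cpp looks roughly like the following. This is a sketch, not the exact commands used for this repo: the file names, the calibration file, and the chosen quant type (Q4_K_M) are placeholders.

```shell
# 1) Compute an importance matrix from a calibration text file
#    (model path and calibration file are placeholders)
llama-imatrix -m Qwen3-Next-80B-A3B-Instruct-f16.gguf \
    -f calibration-data.txt \
    -o imatrix.dat

# 2) Quantize using that imatrix; Q4_K_M is one example target type
llama-quantize --imatrix imatrix.dat \
    Qwen3-Next-80B-A3B-Instruct-f16.gguf \
    Qwen3-Next-80B-A3B-Instruct-Q4_K_M.gguf \
    Q4_K_M
```

The imatrix weights the quantization error by how strongly each tensor element is activated on the calibration data, which typically improves quality at low bit widths compared to plain quantization.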

You can use them with a recent version of llama.cpp, or any project based on it.
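As a minimal usage sketch, you can fetch a quant and run it with `llama-cli`. The exact quant filename below is an assumption; substitute the file you actually download from this repo.

```shell
# Download one quant file from this repo (filename is a placeholder)
huggingface-cli download essigpeng/Qwen3-Next-80B-A3B-Instruct-GGUF \
    Qwen3-Next-80B-A3B-Instruct-Q4_K_M.gguf --local-dir .

# Run an interactive chat session with llama.cpp
llama-cli -m Qwen3-Next-80B-A3B-Instruct-Q4_K_M.gguf \
    -cnv -ngl 99
```

`-ngl 99` offloads as many layers as possible to the GPU; drop it (or lower the number) for CPU-only or memory-constrained setups.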

Model tree for essigpeng/Qwen3-Next-80B-A3B-Instruct-GGUF

Quantized
(65)
this model