Llama.cpp imatrix Quantizations of Qwen3-Next-80B-A3B-Thinking by Qwen

Using llama.cpp release b7206 for quantization.

Original model: https://huggingface.co/Qwen/Qwen3-Next-80B-A3B-Thinking

All quants were made using the imatrix option with the dataset from here
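For reference, the usual imatrix quantization flow with llama.cpp looks roughly like the sketch below. File names and the calibration-dataset path are assumptions for illustration; the commands are printed with `echo` here rather than executed, since the real run needs the llama.cpp binaries and a full-precision GGUF.

```shell
# Hypothetical file names -- substitute your own converted model and dataset.
F16=Qwen3-Next-80B-A3B-Thinking-F16.gguf       # full-precision GGUF (assumed)
CALIB=calibration.txt                          # imatrix calibration dataset (assumed)
QUANT=Q4_K_M                                   # target quantization type
OUT=Qwen3-Next-80B-A3B-Thinking-${QUANT}.gguf

# 1) Compute the importance matrix over the calibration data.
echo ./llama-imatrix -m "$F16" -f "$CALIB" -o imatrix.dat

# 2) Quantize, weighting tensors by the importance matrix.
echo ./llama-quantize --imatrix imatrix.dat "$F16" "$OUT" "$QUANT"
```

Remove the `echo` prefixes to run the commands against a real llama.cpp build.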

You can run them with recent versions of llama.cpp, or any project based on it.
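A minimal way to run a downloaded quant is `llama-cli` from a recent llama.cpp build. The file name and flag values below are assumptions; adjust them to the quant you downloaded and your hardware. The invocation is printed with `echo` for illustration.

```shell
# Hypothetical quant file name -- use the one you actually downloaded.
MODEL=Qwen3-Next-80B-A3B-Thinking-Q4_K_M.gguf
NGL=99     # layers to offload to GPU; lower this if you run out of VRAM
CTX=8192   # context length

# Interactive chat session (remove 'echo' to execute with a real build):
echo ./llama-cli -m "$MODEL" -ngl "$NGL" -c "$CTX" -cnv
```

The same GGUF file also works with llama-server and other llama.cpp-based frontends.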
