Qwen3-Next-80B-A3B-Instruct Quantized Models

inference-optimization 's Collections

updated about 24 hours ago

FP8-dynamic, FP8-block, NVFP4, INT4, INT8 versions of Qwen3-Next-80B-A3B-Instruct