Qwen3-Next-80B-A3B-Instruct Quantized Models
Updated about 24 hours ago
FP8-dynamic, FP8-block, NVFP4, INT4, and INT8 quantized versions of Qwen3-Next-80B-A3B-Instruct
inference-optimization/Qwen3-Next-80B-A3B-Instruct-quantized.w8a8
Updated about 24 hours ago
inference-optimization/Qwen3-Next-80B-A3B-Instruct-quantized.w4a16
Updated about 24 hours ago
inference-optimization/Qwen3-Next-80B-A3B-Instruct-FP8-block
Updated about 24 hours ago
inference-optimization/Qwen3-Next-80B-A3B-Instruct-FP8-dynamic
Updated about 24 hours ago
inference-optimization/Qwen3-Next-80B-A3B-Instruct-NVFP4
Updated about 24 hours ago
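
The repository names above appear to encode the quantization scheme in their suffix. As a rough sketch, the following maps each listed repository ID to the scheme label used in the collection description. The suffix-to-label table is an assumption inferred from the names (w8a8 taken to mean INT8, w4a16 taken to mean INT4), and `scheme_for` is a hypothetical helper, not part of any Hugging Face API.

```python
# Repository IDs copied from the collection listing.
REPO_IDS = [
    "inference-optimization/Qwen3-Next-80B-A3B-Instruct-quantized.w8a8",
    "inference-optimization/Qwen3-Next-80B-A3B-Instruct-quantized.w4a16",
    "inference-optimization/Qwen3-Next-80B-A3B-Instruct-FP8-block",
    "inference-optimization/Qwen3-Next-80B-A3B-Instruct-FP8-dynamic",
    "inference-optimization/Qwen3-Next-80B-A3B-Instruct-NVFP4",
]

# Common prefix shared by every model name in the collection.
BASE_PREFIX = "Qwen3-Next-80B-A3B-Instruct-"

# ASSUMPTION: mapping inferred from the names and the collection
# description; the model cards are the authoritative source.
SUFFIX_TO_SCHEME = {
    "quantized.w8a8": "INT8",
    "quantized.w4a16": "INT4",
    "FP8-block": "FP8-block",
    "FP8-dynamic": "FP8-dynamic",
    "NVFP4": "NVFP4",
}


def scheme_for(repo_id: str) -> str:
    """Return the assumed scheme label for a repo ID from this collection."""
    # Drop the "owner/" part, then the shared base-model prefix.
    model_name = repo_id.split("/", 1)[1]
    if not model_name.startswith(BASE_PREFIX):
        raise ValueError(f"unexpected model name: {model_name}")
    suffix = model_name[len(BASE_PREFIX):]
    return SUFFIX_TO_SCHEME[suffix]


if __name__ == "__main__":
    for repo_id in REPO_IDS:
        print(f"{scheme_for(repo_id):12s} {repo_id}")
```

A lookup table is used rather than parsing bit widths out of the suffix, since the naming is not uniform across the five repositories.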