ISTA-DASLab/Llama-3.1-8B-Instruct-MR-GPTQ-mxfp
Image-Text-to-Text • 5B • Updated • 7
WUSH: Near-Optimal Adaptive Transforms for LLM Quantization
CAGE: Curvature-Aware Gradient Estimation For Accurate Quantization-Aware Training