catalystsec
/

MiniMax-M2.5-3bit-DWQ

Text Generation

Model card Files Files and versions

kernelpool commited on 2 days ago

Commit

a5b85ce

·

verified ·

1 Parent(s): 37cf26d

Update README.md

Files changed (1) hide show

README.md +11 -0

README.md CHANGED Viewed

@@ -23,6 +23,17 @@ This model was quantized to 3-bit using DWQ with mlx-lm version **0.30.7**.
 | Relative KL reduction     | ≈40 %                          |
 | Tokens processed          | ≈1.11 M                        |
 ## Use with mlx
 ```bash

 | Relative KL reduction     | ≈40 %                          |
 | Tokens processed          | ≈1.11 M                        |
+## Perplexity
+Evaluated on 210 samples of 512 tokens from the default mlx-lm calibration data.
+| Model | Perplexity |
+|-------|-----------|
+| 3-bit | 7.802 |
+| 3-bit DWQ | **7.434** |
+| 4-bit | 6.581 |
+| 4-bit DWQ | 6.431 |
 ## Use with mlx
 ```bash