kernelpool committed on
Commit a5b85ce · verified · 1 Parent(s): 37cf26d

Update README.md

Files changed (1)
  1. README.md +11 -0
README.md CHANGED
@@ -23,6 +23,17 @@ This model was quantized to 3-bit using DWQ with mlx-lm version **0.30.7**.
  | Relative KL reduction | ≈40 % |
  | Tokens processed | ≈1.11 M |

+ ## Perplexity
+
+ Evaluated on 210 samples of 512 tokens from the default mlx-lm calibration data.
+
+ | Model | Perplexity |
+ |-------|-----------|
+ | 3-bit | 7.802 |
+ | 3-bit DWQ | **7.434** |
+ | 4-bit | 6.581 |
+ | 4-bit DWQ | 6.431 |
+
  ## Use with mlx

  ```bash