Upload complete model

README.md CHANGED

@@ -22,20 +22,22 @@ pipeline_tag: text-generation
| **q8.5** | 1.128 |

## Usage Notes

#### M3 Ultra 512GB RAM using [Inferencer app v1.7.3](https://inferencer.com)
* Expect ~16.5 tokens/s @ 1000 tokens
* Memory usage: ~450 GB
* For a larger context window (11k tokens) you can expand the RAM limit:
```bash
sudo sysctl iogpu.wired_limit_mb=507000
```
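
The `iogpu.wired_limit_mb` sysctl caps (in MB) how much unified memory macOS will wire for the GPU, so 507000 leaves only a few GB of the 512 GB for the system. The setting normally does not survive a reboot; a quick sketch of checking and restoring the default on recent Apple Silicon macOS:

```bash
# Read the current GPU wired-memory limit (0 means macOS uses its default cap)
sysctl iogpu.wired_limit_mb

# Restore the default cap once you are done
sudo sysctl iogpu.wired_limit_mb=0
```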

#### M3 Ultra 512GB RAM connected to MBP 128GB RAM using [Inferencer app v1.7.3](https://inferencer.com) with LAN distributed compute
* Expect ~13.7 tokens/s @ 1000 tokens
* Memory usage: MBP ~20GB + Mac Studio ~430GB (v1.7.4 will add support for dynamic memory splits)
* More RAM is available for a larger context window with this method

##### Quantized with a modified version of [MLX](https://github.com/ml-explore/mlx) 0.28
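
Because the q8.5 weights come from a modified MLX build, the stock toolchain may not reproduce (or even load) them. Purely as a reference point, a plain 8-bit MLX quantization with the upstream `mlx_lm` CLI looks roughly like the sketch below (output path and group size are illustrative, not the recipe used for this repo).

```bash
# Reference only: upstream mlx-lm 8-bit quantization. This repo used a modified
# MLX 0.28 with a q8.5 recipe that these stock flags do not reproduce, and stock
# mlx-lm may not yet support the DeepSeek-V3.2 architecture.
pip install mlx-lm
mlx_lm.convert --hf-path deepseek-ai/DeepSeek-V3.2-Speciale \
  --mlx-path ./DeepSeek-V3.2-Speciale-8bit-mlx \
  -q --q-bits 8 --q-group-size 64
```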

##### For more details see [demonstration video - coming soon](https://youtu.be/b6RgBIROK5o) or visit [DeepSeek-V3.2-Speciale](https://huggingface.co/deepseek-ai/DeepSeek-V3.2-Speciale).

## Disclaimer