Update README.md
Browse files
README.md
CHANGED
|
@@ -205,7 +205,7 @@ otherwise the expert tensors wouldn’t be evenly sharded across GPU devices.</i
|
|
| 205 |
```
|
| 206 |
CONTEXT_LENGTH=32768
|
| 207 |
vllm serve \
|
| 208 |
-
|
| 209 |
--served-model-name MY_MODEL \
|
| 210 |
--enable-auto-tool-choice \
|
| 211 |
--tool-call-parser minimax_m2 \
|
|
|
|
| 205 |
```
|
| 206 |
CONTEXT_LENGTH=32768
|
| 207 |
vllm serve \
|
| 208 |
+
QuantTrio/MiniMax-M2-AWQ \
|
| 209 |
--served-model-name MY_MODEL \
|
| 210 |
--enable-auto-tool-choice \
|
| 211 |
--tool-call-parser minimax_m2 \
|