Update README.md
Browse files
README.md
CHANGED
|
@@ -33,7 +33,7 @@ You can either perform the dequantization manually using this [conversion script
|
|
| 33 |
**Quantization scripts:**
|
| 34 |
```
|
| 35 |
cd Quark/examples/torch/language_modeling/llm_ptq/
|
| 36 |
-
python3 quantize_quark.py --model_dir
|
| 37 |
--quant_scheme w_mxfp4_a_mxfp4 \
|
| 38 |
--group_size 32 \
|
| 39 |
--num_calib_data 128 \
|
|
|
|
| 33 |
**Quantization scripts:**
|
| 34 |
```
|
| 35 |
cd Quark/examples/torch/language_modeling/llm_ptq/
|
| 36 |
+
python3 quantize_quark.py --model_dir $MODEL_DIR \
|
| 37 |
--quant_scheme w_mxfp4_a_mxfp4 \
|
| 38 |
--group_size 32 \
|
| 39 |
--num_calib_data 128 \
|