Update README.md (#26)

- Update README.md (bfa49d6b2a0d67790db60e8493bcfc8e1e10c58b)

Co-authored-by: Sanchit Gandhi <sanchit-gandhi@users.noreply.huggingface.co>

README.md

@@ -80,7 +80,7 @@ We recommend deploying with the following best practices:
   For the best user experience, we recommend to simply instantiate vLLM with the default parameters which will automatically set a maximum model length of 131072 (~ca. 3h).
 - We strongly recommend using websockets to set up audio streaming sessions. For more info on how to do so, check [Usage](#usage).
 - We recommend using a delay of 480ms as we found it to be the sweet spot of performance and low latency. If, however, you want to adapt the delay, you can change the `"transcription_delay_ms": 480` parameter
-  in the [tekken.json](https://huggingface.co/mistralai/Voxtral-Mini-4B-Realtime-2602/blob/main/params.json) file to any multiple of 80ms between 80 and 2400.
+  in the [tekken.json](https://huggingface.co/mistralai/Voxtral-Mini-4B-Realtime-2602/blob/main/params.json) file to any multiple of 80ms between 80 and 1200, as well as 2400 as a standalone value.
 ## Benchmark Results
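
The constraint introduced by the `+` line can be sketched as a small check. This is only an illustration of the rule as worded in the diff (multiples of 80 ms from 80 to 1200, plus 2400 as a standalone value). The function name `is_valid_transcription_delay_ms` is hypothetical and is not part of vLLM or the model repository.

```python
def is_valid_transcription_delay_ms(delay_ms: int) -> bool:
    """Return True if delay_ms satisfies the rule stated in the updated README.

    Hypothetical helper: it encodes only the wording of the diff, not any
    validation the model or vLLM may perform internally.
    """
    # 2400 ms is allowed as a standalone value.
    if delay_ms == 2400:
        return True
    # Otherwise the delay must be a multiple of 80 ms between 80 and 1200.
    return 80 <= delay_ms <= 1200 and delay_ms % 80 == 0


if __name__ == "__main__":
    for candidate in (80, 480, 1200, 1280, 2400, 100):
        print(candidate, is_valid_transcription_delay_ms(candidate))
```

Under the previous wording, a value such as 1280 would have been accepted; under the updated wording it is rejected, while the recommended default of 480 remains valid.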