Update README.md
Browse files
README.md
CHANGED
|
@@ -211,12 +211,12 @@ python -m sglang.launch_server --model-path HuggingFaceTB/SmolLM3-3B
|
|
| 211 |
#### vLLM
|
| 212 |
|
| 213 |
```bash
|
| 214 |
-
vllm serve HuggingFaceTB/SmolLM3-3B
|
| 215 |
```
|
| 216 |
|
| 217 |
#### Setting `chat_template_kwargs`
|
| 218 |
|
| 219 |
-
You can specify `chat_template_kwargs` such as `enable_thinking`
|
| 220 |
|
| 221 |
```bash
|
| 222 |
curl http://localhost:8000/v1/chat/completions -H "Content-Type: application/json" -d '{
|
|
|
|
| 211 |
#### vLLM
|
| 212 |
|
| 213 |
```bash
|
| 214 |
+
vllm serve HuggingFaceTB/SmolLM3-3B --enable-auto-tool-choice --tool-call-parser=hermes
|
| 215 |
```
|
| 216 |
|
| 217 |
#### Setting `chat_template_kwargs`
|
| 218 |
|
| 219 |
+
You can specify `chat_template_kwargs` such as `enable_thinking` to a deployed model by passing the `chat_template_kwargs` parameter in the API request.
|
| 220 |
|
| 221 |
```bash
|
| 222 |
curl http://localhost:8000/v1/chat/completions -H "Content-Type: application/json" -d '{
|