Update README.md
Browse files
README.md
CHANGED
|
@@ -18,6 +18,12 @@ pipeline_tag: image-text-to-text
|
|
| 18 |
|
| 19 |
Over recent months, we have intensified our focus on developing foundation models that deliver exceptional utility and performance. Qwen3.5 represents a significant leap forward, integrating breakthroughs in multimodal learning, architectural efficiency, reinforcement learning scale, and global accessibility to empower developers and enterprises with unprecedented capability and efficiency.
|
| 20 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 21 |
## Qwen3.5 Highlights
|
| 22 |
|
| 23 |
Qwen3.5 features the following enhancement:
|
|
@@ -938,12 +944,6 @@ In the following, we show example commands to launch OpenAI-Compatible API serve
|
|
| 938 |
> If you encounter out-of-memory (OOM) errors, consider reducing the context window.
|
| 939 |
> However, because Qwen3.5 leverages extended context for complex tasks, we advise maintaining a context length of at least 128K tokens to preserve thinking capabilities.
|
| 940 |
|
| 941 |
-
> [!Tip]
|
| 942 |
-
> For users seeking managed, scalable inference without infrastructure maintenance, the official Qwen API service is provided by [Alibaba Cloud Model Studio](https://modelstudio.alibabacloud.com/).
|
| 943 |
-
>
|
| 944 |
-
> In particular, **Qwen3.5-Plus** is the hosted version corresponding to Qwen3.5-397B-A17B with more production features, e.g., 1M context length by default, official built-in tools, and adaptive tool use.
|
| 945 |
-
> For more information, please refer to the [User Guide](https://www.alibabacloud.com/help/en/model-studio/text-generation).
|
| 946 |
-
|
| 947 |
#### SGLang
|
| 948 |
|
| 949 |
[SGLang](https://github.com/sgl-project/sglang) is a fast serving framework for large language models and vision language models.
|
|
|
|
| 18 |
|
| 19 |
Over recent months, we have intensified our focus on developing foundation models that deliver exceptional utility and performance. Qwen3.5 represents a significant leap forward, integrating breakthroughs in multimodal learning, architectural efficiency, reinforcement learning scale, and global accessibility to empower developers and enterprises with unprecedented capability and efficiency.
|
| 20 |
|
| 21 |
+
> [!Tip]
|
| 22 |
+
> For users seeking managed, scalable inference without infrastructure maintenance, the official Qwen API service is provided by [Alibaba Cloud Model Studio](https://modelstudio.alibabacloud.com/).
|
| 23 |
+
>
|
| 24 |
+
> In particular, **Qwen3.5-Plus** is the hosted version corresponding to Qwen3.5-397B-A17B with more production features, e.g., 1M context length by default, official built-in tools, and adaptive tool use.
|
| 25 |
+
> For more information, please refer to the [User Guide](https://www.alibabacloud.com/help/en/model-studio/text-generation).
|
| 26 |
+
|
| 27 |
## Qwen3.5 Highlights
|
| 28 |
|
| 29 |
Qwen3.5 features the following enhancement:
|
|
|
|
| 944 |
> If you encounter out-of-memory (OOM) errors, consider reducing the context window.
|
| 945 |
> However, because Qwen3.5 leverages extended context for complex tasks, we advise maintaining a context length of at least 128K tokens to preserve thinking capabilities.
|
| 946 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 947 |
#### SGLang
|
| 948 |
|
| 949 |
[SGLang](https://github.com/sgl-project/sglang) is a fast serving framework for large language models and vision language models.
|