Update README.md
README.md CHANGED

```diff
@@ -33,12 +33,10 @@ pipeline_tag: text-generation
 The Qwen3-Coder-30B model shows specialized strength in mathematical reasoning tasks (GSM-8K) where it significantly outperforms the larger models. However, it lags behind the 235B models in general knowledge (MMLU), commonsense reasoning (Hellaswag), and pronoun disambiguation (Winogrande). This suggests the 30B model may have been optimized specifically for coding and mathematical tasks at the expense of some general capabilities.
 
 
+-----
 
 
 # Qwen3-Coder-30B-A3B-Instruct
-<a href="https://chat.qwen.ai/" target="_blank" style="margin: 2px;">
-    <img alt="Chat" src="https://img.shields.io/badge/%F0%9F%92%9C%EF%B8%8F%20Qwen%20Chat%20-536af5" style="display: inline-block; vertical-align: middle;"/>
-</a>
 
 ## Highlights
 
```