Update README.md
Browse files
README.md
CHANGED
|
@@ -22,7 +22,7 @@ The model excels in domains governed by strict formal rules, precise terminology
|
|
| 22 |
|
| 23 |
## Model Description
|
| 24 |
|
| 25 |
-
SteuerLLM is based on an expanded Mistral Small architecture (extended from 24B to 28B parameters through a block expansion method). It was trained on a large-scale synthetic dataset generated from
|
| 26 |
|
| 27 |
The training procedure follows a two-stage approach:
|
| 28 |
1. **Continual Pretraining:** The base model's representations are adapted to tax-specific terminology and concepts by pretraining on domain-filtered web data.
|
|
|
|
| 22 |
|
| 23 |
## Model Description
|
| 24 |
|
| 25 |
+
SteuerLLM is based on an expanded Mistral Small architecture (extended from 24B to 28B parameters through a block expansion method). It was trained on a large-scale synthetic dataset generated from seed questions out of examination material using an automated retrieval-augmented pipeline (websearch).
|
| 26 |
|
| 27 |
The training procedure follows a two-stage approach:
|
| 28 |
1. **Continual Pretraining:** The base model's representations are adapted to tax-specific terminology and concepts by pretraining on domain-filtered web data.
|