Text Generation
Transformers
Safetensors
German
mistral
conversational
text-generation-inference
windprak commited on
Commit
ba55074
·
verified ·
1 Parent(s): 6817d2d

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -22,7 +22,7 @@ The model excels in domains governed by strict formal rules, precise terminology
22
 
23
  ## Model Description
24
 
25
- SteuerLLM is based on an expanded Mistral Small architecture (extended from 24B to 28B parameters through a block expansion method). It was trained on a large-scale synthetic dataset generated from authentic German university tax law examination material using a controlled retrieval-augmented pipeline.
26
 
27
  The training procedure follows a two-stage approach:
28
  1. **Continual Pretraining:** The base model's representations are adapted to tax-specific terminology and concepts by pretraining on domain-filtered web data.
 
22
 
23
  ## Model Description
24
 
25
+ SteuerLLM is based on an expanded Mistral Small architecture (extended from 24B to 28B parameters through a block expansion method). It was trained on a large-scale synthetic dataset generated from seed questions out of examination material using an automated retrieval-augmented pipeline (websearch).
26
 
27
  The training procedure follows a two-stage approach:
28
  1. **Continual Pretraining:** The base model's representations are adapted to tax-specific terminology and concepts by pretraining on domain-filtered web data.