Update README.md
README.md CHANGED
@@ -33,15 +33,12 @@ This model is part of the Ettin suite - the first collection of paired encoder-o
 - [Encoder Models](#encoder-models)
 - [Decoder Models](#decoder-models)
 - [Cross-Objective Models](#cross-objective-models)
-- [Accessing Training Checkpoints](#accessing-training-checkpoints)
 - [Research Applications](#research-applications)
 - [Training Details](#training-details)
 - [Model Architecture](#model-architecture)
-- [Usage Examples](#usage-examples)
-- [Fine-tuning Examples](#fine-tuning-examples)
 - [Citation](#citation)
 
-##
+## Performance Highlights
 
 ### Encoder Tasks (vs. ModernBERT)
 - **GLUE Average**: 88.9 vs 88.4 (Base), 90.8 vs 90.4 (Large)
@@ -56,7 +53,7 @@ This model is part of the Ettin suite - the first collection of paired encoder-o
 ### Key Finding
 **Architecture-specific advantages persist**: A 400M encoder outperforms a 1B decoder on classification tasks, while a 400M decoder outperforms a 1B encoder on generation tasks.
 
-##
+## Quick Start
 
 ### Installation
 