Commit
·
c889ad6
1
Parent(s):
8617b1a
Update README.md
Browse files
README.md
CHANGED
|
@@ -176,7 +176,33 @@ detect_lang(
|
|
| 176 |
|
| 177 |
## Training procedure
|
| 178 |
|
| 179 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 180 |
|
| 181 |
### Framework versions
|
| 182 |
|
|
|
|
| 176 |
|
| 177 |
## Training procedure
|
| 178 |
|
| 179 |
+
|
| 180 |
+
Fine-tuning was done via the `Trainer` API. Here is the [Jupyter notebook](https://github.com/DominguesM/language-detection/blob/main/Language_Detector_Lora.ipynb) with the training code.
|
| 181 |
+
|
| 182 |
+
### Training hyperparameters
|
| 183 |
+
|
| 184 |
+
The following hyperparameters were used during training:
|
| 185 |
+
|
| 186 |
+
- learning_rate: 2e-05
|
| 187 |
+
- train_batch_size: 64
|
| 188 |
+
- eval_batch_size: 128
|
| 189 |
+
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
|
| 190 |
+
- num_epochs: 2
|
| 191 |
+
|
| 192 |
+
### Training results
|
| 193 |
+
|
| 194 |
+
The validation results on the `valid` split of the Language Identification dataset are summarised here below.
|
| 195 |
+
|
| 196 |
+
| Training Loss | Epoch | Step | Validation Loss | Accuracy | F1 |
|
| 197 |
+
|:-------------:|:-----:|:----:|:---------------:|:--------:|:------:|
|
| 198 |
+
| 1.4403 | 1.0 | 1094 | 0.0591 | 0.9952 | 0.9952 |
|
| 199 |
+
| 0.0256 | 2.0 | 2188 | 0.0272 | 0.9955 | 0.9955 |
|
| 200 |
+
|
| 201 |
+
In short, it achieves the following results on the validation set:
|
| 202 |
+
|
| 203 |
+
- Loss: 0.0298
|
| 204 |
+
- Accuracy: 0.9946
|
| 205 |
+
- F1: 0.9946
|
| 206 |
|
| 207 |
### Framework versions
|
| 208 |
|