End of training
Browse files- README.md +62 -57
- model.safetensors +1 -1
- runs/May30_07-28-46_145a99fee08a/events.out.tfevents.1717054132.145a99fee08a.992.0 +3 -0
- runs/May30_07-33-18_145a99fee08a/events.out.tfevents.1717054406.145a99fee08a.992.1 +3 -0
- runs/May30_07-33-18_145a99fee08a/events.out.tfevents.1717056242.145a99fee08a.992.2 +3 -0
- training_args.bin +1 -1
README.md
CHANGED
|
@@ -1,11 +1,12 @@
|
|
| 1 |
---
|
| 2 |
license: apache-2.0
|
|
|
|
| 3 |
tags:
|
| 4 |
- generated_from_trainer
|
| 5 |
-
base_model: google/t5-efficient-tiny
|
| 6 |
datasets:
|
| 7 |
- generator
|
| 8 |
metrics:
|
|
|
|
| 9 |
- precision
|
| 10 |
- recall
|
| 11 |
- f1
|
|
@@ -13,8 +14,8 @@ model-index:
|
|
| 13 |
- name: salt_language_Classification
|
| 14 |
results:
|
| 15 |
- task:
|
| 16 |
-
type: text-classification
|
| 17 |
name: Text Classification
|
|
|
|
| 18 |
dataset:
|
| 19 |
name: generator
|
| 20 |
type: generator
|
|
@@ -22,15 +23,18 @@ model-index:
|
|
| 22 |
split: train
|
| 23 |
args: default
|
| 24 |
metrics:
|
| 25 |
-
-
|
| 26 |
-
|
| 27 |
-
|
| 28 |
-
-
|
| 29 |
-
|
| 30 |
-
|
| 31 |
-
-
|
| 32 |
-
|
| 33 |
-
|
|
|
|
|
|
|
|
|
|
| 34 |
---
|
| 35 |
|
| 36 |
<!-- This model card has been generated automatically according to the information the Trainer had access to. You
|
|
@@ -40,10 +44,11 @@ should probably proofread and complete it, then remove this comment. -->
|
|
| 40 |
|
| 41 |
This model is a fine-tuned version of [google/t5-efficient-tiny](https://huggingface.co/google/t5-efficient-tiny) on the generator dataset.
|
| 42 |
It achieves the following results on the evaluation set:
|
| 43 |
-
- Loss: 0.
|
| 44 |
-
-
|
| 45 |
-
-
|
| 46 |
-
-
|
|
|
|
| 47 |
|
| 48 |
## Model description
|
| 49 |
|
|
@@ -73,48 +78,48 @@ The following hyperparameters were used during training:
|
|
| 73 |
|
| 74 |
### Training results
|
| 75 |
|
| 76 |
-
| Training Loss | Epoch | Step | Validation Loss | Precision | Recall | F1
|
| 77 |
-
|
| 78 |
-
| 0.
|
| 79 |
-
| 0.
|
| 80 |
-
| 0.
|
| 81 |
-
| 0.
|
| 82 |
-
| 0.
|
| 83 |
-
| 0.
|
| 84 |
-
| 0.
|
| 85 |
-
| 0.
|
| 86 |
-
| 0.
|
| 87 |
-
| 0.
|
| 88 |
-
| 0.
|
| 89 |
-
| 0.
|
| 90 |
-
| 0.
|
| 91 |
-
| 0.
|
| 92 |
-
| 0.
|
| 93 |
-
| 0.
|
| 94 |
-
| 0.
|
| 95 |
-
| 0.
|
| 96 |
-
| 0.
|
| 97 |
-
| 0.
|
| 98 |
-
| 0.
|
| 99 |
-
| 0.
|
| 100 |
-
| 0.
|
| 101 |
-
| 0.
|
| 102 |
-
| 0.
|
| 103 |
-
| 0.
|
| 104 |
-
| 0.
|
| 105 |
-
| 0.
|
| 106 |
-
| 0.
|
| 107 |
-
| 0.
|
| 108 |
-
| 0.
|
| 109 |
-
| 0.
|
| 110 |
-
| 0.
|
| 111 |
-
| 0.
|
| 112 |
-
| 0.
|
| 113 |
-
| 0.
|
| 114 |
-
| 0.
|
| 115 |
-
| 0.
|
| 116 |
-
| 0.
|
| 117 |
-
| 0.
|
| 118 |
|
| 119 |
|
| 120 |
### Framework versions
|
|
|
|
| 1 |
---
|
| 2 |
license: apache-2.0
|
| 3 |
+
base_model: google/t5-efficient-tiny
|
| 4 |
tags:
|
| 5 |
- generated_from_trainer
|
|
|
|
| 6 |
datasets:
|
| 7 |
- generator
|
| 8 |
metrics:
|
| 9 |
+
- accuracy
|
| 10 |
- precision
|
| 11 |
- recall
|
| 12 |
- f1
|
|
|
|
| 14 |
- name: salt_language_Classification
|
| 15 |
results:
|
| 16 |
- task:
|
|
|
|
| 17 |
name: Text Classification
|
| 18 |
+
type: text-classification
|
| 19 |
dataset:
|
| 20 |
name: generator
|
| 21 |
type: generator
|
|
|
|
| 23 |
split: train
|
| 24 |
args: default
|
| 25 |
metrics:
|
| 26 |
+
- name: Accuracy
|
| 27 |
+
type: accuracy
|
| 28 |
+
value: 0.9781586021505376
|
| 29 |
+
- name: Precision
|
| 30 |
+
type: precision
|
| 31 |
+
value: 0.9786579334649282
|
| 32 |
+
- name: Recall
|
| 33 |
+
type: recall
|
| 34 |
+
value: 0.9781586021505376
|
| 35 |
+
- name: F1
|
| 36 |
+
type: f1
|
| 37 |
+
value: 0.97818824673623
|
| 38 |
---
|
| 39 |
|
| 40 |
<!-- This model card has been generated automatically according to the information the Trainer had access to. You
|
|
|
|
| 44 |
|
| 45 |
This model is a fine-tuned version of [google/t5-efficient-tiny](https://huggingface.co/google/t5-efficient-tiny) on the generator dataset.
|
| 46 |
It achieves the following results on the evaluation set:
|
| 47 |
+
- Loss: 0.0615
|
| 48 |
+
- Accuracy: 0.9782
|
| 49 |
+
- Precision: 0.9787
|
| 50 |
+
- Recall: 0.9782
|
| 51 |
+
- F1: 0.9782
|
| 52 |
|
| 53 |
## Model description
|
| 54 |
|
|
|
|
| 78 |
|
| 79 |
### Training results
|
| 80 |
|
| 81 |
+
| Training Loss | Epoch | Step | Validation Loss | Accuracy | Precision | Recall | F1 |
|
| 82 |
+
|:-------------:|:-----:|:-----:|:---------------:|:--------:|:---------:|:------:|:------:|
|
| 83 |
+
| 0.2011 | 0.025 | 500 | 0.4979 | 0.8733 | 0.9001 | 0.8733 | 0.8714 |
|
| 84 |
+
| 0.234 | 0.05 | 1000 | 0.1886 | 0.9345 | 0.9354 | 0.9345 | 0.9345 |
|
| 85 |
+
| 0.2083 | 0.075 | 1500 | 0.1833 | 0.9328 | 0.9391 | 0.9328 | 0.9328 |
|
| 86 |
+
| 0.1838 | 0.1 | 2000 | 0.1457 | 0.9476 | 0.9479 | 0.9476 | 0.9475 |
|
| 87 |
+
| 0.1737 | 0.125 | 2500 | 0.1659 | 0.9409 | 0.9438 | 0.9409 | 0.9411 |
|
| 88 |
+
| 0.1591 | 0.15 | 3000 | 0.1450 | 0.9516 | 0.9524 | 0.9516 | 0.9517 |
|
| 89 |
+
| 0.1571 | 0.175 | 3500 | 0.1351 | 0.9459 | 0.9485 | 0.9459 | 0.9461 |
|
| 90 |
+
| 0.1513 | 0.2 | 4000 | 0.1510 | 0.9456 | 0.9515 | 0.9456 | 0.9460 |
|
| 91 |
+
| 0.1439 | 0.225 | 4500 | 0.1339 | 0.9546 | 0.9578 | 0.9546 | 0.9547 |
|
| 92 |
+
| 0.1394 | 0.25 | 5000 | 0.1052 | 0.9657 | 0.9658 | 0.9657 | 0.9656 |
|
| 93 |
+
| 0.1472 | 0.275 | 5500 | 0.1088 | 0.9610 | 0.9629 | 0.9610 | 0.9609 |
|
| 94 |
+
| 0.1385 | 0.3 | 6000 | 0.0792 | 0.9694 | 0.9696 | 0.9694 | 0.9694 |
|
| 95 |
+
| 0.1349 | 0.325 | 6500 | 0.1063 | 0.9610 | 0.9632 | 0.9610 | 0.9613 |
|
| 96 |
+
| 0.1215 | 0.35 | 7000 | 0.0855 | 0.9688 | 0.9694 | 0.9688 | 0.9687 |
|
| 97 |
+
| 0.133 | 0.375 | 7500 | 0.1049 | 0.9630 | 0.9640 | 0.9630 | 0.9630 |
|
| 98 |
+
| 0.1226 | 0.4 | 8000 | 0.0938 | 0.9667 | 0.9675 | 0.9667 | 0.9667 |
|
| 99 |
+
| 0.1222 | 0.425 | 8500 | 0.1134 | 0.9570 | 0.9604 | 0.9570 | 0.9573 |
|
| 100 |
+
| 0.1165 | 0.45 | 9000 | 0.0997 | 0.9688 | 0.9697 | 0.9688 | 0.9687 |
|
| 101 |
+
| 0.1174 | 0.475 | 9500 | 0.1002 | 0.9661 | 0.9680 | 0.9661 | 0.9659 |
|
| 102 |
+
| 0.1165 | 0.5 | 10000 | 0.0807 | 0.9728 | 0.9728 | 0.9728 | 0.9728 |
|
| 103 |
+
| 0.1065 | 0.525 | 10500 | 0.0750 | 0.9745 | 0.9754 | 0.9745 | 0.9746 |
|
| 104 |
+
| 0.1089 | 0.55 | 11000 | 0.0896 | 0.9688 | 0.9703 | 0.9688 | 0.9689 |
|
| 105 |
+
| 0.1125 | 0.575 | 11500 | 0.0632 | 0.9782 | 0.9787 | 0.9782 | 0.9782 |
|
| 106 |
+
| 0.11 | 0.6 | 12000 | 0.0775 | 0.9691 | 0.9708 | 0.9691 | 0.9692 |
|
| 107 |
+
| 0.1028 | 0.625 | 12500 | 0.0833 | 0.9698 | 0.9708 | 0.9698 | 0.9698 |
|
| 108 |
+
| 0.1052 | 0.65 | 13000 | 0.0663 | 0.9751 | 0.9755 | 0.9751 | 0.9751 |
|
| 109 |
+
| 0.1068 | 0.675 | 13500 | 0.0648 | 0.9772 | 0.9774 | 0.9772 | 0.9772 |
|
| 110 |
+
| 0.1029 | 0.7 | 14000 | 0.0962 | 0.9688 | 0.9706 | 0.9688 | 0.9689 |
|
| 111 |
+
| 0.1014 | 0.725 | 14500 | 0.0686 | 0.9772 | 0.9775 | 0.9772 | 0.9771 |
|
| 112 |
+
| 0.0978 | 0.75 | 15000 | 0.0802 | 0.9745 | 0.9752 | 0.9745 | 0.9745 |
|
| 113 |
+
| 0.095 | 0.775 | 15500 | 0.0646 | 0.9758 | 0.9763 | 0.9758 | 0.9758 |
|
| 114 |
+
| 0.0996 | 0.8 | 16000 | 0.0711 | 0.9758 | 0.9761 | 0.9758 | 0.9758 |
|
| 115 |
+
| 0.0967 | 0.825 | 16500 | 0.0683 | 0.9761 | 0.9768 | 0.9761 | 0.9761 |
|
| 116 |
+
| 0.0939 | 0.85 | 17000 | 0.0572 | 0.9792 | 0.9795 | 0.9792 | 0.9791 |
|
| 117 |
+
| 0.0966 | 0.875 | 17500 | 0.0527 | 0.9792 | 0.9794 | 0.9792 | 0.9791 |
|
| 118 |
+
| 0.0925 | 0.9 | 18000 | 0.0581 | 0.9798 | 0.9802 | 0.9798 | 0.9799 |
|
| 119 |
+
| 0.0945 | 0.925 | 18500 | 0.0693 | 0.9768 | 0.9776 | 0.9768 | 0.9768 |
|
| 120 |
+
| 0.0923 | 0.95 | 19000 | 0.0615 | 0.9785 | 0.9790 | 0.9785 | 0.9785 |
|
| 121 |
+
| 0.0896 | 0.975 | 19500 | 0.0643 | 0.9758 | 0.9766 | 0.9758 | 0.9758 |
|
| 122 |
+
| 0.0979 | 1.0 | 20000 | 0.0619 | 0.9765 | 0.9770 | 0.9765 | 0.9765 |
|
| 123 |
|
| 124 |
|
| 125 |
### Framework versions
|
model.safetensors
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
size 62563880
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:de644ecbb753ee5c02fb073e5af63e60364f6e087adc53de7f3bab59627a3216
|
| 3 |
size 62563880
|
runs/May30_07-28-46_145a99fee08a/events.out.tfevents.1717054132.145a99fee08a.992.0
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:40d7e929753059d21d5cd70fcedfcb999c011e546501025bee31f478a5ed2c7b
|
| 3 |
+
size 5455
|
runs/May30_07-33-18_145a99fee08a/events.out.tfevents.1717054406.145a99fee08a.992.1
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:a747b7756023eefe2d4bd560153be3d2c6fb8159a3adddc370966a53ba398ef9
|
| 3 |
+
size 33077
|
runs/May30_07-33-18_145a99fee08a/events.out.tfevents.1717056242.145a99fee08a.992.2
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:0e42596e1849d1ea7dc27ae2d831f402089be6a2bd50755f070e6f3879c1455b
|
| 3 |
+
size 569
|
training_args.bin
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
size 5176
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:53fae9d2706ef0285d31158344cc3bade9fcbbcbbc368ab0cec3b98d7d43f9ff
|
| 3 |
size 5176
|