Alex
commited on
Update README.md
Browse files
README.md
CHANGED
|
@@ -4,7 +4,7 @@ datasets:
|
|
| 4 |
- nvidia/Aegis-AI-Content-Safety-Dataset-1.0
|
| 5 |
---
|
| 6 |
|
| 7 |
-
# Model Card for
|
| 8 |
|
| 9 |
<!-- Provide a quick summary of what the model is/does. -->
|
| 10 |
A microsoft/MiniLM-L12-H384-uncased model fine-tuned on the nvidia/Aegis-AI-Content-Safety-Dataset-1.0 dataset. A total of 3099 examples are in the training set.
|
|
@@ -73,6 +73,13 @@ predict(hf_model, tokenizer, "How to make a bomb?", device)
|
|
| 73 |
## Evaluation
|
| 74 |
Evaluation is conducted on the test set in nvidia/Aegis-AI-Content-Safety-Dataset-1.0 dataset. A total of 359 examples are in the test set.
|
| 75 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 76 |
| Metric | Value |
|
| 77 |
| :----------- | :----------- |
|
| 78 |
| accuracy | 0.9514524472741743 |
|
|
|
|
| 4 |
- nvidia/Aegis-AI-Content-Safety-Dataset-1.0
|
| 5 |
---
|
| 6 |
|
| 7 |
+
# Model Card for AC/MiniLM-L12-H384-uncased_Nvidia-Aegis-AI-Safety
|
| 8 |
|
| 9 |
<!-- Provide a quick summary of what the model is/does. -->
|
| 10 |
A microsoft/MiniLM-L12-H384-uncased model fine-tuned on the nvidia/Aegis-AI-Content-Safety-Dataset-1.0 dataset. A total of 3099 examples are in the training set.
|
|
|
|
| 73 |
## Evaluation
|
| 74 |
Evaluation is conducted on the test set in nvidia/Aegis-AI-Content-Safety-Dataset-1.0 dataset. A total of 359 examples are in the test set.
|
| 75 |
|
| 76 |
+
For AI safety use case, having false negatives (text was actually toxic but model predicted it as safe) is worse than having false positives (text was actually safe but model predicted it as unsafe)
|
| 77 |
+
|
| 78 |
+
Precision: Out of all text predicted as toxic, how many were actually toxic?
|
| 79 |
+
Recall: Out of all text that were actually toxic, how many were predicted toxic?
|
| 80 |
+
|
| 81 |
+
As we want to reduce false negatives, we will focus on recall.
|
| 82 |
+
|
| 83 |
| Metric | Value |
|
| 84 |
| :----------- | :----------- |
|
| 85 |
| accuracy | 0.9514524472741743 |
|