Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
lordofthejars
/
jailbreak-classifier
like
1
Text Classification
Transformers
PyTorch
Safetensors
Open-Orca/OpenOrca
jackhhao/jailbreak-classification
English
bert
jailbreak
security
moderation
prompt-injection
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
1
Deploy
Use this model
e26397c
jailbreak-classifier
1.52 kB
2 contributors
History:
1 commit
lordofthejars
initial commit
e26397c
verified
about 1 year ago
.gitattributes
Safe
1.52 kB
initial commit
about 1 year ago