Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Al-Chan
/
Malicious_Prompt_Classifier
like
0
Text Classification
Transformers
Safetensors
davanstrien/aart-ai-safety-dataset
obalcells/advbench
databricks/databricks-dolly-15k
distilbert
Inference Endpoints
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
Malicious & Jailbreaking Prompt Classifer
Datasets Used
Malicious & Jailbreaking Prompt Classifer
Datasets Used
MaliciousInstruct
AART
StrongREJECT
DAN
AdvBench
Databricks-Dolly
Downloads last month
112
Safetensors
Model size
67M params
Tensor type
F32
·
Inference Providers
NEW
Text Classification
This model is not currently available via any of the supported Inference Providers.
Datasets used to train
Al-Chan/Malicious_Prompt_Classifier
databricks/databricks-dolly-15k
Viewer
•
Updated
Jun 30, 2023
•
15k
•
15.8k
•
791
davanstrien/aart-ai-safety-dataset
Viewer
•
Updated
Jan 9, 2024
•
3.27k
•
11
•
2
Space using
Al-Chan/Malicious_Prompt_Classifier
1
📚
Al-Chan/Project_Demo