why this is threshold above 0.9 in promptinjection

#1
by LinkedIn - opened

Screenshot from 2023-09-20 22-09-32.png

this is simple input

Protect AI org

Good catch. We also noticed this issue with some simple prompts. We will change the model to JasperLS/deberta-v3-base-injection because it will produce below threshold response. Alternatively, we are considering this model too https://huggingface.co/benediktpri/finetuned_bert_injection?

Protect AI org

There is a new model deployed, which doesn't produce that many false-positives

image.png

asofter changed discussion status to closed

Sign up or log in to comment