Capability
Adversarial Input Injection Runtime Testing
10 artifacts provide this capability.
Want a personalized recommendation?
Find the best match →Top Matches
via “prompt injection and jailbreak vulnerability testing”
Meta's safety classifier for LLM content moderation.
Unique: CyberSecEval's prompt injection benchmark includes both textual and visual injection vectors (v3+), with multilingual variants (machine-translated MITRE prompts) and explicit measurement of false refusal rates, enabling more nuanced evaluation than binary safe/unsafe classification.
vs others: More systematic than manual prompt injection testing because it provides reproducible, quantified results across multiple injection techniques and models, and includes false refusal measurement which is often overlooked in simpler safety evaluations.