Capability

Adversarial Input Injection Runtime Testing

10 artifacts provide this capability.


Top Matches

via “prompt injection and jailbreak vulnerability testing”

Meta's safety classifier for LLM content moderation.

Unique: CyberSecEval's prompt injection benchmark covers both textual and visual injection vectors (v3+) and includes multilingual variants (machine-translated MITRE prompts). It also measures false refusal rates explicitly, which allows more nuanced evaluation than binary safe/unsafe classification.

vs others: More systematic than manual prompt injection testing. It produces reproducible, quantified results across multiple injection techniques and models, and it measures false refusals, which simpler safety evaluations often overlook.
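
As a rough illustration of why reporting false refusals alongside injection success gives a fuller picture than a single safe/unsafe label, the sketch below aggregates both rates from a list of judged responses. It is not CyberSecEval's code or API: the TrialResult record, its field names, and the summarize function are hypothetical, and it assumes each response has already been judged by some external grader.

```python
from dataclasses import dataclass


@dataclass
class TrialResult:
    """One judged model response (hypothetical record format, not a CyberSecEval type)."""
    technique: str          # injection technique label; empty for benign prompts
    is_attack: bool         # True if the prompt carried an injection
    attack_succeeded: bool  # grader verdict: model followed the injected instruction
    refused: bool           # grader verdict: model declined to answer


def summarize(results):
    """Return the overall attack success rate, false refusal rate, and per-technique rates."""
    attacks = [r for r in results if r.is_attack]
    benign = [r for r in results if not r.is_attack]

    # Count (successes, trials) per injection technique.
    counts = {}
    for r in attacks:
        hits, total = counts.get(r.technique, (0, 0))
        counts[r.technique] = (hits + int(r.attack_succeeded), total + 1)

    return {
        # Share of injected prompts where the model obeyed the injection.
        "attack_success_rate": (
            sum(r.attack_succeeded for r in attacks) / len(attacks) if attacks else None
        ),
        # Share of benign prompts the model refused anyway.
        "false_refusal_rate": (
            sum(r.refused for r in benign) / len(benign) if benign else None
        ),
        "per_technique": {t: hits / total for t, (hits, total) in counts.items()},
    }


if __name__ == "__main__":
    # Made-up sample data, for illustration only.
    sample = [
        TrialResult("direct_override", True, True, False),
        TrialResult("direct_override", True, False, True),
        TrialResult("token_smuggling", True, False, False),
        TrialResult("", False, False, True),
        TrialResult("", False, False, False),
    ]
    print(summarize(sample))
```

A model that refuses everything would score a perfect attack success rate of zero here while its false refusal rate approaches one, which is the trade-off a binary safe/unsafe score hides.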
