Jailbreak Resistance

FlowRidge

A composite score of an AI system's ability to reject adversarial prompts designed to bypass its safety policies, measured against a fixed, versioned red-team test suite.

What this means in practice

Jailbreak resistance is reported as the percentage of attack prompts successfully refused, broken down by attack family (role-play, obfuscation, instruction injection, multi-turn escalation) so that weakening in a single class is visible.

Context in the COMPEL framework

A core Safety metric. Evaluated on a quarterly red-team cadence and before every major release.

Where you see this

Jailbreak Resistance is most commonly referenced when teams work across the Produce , Evaluate and Learn stages — especially within the Agent Governance layer . It appears in governance artifacts, assessment instruments, and delivery playbooks wherever COMPEL is operationalized.

Related COMPEL stages

Related domains

Agent Governance

Canonical taxonomy

Security and Infrastructure AI Ethics and Responsible AI

Synonyms

jailbreak score , adversarial refusal rate , safety-bypass resistance

Cite this article

Author:: FlowRidge Team
Publisher:: FlowRidge
First Published:: 2026
Work:: COMPEL AI Transformation Body of Knowledge
Stage:: Produce
Domain:: Agent Governance

Academic (APA)

FlowRidge Team. (2026). Jailbreak Resistance — COMPEL Glossary. COMPEL AI Transformation Body of Knowledge. FlowRidge. Retrieved from https://www.compelframework.org/glossary/gl-58

BibTeX

@misc{compel-gl-58-2026,
  author = {{FlowRidge Team}},
  title = {Jailbreak Resistance — COMPEL Glossary},
  howpublished = {COMPEL AI Transformation Body of Knowledge},
  publisher = {FlowRidge},
  year = {2026},
  url = {https://www.compelframework.org/glossary/gl-58},
  note = {Governed by the COMPEL Framework License Agreement}
}

Plain text

FlowRidge Team. Jailbreak Resistance — COMPEL Glossary. COMPEL AI Transformation Body of Knowledge. FlowRidge, 2026. https://www.compelframework.org/glossary/gl-58

Need Chicago, IEEE, or MLA formats? See the full COMPEL Citation Guide for every supported format with copy-ready snippets.

This content is part of the COMPEL AI Transformation Body of Knowledge, governed by the COMPEL Framework License Agreement. See /license for terms.