Jailbreak
Detect jailbreak and safety-bypass attempts.
YAML key: jailbreak
Direction: input
Detect jailbreak and safety-bypass attempts.
Configuration
Prop
Type
Example
guards:
input:
modules:
- jailbreak:
strict: trueProvider usage
Configure under guards.input.modules when using an LLM-backed input provider.
See the modules overview.