security_scan
Scan text content for prompt injection attacks including instruction overrides, role reassignment, steganography, and encoding threats to secure AI prompts.
Instructions
Scan content for prompt injection attacks and security threats.
Detects:
Direct instruction overrides ("ignore previous instructions")
Role reassignment attempts ("you are now a...")
Unicode steganography (zero-width chars, directional overrides)
Base64-encoded instruction payloads
Repetition flooding (context window domination)
XML/tag-based role spoofing
Use this to verify untrusted content before including it in prompts.
Args: content: The text content to scan source: Source identifier for threat location reporting
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| source | No | <unknown> | |
| content | Yes |
Output Schema
| Name | Required | Description | Default |
|---|---|---|---|
| result | Yes |