Guardrails

Guardrails are real-time content inspection rules that run on every request before it reaches the LLM provider. They detect sensitive content, block harmful topics, and enforce content policies, and they are evaluated in priority order.

Guardrail Types

Raven supports four guardrail types, each designed for a different class of content risk.

Type	Purpose	Example
block_topics	Prevent requests about specific subjects	Block requests mentioning “weapons” or “illegal activities”
pii_detection	Detect personally identifiable information	Flag SSNs, emails, or credit card numbers in prompts
content_filter	Filter content by category	Block “violence” or “adult” content categories
custom_regex	Match arbitrary patterns	Catch internal project codenames or proprietary terms

Actions

Each guardrail has an action that determines what happens when content matches.

block: Immediately reject the request with an error. The request never reaches the provider.
warn: Allow the request but include a warning in the response. The match is logged for review.
log: Silently log the match for auditing. The request proceeds normally.
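The difference between the three actions can be sketched as a small mapping from action to outcome. This is only an illustration of the behaviour described above: `Outcome` and `apply_action` are hypothetical names, not part of Raven's API.

```python
from dataclasses import dataclass, field


@dataclass
class Outcome:
    """Result of applying one matched guardrail (hypothetical structure)."""
    forwarded: bool                                 # does the request reach the provider?
    warnings: list = field(default_factory=list)    # surfaced in the response
    audit_log: list = field(default_factory=list)   # recorded for later review


def apply_action(action: str, guardrail_name: str) -> Outcome:
    """Map a matched guardrail's action to an outcome, following the list above."""
    if action == "block":
        # Rejected immediately; the provider never sees the request.
        return Outcome(forwarded=False)
    if action == "warn":
        # Request continues, a warning is attached, and the match is logged.
        return Outcome(forwarded=True,
                       warnings=[guardrail_name],
                       audit_log=[guardrail_name])
    if action == "log":
        # Request continues; the match is recorded silently.
        return Outcome(forwarded=True, audit_log=[guardrail_name])
    raise ValueError(f"unknown action: {action!r}")
```

For example, `apply_action("warn", "Detect all PII")` yields an outcome that is still forwarded but carries one warning and one audit entry.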

PII Types Supported

The pii_detection guardrail can detect the following PII types:

PII Type	Pattern	Example Match
`ssn`	Social Security Numbers	`123-45-6789`
`email`	Email addresses	`user@example.com`
`phone`	Phone numbers	`555-123-4567`
`creditCard`	Credit card numbers	`4111 1111 1111 1111`
`ipAddress`	IP addresses	`192.168.1.1`

You can enable all PII types or select specific ones in the guardrail configuration.
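To make the idea of selectable PII types concrete, here is a minimal sketch of pattern-based detection. The regular expressions are simplified assumptions chosen to match the example values in the table; they are not Raven's actual detection rules, and `detect_pii` is a hypothetical helper.

```python
import re

# Illustrative patterns only (assumptions); real detectors are usually stricter.
PII_PATTERNS = {
    "ssn": re.compile(r"\b\d{3}-\d{2}-\d{4}\b"),
    "email": re.compile(r"\b[\w.+-]+@[\w-]+(?:\.[\w-]+)+\b"),
    "phone": re.compile(r"\b\d{3}-\d{3}-\d{4}\b"),
    "creditCard": re.compile(r"\b(?:\d{4}[ -]?){3}\d{4}\b"),
    "ipAddress": re.compile(r"\b(?:\d{1,3}\.){3}\d{1,3}\b"),
}


def detect_pii(text, pii_types=None):
    """Return the PII type names found in text.

    pii_types restricts the check to specific types; None checks all of them.
    """
    selected = pii_types if pii_types is not None else list(PII_PATTERNS)
    return [name for name in selected if PII_PATTERNS[name].search(text)]
```

Passing `pii_types=["ssn", "creditCard"]` mirrors selecting specific types in the guardrail configuration, while omitting it mirrors enabling all of them.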

Creating a Guardrail

Via the Dashboard

1. Navigate to Guardrails: Go to Guardrails in the dashboard sidebar.

2. Click Create Guardrail: Select the guardrail type you want to create.

3. Configure the Rule: Set the name, action, and type-specific configuration (topics, PII types, regex pattern, or content categories).

4. Set Priority: Assign a priority number. Lower numbers are evaluated first.

5. Enable the Guardrail: Toggle the guardrail to enabled. It takes effect immediately on all new requests.

Configuration Examples

Block Topics

{
  "name": "Block harmful topics",
  "type": "block_topics",
  "action": "block",
  "config": {
    "topics": ["weapons", "illegal activities", "self-harm"]
  }
}

PII Detection

{
  "name": "Detect all PII",
  "type": "pii_detection",
  "action": "warn",
  "config": {
    "piiTypes": ["ssn", "email", "phone", "creditCard", "ipAddress"]
  }
}

Content Filter

{
  "name": "Filter unsafe content",
  "type": "content_filter",
  "action": "block",
  "config": {
    "categories": ["violence", "adult", "hate_speech"]
  }
}

Custom Regex

{
  "name": "Catch internal codenames",
  "type": "custom_regex",
  "action": "log",
  "config": {
    "pattern": "\\b(PROJECT_ALPHA|INTERNAL_ONLY)\\b"
  }
}

To reduce exposure to ReDoS attacks, regex patterns are limited to 500 characters and inspected content is truncated to 10,000 characters.
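The two limits can be sketched as follows. The constants come from the text above; the function names are hypothetical, and rejecting an over-long pattern with a `ValueError` is an assumption, since the source does not say how Raven reports that case.

```python
import re

MAX_PATTERN_LENGTH = 500      # pattern limit stated in the text
MAX_CONTENT_LENGTH = 10_000   # content truncation limit stated in the text


def compile_pattern(pattern: str) -> re.Pattern:
    """Compile a custom_regex pattern, rejecting over-long ones (assumed behaviour)."""
    if len(pattern) > MAX_PATTERN_LENGTH:
        raise ValueError(f"pattern exceeds {MAX_PATTERN_LENGTH} characters")
    return re.compile(pattern)


def regex_matches(pattern: str, content: str) -> bool:
    """Search only the first MAX_CONTENT_LENGTH characters of the content."""
    truncated = content[:MAX_CONTENT_LENGTH]
    return compile_pattern(pattern).search(truncated) is not None
```

One consequence of truncation worth keeping in mind: under this sketch, a match that appears only after the first 10,000 characters is not detected.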

Priority Ordering

Guardrails are evaluated in ascending priority order. A guardrail with priority 1 runs before priority 10. If a block action triggers, evaluation stops immediately and the request is rejected.

Use low priority numbers for your most critical rules (like PII blocking) and higher numbers for informational logging rules.
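The ordering and short-circuit behaviour can be illustrated with a short sketch. The `Guardrail` dataclass and `evaluate` function are hypothetical stand-ins, and each rule's matching logic is reduced to a plain callable for brevity.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class Guardrail:
    """Hypothetical in-memory form of a guardrail rule."""
    name: str
    priority: int
    action: str                       # "block", "warn", or "log"
    matches: Callable[[str], bool]    # simplified matching logic
    enabled: bool = True


def evaluate(guardrails, content):
    """Run enabled guardrails in ascending priority order.

    Returns (blocked_by, triggered): the name of the blocking guardrail
    (or None) and the names of all guardrails that matched before stopping.
    """
    triggered = []
    active = (g for g in guardrails if g.enabled)
    for rule in sorted(active, key=lambda g: g.priority):
        if not rule.matches(content):
            continue
        triggered.append(rule.name)
        if rule.action == "block":
            # Stop at the first block; lower-priority rules never run.
            return rule.name, triggered
    return None, triggered
```

With a priority 1 blocking rule and a priority 10 logging rule that both match, only the blocking rule is reported, because evaluation stops before the logging rule runs.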

How It Works

When a request arrives at the Raven proxy:

Request --> Auth --> Rate Limit --> Guardrails --> Route --> Provider

1. All enabled guardrails for the organization are loaded, sorted by priority.
2. Message content is extracted from the request body.
3. Each guardrail evaluates the content against its configured rules.
4. On a block match, a GuardrailError is thrown and the provider never receives the request.
5. On a warn match, the warning is recorded and the request continues.
6. On a log match, the match is silently recorded.

Because blocked requests never reach the provider, guardrails save you tokens and cost in addition to providing safety.
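The content extraction and blocking steps might look roughly like the sketch below. It assumes an OpenAI-style chat body with a `messages` list of `{"role", "content"}` objects, and it uses case-insensitive substring matching for topics; both are assumptions. The `GuardrailError` class here is a stand-in that only borrows the name from the text above, and `extract_content` and `check_request` are hypothetical helpers.

```python
class GuardrailError(Exception):
    """Stand-in for the error named above; its real fields are not documented here."""

    def __init__(self, guardrail_name: str):
        super().__init__(f"Request blocked by guardrail: {guardrail_name}")
        self.guardrail_name = guardrail_name


def extract_content(body: dict) -> str:
    """Join the string contents of all messages (assumed request shape)."""
    parts = []
    for message in body.get("messages", []):
        content = message.get("content")
        if isinstance(content, str):
            parts.append(content)
    return "\n".join(parts)


def check_request(body: dict, guardrail_name: str, topics: list) -> dict:
    """Raise GuardrailError if any topic appears; otherwise return the body unchanged."""
    content = extract_content(body).lower()
    for topic in topics:
        if topic.lower() in content:
            # Raising here means the routing and provider steps never run.
            raise GuardrailError(guardrail_name)
    return body
```

A caller would wrap `check_request` in a `try`/`except GuardrailError` block and return an error response instead of forwarding the request.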
