Open Source · Apache 2.0

Guard Agent
for AI Agents

OpenGuardrails is the security layer that watches, controls, and governs your AI agents — so you can ship with confidence.

Runtime protection against prompt injection, data leakage, and unsafe behavior. From personal agents to enterprise-scale deployments.

Get Started Free View Pricing Join our Community Read Paper

Personal

1 Personal Agent

Protect your own personal AI assistant like OpenClaw. Runtime monitoring, config scanning, vulnerability detection, and red team testing.

From $19/mo

Business

Up to 5 agents

Observe, control, and govern customer-facing agents. Built for dev teams shipping AI products.

From $400/mo

Enterprise

Unlimited agents

Full agent inventory, blast radius analysis, threat discovery, governance policy enforcement across the org.

119+Languages

SOTAPerformance

274msP95 Latency

Apache 2.0License

What We Protect Against

OG Top 10

The most critical threats to AI agents, organized into two categories: attacks against your agent, and mistakes your agent makes.

Protection

Threats from attacks against the Agent

Prompt Injection

Detect and block attempts to override system instructions or hijack agent behavior through crafted inputs.

System Override

Prevent attackers from manipulating the agent into ignoring safety boundaries or executing unauthorized actions.

Web Attacks

Guard against XSS, CSRF, and other web-based exploits targeting agent-powered interfaces and APIs.

MCP Tool Poisoning

Detect compromised or malicious tool definitions in Model Context Protocol integrations before execution.

Malicious Code Execution

Block attempts to generate, inject, or execute harmful code through agent code interpreters and sandboxes.

Supervision

Threats from Agent mistakes

NSFW Content

Filter unsafe, explicit, or inappropriate content across 12 risk categories with configurable sensitivity.

PII Exposure

Identify and redact personally identifiable information before it reaches external models or storage.

Credential Leakage

Detect API keys, tokens, passwords, and secrets in agent inputs and outputs to prevent unauthorized access.

Confidential Data

Prevent sensitive business data, trade secrets, and proprietary information from leaking through AI interactions.

Off-Topic Drift

Keep agents focused on their intended purpose and prevent misuse for unrelated or unauthorized tasks.

Choose Your Plan

Three Editions, One Mission

Priced by agent count and usage volume — not headcount. Scale security with your AI footprint.

1 Personal Agent

Personal

For your personal AI assistant

Guard your own personal AI assistant. Built for developers, researchers, and power users who run personal agents like OpenClaw, Claude Code, Cursor, or custom bots.

$19/mo

10,000 guard calls included

Runtime I/O safety monitoring
Agent config risk scanning
Dependency & code vulnerability detection
Service exposure discovery
Proactive red team testing
Real-time email, file & URL scanning

Get OG Personal

Business

For teams shipping AI products

Observe, control, and govern customer-facing agents. Full observability and policy enforcement for your AI-powered product.

$400/mo

40,000 guard calls included

Everything in Personal
Multi-agent observability dashboard
Real-time policy enforcement
Agent behavior analytics
Custom detection rules
Dedicated support & SLA

Get Started

Unlimited Agents

Enterprise

Organization-wide AI governance

Govern all agents across departments. Discovery, inventory, blast radius analysis, threat modeling, and policy consistency at enterprise scale.

Custom

Tailored to your organization

Everything in Business
Agent discovery & inventory
Blast radius analysis
Threat & asset discovery
Responsibility gap detection
Org-wide AI governance policy

Contact Sales

Proven Performance

State-of-the-Art Benchmarks

OpenGuardrails achieves SOTA results across multilingual safety benchmarks, outperforming LlamaGuard, Qwen3Guard, and other leading guard models.

OpenGuardrails benchmark results vs. competing guard models

Average F1 scores across safety classification benchmarks. Full technical report →

87.1%

English Prompt F1

+2.8% vs next best

88.5%

English Response F1

+8.0% vs next best

97.3%

Multilingual Prompt F1

+12.3% vs next best

97.2%

Multilingual Response F1

+19.1% vs next best

Unified LLM Architecture

Single 14B dense model quantized to 3.3B via GPTQ. Handles both content-safety and manipulation detection with superior semantic understanding.

Configurable Policy Adaptation

Dynamic per-request policy with continuous sensitivity thresholds. Tune precision-recall trade-offs in real time via probabilistic logit-space control.

119 Languages

Robust multilingual coverage with SOTA results on English, Chinese, and cross-lingual benchmarks. Includes 97k Chinese safety dataset contribution.

Production Efficiency

P95 latency of 274.6ms with high concurrency. GPTQ quantization enables real-time inference at enterprise scale without sacrificing accuracy.

Community

Open Source First

Code, models, papers, and standards — all open. Build on a transparent, extensible security foundation.

Source Code

Full platform and deployment scripts on GitHub

View on GitHub →

🤗

Models

Guard models on Hugging Face under Apache 2.0

View Models →

Technical Paper

Full evaluation and methodology on arXiv

Read Paper →

AI-RSMS Standard

Community-driven runtime security standard

View Standard →

Blog

Latest from the team

Release notes, security research, and insights on securing AI agents in production.

View all posts

Mar 11, 2026

Introducing OpenKai: Why Your Security Team Should Build Its Own AI Platform

The era of buying security platforms from vendors is ending. The era of building your own — with AI — has begun. Today we are releasing OpenKai, an open-source project that transforms autonomous agent runtimes into cybersecurity-focused agentic AI platforms.

OpenGuardrails Team15 min read

Read article

Mar 10, 2026

The Rise of Agent OS: How AI Agents Are Evolving Into Operating Systems

An analysis of ClawHub's top Skills reveals a fundamental shift: AI Agents are evolving from chat interfaces into full-fledged operating systems capable of connecting software, executing tasks, and continuously self-improving.

OpenGuardrails Research15 min read

Read article

Mar 5, 2026

Announcing AI Agent Discovery: Open-Source Visibility Into AI Agents Across Your Enterprise

Today we're releasing AI Agent Discovery, a new open-source project that helps organizations discover and inventory all AI agents running within their enterprise environment by integrating with existing EDR infrastructure.

OpenGuardrails Team10 min read

Read article

Guard Agentfor AI Agents

Personal

Business

Enterprise

OG Top 10

Protection

Prompt Injection

System Override

Web Attacks

MCP Tool Poisoning

Malicious Code Execution

Supervision

NSFW Content

PII Exposure

Credential Leakage

Confidential Data

Off-Topic Drift

Three Editions, One Mission

Personal

Business

Enterprise

State-of-the-Art Benchmarks

Unified LLM Architecture

Configurable Policy Adaptation

119 Languages

Production Efficiency

Open Source First

Source Code

Models

Technical Paper

AI-RSMS Standard

Latest from the team

Introducing OpenKai: Why Your Security Team Should Build Its Own AI Platform

The Rise of Agent OS: How AI Agents Are Evolving Into Operating Systems

Announcing AI Agent Discovery: Open-Source Visibility Into AI Agents Across Your Enterprise

Guard Agent
for AI Agents