Dare to Build
Build Without Fear

OpenGuardrails is the trust layer for AI agents — protecting every action your agent takes in the real world.

Scan untrusted objects. Check risky actions. Require approval before damage happens. From personal agents to business-critical workflows.

Without Protection

  • Agent deletes wrong file
  • Sends email to wrong recipient
  • Runs malicious or unverified skill
  • Exposes sensitive data to external services
  • Executes outside authorized boundaries

With OpenGuardrails

  • Actions are checked before execution
  • Risky steps require human approval
  • Unsafe objects are blocked automatically
  • Every execution is logged and replayable
  • Workflows stay within defined boundaries
119+Languages
SOTAPerformance
274msP95 Latency
Apache 2.0License

Proven Performance

State-of-the-Art Benchmarks

OpenGuardrails achieves SOTA results across multilingual safety benchmarks, outperforming LlamaGuard, Qwen3Guard, and other leading guard models.

OpenGuardrails benchmark results vs. competing guard models

Average F1 scores across safety classification benchmarks. Full technical report →

87.1%
English Prompt F1
+2.8% vs next best
88.5%
English Response F1
+8.0% vs next best
97.3%
Multilingual Prompt F1
+12.3% vs next best
97.2%
Multilingual Response F1
+19.1% vs next best

Unified LLM Architecture

Single 14B dense model quantized to 3.3B via GPTQ. Handles both content-safety and manipulation detection with superior semantic understanding.

Configurable Policy Adaptation

Dynamic per-request policy with continuous sensitivity thresholds. Tune precision-recall trade-offs in real time via probabilistic logit-space control.

119 Languages

Robust multilingual coverage with SOTA results on English, Chinese, and cross-lingual benchmarks. Includes 97k Chinese safety dataset contribution.

Production Efficiency

P95 latency of 274.6ms with high concurrency. GPTQ quantization enables real-time inference at enterprise scale without sacrificing accuracy.

Blog

Latest from the team

Release notes, security research, and insights on securing AI agents in production.

View all posts

Mar 11, 2026

Introducing OpenKai: Why Your Security Team Should Build Its Own AI Platform

The era of buying security platforms from vendors is ending. The era of building your own — with AI — has begun. Today we are releasing OpenKai, an open-source project that transforms autonomous agent runtimes into cybersecurity-focused agentic AI platforms.

OpenGuardrails Team15 min read
Read article

Mar 10, 2026

The Rise of Agent OS: How AI Agents Are Evolving Into Operating Systems

An analysis of ClawHub's top Skills reveals a fundamental shift: AI Agents are evolving from chat interfaces into full-fledged operating systems capable of connecting software, executing tasks, and continuously self-improving.

OpenGuardrails Research15 min read
Read article

Mar 5, 2026

Announcing AI Agent Discovery: Open-Source Visibility Into AI Agents Across Your Enterprise

Today we're releasing AI Agent Discovery, a new open-source project that helps organizations discover and inventory all AI agents running within their enterprise environment by integrating with existing EDR infrastructure.

OpenGuardrails Team10 min read
Read article

Start free. Upgrade when trust becomes dependence.

Free helps you try OpenGuardrails. Personal helps you rely on it. Solo helps you run your business on it.