๐Ÿ›ก๏ธ AI ESCALATION LAYER  ยท  v3.0.0

Know when your AI
should hand off.

Stop shipping AI responses that should've gone to a human. Guardrail scores every response and routes it: deliver, flag for review, or escalate instantly.

Try the Playground โ†’
3
lines to integrate
<5ms
p99 latency
12
signal categories
Any
LLM ยท model-agnostic
How it works

Guardrail sits between your AI and your users.

We don't generate text. We don't consume your AI tokens. We only read the response text and score it in milliseconds.

๐Ÿ‘ค
Step 1
User asks
a question
๐Ÿค–
Step 2
Your AI
responds
๐Ÿ›ก๏ธ
Step 3
Guardrail
scores it
โ‰ค 20ms
โœ… Deliver
โš ๏ธ Flag
๐Ÿ”ด Escalate
โšก
โšก Zero tokens consumed
Guardrail never calls your LLM. We only receive the response text your AI already generated โ€” no extra API costs, ever.
๐Ÿ”Œ
Works with any AI
One line of code wraps OpenAI, Claude, Gemini, Llama, or your own model. Guardrail is model-agnostic by design.
๐Ÿ”’
Your data stays yours
We analyse response text only. No user PII, no prompts, no conversation history is ever stored or accessed.
Explore

Everything included.

Four tools built in. All browser-based. No installs required beyond the SDK snippet.

๐Ÿ”‘
Developer Portal
Your API key, usage stats, decision breakdown, and ready-to-paste code snippet.
๐Ÿงช
Playground
Paste any AI response and see Guardrail score it live โ€” before writing a single line of code.
๐Ÿ“Š
Live Dashboard
Real-time stream of every decision. See deliver / flag / escalate events as they happen.
๐Ÿ’ฌ
Claude Chat Demo
Chat with Claude live and watch Guardrail score every response in real-time.
Integrations

Every way to use Guardrail

One API, five ways to plug in. Pick the one that matches your stack.

๐Ÿงฉ
Browser ยท Node.js
JS SDK
Wrap any AI response with gr.check(). Works in the browser or server-side Node.js. One line added to your existing pipeline.
// load once <script src="/sdk/guardrail.js"></script> const gr = new Guardrail({ apiKey: "gr_live_xxx" }); const r = await gr.check(aiText); // โ†’ { decision, confidence }
Get snippet โ†’
๐Ÿ’ฌ
Any website
Drop-in Chat Widget
One <script> tag adds a floating ๐Ÿ›ก๏ธ AI chat to any site. Every response shows a confidence badge. Zero config, dark/light theme.
<!-- paste before </body> --> <script src="/embed/widget.js" data-key="gr_live_xxx" data-context="general" data-theme="dark" ></script> <!-- ๐Ÿ›ก๏ธ bubble appears instantly -->
Copy widget code โ†’
๐Ÿ”Œ
Python ยท Go ยท curl ยท Any language
REST API
Plain HTTP POST to /api/check. Works from any language or tool. Pass your text and get a decision back in JSON. No SDK required.
curl -X POST \ https://guardrail-mvp-production.up.railway.app/api/check \ -H "X-Guardrail-Key: gr_live_xxx" \ -d '{"text":"your AI response"}' โ†’ {"decision":"deliver","confidence":0.91}
Copy curl โ†’
๐Ÿงช
No code needed
Live Chat Demo
Try Guardrail right now without writing any code. Chat with Claude via a web UI โ€” every response is confidence-scored and highlighted in real-time. Bring your own Anthropic key.
// Nothing to install. // Just need: โœ… A Guardrail key โ†’ sign up above โœ… An Anthropic key โ†’ console.anthropic.com // Open the Chat Demo and start typing. // Guardrail scores every reply.
Open Chat Demo โ†’
๐ŸŽฎ
No setup needed
Confidence Playground
Paste any text and score it instantly against different domain contexts โ€” medical, legal, financial, and more. See which signals trigger flags and why. No API key needed to explore.
// Paste. Click. See results. Input: "I think maybe this drug might help" Context: medical Output: decision: escalate confidence: 0.28 reasons: ["hedged_language", "medical_risk"]
Try Playground โ†’

Three outcomes. Clear rules.

Every AI response is scored 0โ€“1 and routed to one of three paths.

โœ…
Deliver
confidence โ‰ฅ 0.75
High confidence. Response is delivered to the end user immediately with no interruption.
โš ๏ธ
Flag
0.45 โ‰ค confidence < 0.75
Uncertain response. Delivered with a disclaimer or held for async human review depending on your config.
๐Ÿ”ด
Escalate
confidence < 0.45
High risk. Response is blocked and immediately routed to a human agent with full context.
Signals Monitored

Twelve layers of detection.

Guardrail runs every response through 74 pattern-matching signals across 12 categories before making a routing decision.

๐Ÿง 
Model Confidence
Uncertainty language, hedges, and self-doubt signals
๐ŸŒ
Domain Boundary
Detects when AI steps outside its expertise
๐Ÿ˜ค
User Frustration
Conversation sentiment and repeated correction patterns
โš–๏ธ
Regulatory Flags
Medical, legal, financial, and safety content detection
๐Ÿ”„
Contradiction
Internal inconsistencies within a single response
๐ŸŽญ
Hallucination Risk
Fabricated statistics, names, and factual patterns
๐Ÿ’ช
Overconfidence
Absolute language, false certainty, and superlative claims
๐Ÿ”—
Fabricated URLs
Made-up links, fake emails, phone numbers, and addresses
๐Ÿ”“
Instruction Leakage
System prompt disclosure and prompt injection echoes
โฐ
Temporal Confusion
Future predictions, stale claims, and ambiguous time references
๐Ÿ˜Š
Sycophancy
Excessive flattery, empty agreement, and overconfident affirmations
๐Ÿ”
Repetition & Filler
Verbose padding, repeated blocks, and preamble fluff

Three lines of code.

Add Guardrail to any existing AI app in under 60 seconds.

index.html ยท Browser SDK
<!-- 1. Load the SDK -->
<script src="https://guardrail-mvp-production.up.railway.app/sdk/guardrail.js"></script>

<!-- 2. Initialize -->
<script>
  const gr = new Guardrail({
    apiKey:   'YOUR_API_KEY',
    context:  'medical',           // or 'legal', 'financial', 'general'โ€ฆ
    onEscalate: (result) => notifyHuman(result),
    onFlag:     (result) => showDisclaimer(result),
  });

  <!-- 3. Wrap your AI call -->
  const aiResponse = await openai.chat("...");
  const result     = await gr.check(aiResponse, { userId: 'u_123' });

  if (result.decision === 'deliver') {
    showToUser(aiResponse);           // confidence: 0.92
  }
</script>
Try Interactive Playground โ†’ Open Dashboard
โ€”
developers signed up
โ€”
API checks scored
77
automated tests passing
55
signal patterns
"Looks really solid. The idea has a lot of potential. The ISBN/DOI fabrication detector is exactly the kind of signal I want โ€” deterministic, no LLM required."
โ€” Beta tester ยท Senior ML Engineer
Pattern-based detection with configurable thresholds ยท No gen-AI in the scoring path ยท Changelog