🚀 Now accepting early access signups

Uptime monitoring
for AI chatbot quality

Your chatbot passes QA today. Will it tomorrow? Agent Canary runs continuous test conversations against your AI agents and alerts you the moment quality degrades — before your users notice.

Request Early Access See How It Works →

< 60s Setup time

24/7 Monitoring

Semantic AI-powered scoring

canary monitor

$ python canary.py check --target "Support Bot"

✓ "What are your hours?"
  Score: 0.94 | Latency: 342ms

✓ "How do I reset my password?"
  Score: 0.91 | Latency: 287ms

⚠ "What's your refund policy?"
  Score: 0.68 | Latency: 1204ms
  ↓ Regression detected — score dropped 0.23 from baseline

───────────────────────────────────
Target: Support Bot  |  Avg Score: 0.84  |  Alerts: 1 warning
Next check in 5 minutes...

The Problem

AI chatbots break silently

A model update, a prompt change, a knowledge base edit — and suddenly your chatbot gives wrong answers. You find out when customers complain. That's too late.

🔄

Model updates

Provider ships a new version. Your carefully tuned prompts now produce different outputs.

📝

Prompt drift

Someone on the team tweaks the system prompt. Quality drops in ways nobody catches.

📚

Knowledge rot

Your RAG pipeline indexes stale docs. The bot starts confidently quoting outdated info.

⏱️

Latency spikes

Response times creep up. Users abandon conversations. You don't notice for days.

Features

Everything you need to sleep at night

Set up canary queries. Get alerts. Fix problems before users see them.

🎯

Semantic Scoring

AI-powered similarity scoring compares responses against expected answers. Not string matching — actual meaning.

📉

Regression Detection

Moving average baselines catch gradual drift and sudden drops. Know if quality is trending down before it craters.

🔔

Multi-channel Alerts

Slack, email, webhooks — get notified wherever you work. Configure thresholds per query or per target.

⚡

Latency Tracking

Monitor response times alongside quality. Catch performance regressions that hurt user experience.

📊

Quality Reports

Historical dashboards show quality trends over time. Share with stakeholders. Prove your bot is improving.

🔌

Any Endpoint

Works with any chatbot that has an HTTP API. OpenAI, Anthropic, custom models, Rasa, Dialogflow — all supported.

How It Works

Three steps. Five minutes.

No SDK. No code changes. Just point Agent Canary at your chatbot.

Add your chatbot endpoint

Point Agent Canary at any HTTP endpoint. Supports custom headers, auth tokens, and request formats.

canary add-target --url "https://api.example.com/chat" --name "Support Bot"

Define canary queries

Write test questions with expected answers. These are the "canaries in the coal mine" that detect quality changes.

canary add-canary --target "Support Bot" --query "What are your hours?" --expected "We are open 9-5 Monday to Friday"

Monitor continuously

Agent Canary runs your test queries on a schedule and alerts you when quality drops below your threshold.

canary monitor --interval 5 --alert slack

Pricing

Simple, honest pricing

Start free. Scale when you need to.

Starter

Free

forever

1 chatbot target
5 canary queries
Hourly checks
Email alerts
7-day history

Get Started

Be the first to know
when things break

Join the early access list. We'll onboard you personally and give you Pro features free during beta.

Uptime monitoringfor AI chatbot quality