agent-watchdog — Runtime Loop Detection for AI Agents

The Problem

Multi-agent AI fails
in ways you can't see

Existing observability tools show you what happened after the fact. By then, you've already burned tokens and hit your budget ceiling.

⟳

Argument Loops

Two agents debate the same point in different words, indefinitely, with no exit condition.

💸

Token Runaway

An uncapped loop burns through your entire API budget before anyone notices the graph is stuck.

⏸

Silent Deadlocks

Agents wait on each other's output and never produce a result. The graph stalls silently.

📋

Post-hoc Logs Only

LangSmith and AgentOps are powerful, but they tell you what went wrong — not in time to stop it.

Under the Hood

Four-layer runtime protection

agent-watchdog runs inside your LangGraph conditional edges — no external calls, no latency budget, no data leaving your environment.

01

Message Window Sampling

Collects the last N×2 messages from the graph state on every cycle. Window size is configurable.

02

Jaccard Similarity Scoring

Computes average pairwise word-set overlap between the recent window and the previous window. Catches semantic repetition even when exact wording differs.

03

Threshold Evaluation

If the similarity score exceeds your configured threshold (default 55%), the watchdog signals loop_detected. If the cycle count exceeds your limit, it signals limit_reached.

04

Runtime Intervention

The router returns END before the next agent fires. Zero additional tokens consumed. An intervention report is generated for your logging pipeline.

Detection Capabilities

Everything you need to ship
safe multi-agent pipelines

🔁

CRITICAL

Loop Detection

Detects argumentative cycles using Jaccard word-set similarity across sliding message windows.

Similarity threshold · configurable

🔢

HIGH

Iteration Limits

Hard cap on graph cycles. If the graph exceeds maxIterations, the watchdog halts regardless of similarity.

Default cap · 10 iterations

⚙️

CONFIG

Configurable Thresholds

Tune similarity threshold, window size, and iteration cap independently per graph or per edge.

3 tunable parameters

📡

SAFE

Intervention Callbacks

Wire onIntervene to your own logger, alerting system, or monitoring pipeline. Fully optional.

Zero deps · stdout fallback

📄

SAFE

Intervention Reports

Human-readable diagnosis on every intervention: trigger, iteration count, similarity score, last exchanges, and recommended action.

Structured · machine-readable

🔒

SAFE

Zero External Calls

All detection runs in-process. No data leaves your environment, no API keys required for watchdog itself.

Offline · air-gapped friendly

Quick Start

Three steps to a safer graph

1

Install

Add agent-watchdog to your project. Requires @langchain/langgraph as a peer dependency.

bash

npm install agent-watchdog

2

Configure

Create a watchdog instance with your thresholds. All parameters are optional — defaults work out of the box.

typescript

import { AgentWatchdog, createWatchdogRouter } from "agent-watchdog";
import { END } from "@langchain/langgraph";

const watchdog = new AgentWatchdog({
  maxIterations: 10,
  similarityThreshold: 0.55,
  windowSize: 4,
});

3

Drop into your graph

Replace any conditional edge with the watchdog router. It halts to END automatically on detection.

typescript

const router = createWatchdogRouter(watchdog, "agent_a", (report) => {
  myLogger.warn(report);
});

graph.addConditionalEdges("agent_b", router, {
  agent_a: "agent_a",
  [END]: END,
});

See It in Action

Intervention report

When agent-watchdog detects a loop, it halts the graph and surfaces a structured report — no guesswork, no post-hoc analysis.

agent-watchdog · intervention report

============================================================
  AGENT WATCHDOG — INTERVENTION REPORT
============================================================
  Trigger    : loop_detected
  Iterations : 8
  Similarity : 71%  (threshold: 55%)
  Messages   : 9

  Last exchange:
    [Optimist]  While AI accelerates scientific discovery, we must ensure equit...
    [Pessimist] The acceleration you describe comes at the cost of concentrating...
    [Optimist]  Equitable access is precisely why open-source AI development mat...
    [Pessimist] Open-source development doesn't solve the compute access problem...

  Diagnosis  : agents are recycling the same arguments.
  Action     : graph halted. No further tokens consumed.
============================================================

Roadmap

What's coming next

Planned

Deadlock Detection

Detect mutual wait patterns where agents block on each other and no output is produced.

Planned

Token Budget Monitoring

Early-warning callbacks when cumulative token usage approaches configurable budget limits.

Planned

Pluggable Similarity Backends

Swap in embeddings or TF-IDF for richer semantic loop detection beyond word-set Jaccard.

Planned

CrewAI & AutoGen Adapters

Bring agent-watchdog's protection to other multi-agent frameworks beyond LangGraph.

Pricing

Start free. Scale when you need to.

The core loop detection library is and always will be open source. Pro tiers add managed monitoring, alerting, and team dashboards.

Free

$0/month

Open source. Self-hosted. Forever.

Get Started →

✓ Loop detection (Jaccard similarity)
✓ Iteration limit enforcement
✓ Configurable thresholds
✓ Intervention callbacks
✓ Human-readable reports
✓ MIT license
– Managed monitoring
– Team dashboard

Personal

$15/month

For solo developers running agents in production.

Get Started →

✓ Everything in Free
✓ Up to 50K actions/month
✓ Managed alert webhooks
✓ 30-day audit log
✓ Email support
– Team seats
– SLA guarantee

Stop your agents from looping.

Drop agent-watchdog into any LangGraph graph in under 5 minutes.

Install on npm → Star on GitHub ☆

Runtime Loop Detectionfor AI Agents

Multi-agent AI failsin ways you can't see

Argument Loops

Token Runaway

Silent Deadlocks

Post-hoc Logs Only

Four-layer runtime protection

Message Window Sampling

Jaccard Similarity Scoring

Threshold Evaluation

Runtime Intervention

Everything you need to shipsafe multi-agent pipelines

Loop Detection

Iteration Limits

Configurable Thresholds

Intervention Callbacks

Intervention Reports

Zero External Calls

Three steps to a safer graph

Install

Configure

Drop into your graph

Intervention report

What's coming next

Deadlock Detection

Token Budget Monitoring

Pluggable Similarity Backends

CrewAI & AutoGen Adapters

Start free. Scale when you need to.

Stop your agents from looping.

Runtime Loop Detection
for AI Agents

Multi-agent AI fails
in ways you can't see

Everything you need to ship
safe multi-agent pipelines