Audit Kit · Free during launch

AI Agent Audit

Audit an AI agent against the 50 Laws of AI Agents and return prioritized, evidence-backed issues with concrete fixes.

This is a gray-box audit workflow, not a magic scanner. Prefer direct evidence from the place where the agent lives: source files, workflow exports, prompts, tool schemas, traces, evals, logs, screenshots, or transcripts. If only behavior is available, run a black-box audit and label confidence lower.

Pick The Audit Mode

Choose one mode before asking for inputs:

If the user has not named a mode, infer the lightest useful mode from the artifacts they provide. Do not block on perfect inputs.

Inputs To Request

Ask for only the missing inputs needed to run the chosen mode. Start with the minimum viable audit before requesting a full packet.

Minimum viable audit:

Full evidence packet:

If the user has limited artifacts, run a lighter audit and label confidence clearly.

For platform-specific intake, use:

Audit Workflow

  1. Read references/50-laws-audit-rubric.md.
  2. Read assets/platform-intake.md when the audit involves n8n/workflow builders, SDK/API projects, black-box testing, or client delivery.
  3. Build a short system map: user input, context assembly, model calls, tools, memory, side effects, evals, and human handoffs.
  4. Identify concrete failure modes. Do not list laws abstractly.
  5. Map each issue to the most relevant law or laws.
  6. Rank issues by severity:
    • Critical: can leak data, perform unauthorized side effects, corrupt user/business state, or create severe compliance/security exposure.
    • High: likely to create wrong production outcomes, silent failures, or bad user decisions.
    • Medium: reduces reliability, debuggability, or maintainability but has bounded blast radius.
    • Low: polish, clarity, or future-risk issue.
  7. For every issue, include evidence from the provided artifact, the violated law, why it matters, exact fix, and verification.
  8. Separate confirmed issues from hypotheses. Never invent architecture details.
  9. End with the shortest next-action list that would reduce the most risk fastest.

Output Format

Use this structure:

# AI Agent Audit

## Executive Summary
- Overall risk: Critical|High|Medium|Low
- Audit mode:
- Confidence: High|Medium|Low
- Top risk:
- Fastest useful fix:

## System Map
Short paragraph or bullets describing how the agent works.

## Findings

### 1. <Issue Title>
Severity:
Laws:
Evidence:
Why it matters:
Fix:
Verification:
Owner:

## What Looks Solid
- ...

## Unknowns / Needed Evidence
- ...

## 7-Day Fix Plan
1. ...

Use assets/audit-report-template.md if the user asks for a reusable report.

Quality Bar

Buyer Assets

View raw Download zip

All kit files