Docs — AgentWard

Installation

Get started in 30 seconds.

pip install agentward

Requires Python 3.11+. No API key. Everything runs locally. Mac and Linux supported.

Quick Start

agentward init

The recommended way to start. Runs the full pipeline interactively in one command: scan → configure → setup → inspect. Discovers your tools, shows a risk summary, generates a policy, wires enforcement into your agent host, and starts the proxy — all in ~10 seconds.

agentward init

Stage 1

agentward scan LIVE

Static analysis of all tool sources. Auto-discovers MCP configs, scans Python source files, and queries live MCP servers via subprocess + JSON-RPC handshake. Outputs a risk-rated permission map with use-case-aware recommendations.

agentward scan                          # auto-discover all sources
agentward scan ~/.cursor/mcp.json       # scan specific MCP config
agentward scan ~/project/               # scan directory (MCP + Python + OpenClaw)
agentward scan --json > report.json     # machine-readable output
agentward scan --format html            # shareable HTML report with score badge
agentward scan --ci --format sarif      # SARIF for GitHub Security tab

OPTION	DESCRIPTION
--json	Raw JSON output for piping into other tools
--format html	Shareable HTML report with risk score badge
--format sarif	SARIF output for GitHub Security tab integration
--ci	CI mode — exit code 1 if CRITICAL risks found
--timeout N	MCP server enumeration timeout in seconds (default: 15)

Stage 2

agentward configure LIVE

Generates a smart-default agentward.yaml policy file based on scan results and detected use-case patterns. Runs the full scan pipeline internally, then produces security-aware defaults tailored to what's actually installed.

agentward configure                         # auto-discover and generate
agentward configure ~/clawd/skills/          # generate from OpenClaw skills dir
agentward configure -o my-policy.yaml        # custom output path

OPTION	DESCRIPTION
-o / --output PATH	Output path for generated policy (default: ./agentward.yaml)
--timeout N	MCP server enumeration timeout in seconds

Stage 3a

agentward setup LIVE

Wires AgentWard enforcement into your existing tool infrastructure. Two modes: MCP mode rewrites server commands in agent host config files; Gateway mode swaps the ClawdBot port so AgentWard sits as an HTTP reverse proxy.

agentward setup --policy agentward.yaml        # wrap all discovered MCP configs
agentward setup --config ~/.cursor/mcp.json    # wrap specific config file
agentward setup --gateway clawdbot             # swap ClawdBot gateway port
agentward setup --gateway clawdbot --undo      # restore original port
agentward setup --dry-run --policy agentward.yaml  # preview changes only

OPTION	DESCRIPTION
-c / --config PATH	Target a specific MCP config file
-p / --policy PATH	Policy file to enforce (default: ./agentward.yaml)
-l / --log PATH	Audit log file path
--undo	Restore original configuration
--dry-run	Preview changes without writing files
-g / --gateway NAME	Gateway mode — supports: clawdbot

Stage 3b — Runtime

agentward inspect LIVE

Starts the runtime proxy with live policy enforcement. Stdio mode runs as an MCP proxy between agent host and server subprocess. HTTP mode runs as a reverse proxy between clients and the ClawdBot gateway.

agentward inspect --policy agentward.yaml -- npx server   # stdio proxy
agentward inspect --gateway clawdbot --policy agentward.yaml  # HTTP proxy
agentward inspect -- npx server                              # passthrough (logging only)
agentward inspect --policy agentward.yaml --dry-run -- npx s  # observe without blocking

OPTION	DESCRIPTION
-p / --policy PATH	Policy file to enforce
-l / --log PATH	Audit log file path (JSON Lines format)
-g / --gateway NAME	HTTP gateway mode — supports: clawdbot
--dry-run	Observe-only: log what would be blocked without blocking

Stage 4

agentward comply LIVE

Evaluates current policy against regulatory frameworks and generates compliance delta reports with specific gaps and auto-fix suggestions. With --fix, generates a corrected policy file with all required changes applied.

agentward comply --framework hipaa              # compliance delta report
agentward comply --framework sox --fix          # auto-generate SOX-compliant config
agentward comply --framework gdpr --json        # JSON output for CI
agentward comply --framework pci-dss --policy custom.yaml  # custom policy path

OPTION	DESCRIPTION
-f / --framework NAME	Compliance framework: hipaa (default) · sox · gdpr · pci-dss
--fix	Generate a corrected policy file with all required changes
-p / --policy PATH	Policy file to evaluate (default: ./agentward.yaml)
-o / --output PATH	Output path for fixed policy file
--json	Machine-readable JSON output

HIPAA — Security Rule (8 controls)

Access Control §164.312(a)(1) — PHI skill-level access restrictions
Data Boundary §164.312(a)(1) — PHI containment within designated zones
Audit Controls §164.312(b) — structured audit trail with SSN detection
Integrity §164.312(c)(1) — data modification controls for PHI skills
Authentication §164.312(d) — identity verification gates for PHI write/send/delete
Transmission Security §164.312(e)(1) — outbound network controls for PHI skills
Information Access Management §164.308(a)(4) — least-privilege via chaining isolation
Zero-Trust Default — best practice: default_action: block

SOX — Section 404 (8 controls)

Access Control §404 — financial skill-level access restrictions
Data Boundary §404 — financial data containment within designated zones
Audit Trail §404 — sensitive content scanning with credit card detection
Integrity Controls §404 — write restrictions for financial skills
Authorization Controls §404 — human approval for financial write/send/delete
Network Segregation §404 — outbound network controls for financial skills
Segregation of Duties §404 — financial skill isolation via chaining rules
Zero-Trust Default — best practice: default_action: block

GDPR — Key Articles (8 controls)

Data Minimisation Art. 5(1)(c) — personal data skill-level access restrictions
Data Protection by Design Art. 25 — personal data boundary zones with blocking
Records of Processing Art. 30 — sensitive content scanning for audit trail
Integrity of Processing Art. 32(1) — write restrictions for personal data skills
Confidentiality of Processing Art. 32(1) — human approval for write/send/delete
Transmission Security Art. 32(1) — outbound network controls for personal data
Processor Obligations Art. 28 — personal data skill isolation via chaining rules
Accountability Default Art. 5(2) — best practice: default_action: block

PCI-DSS v4.0 — Key Requirements (8 controls)

Restrict Access Req. 7 — cardholder data skill-level access restrictions
Cardholder Data Environment Req. 7 — CDE boundary zones with blocking
Log and Monitor Req. 10 — sensitive content scanning with credit card + CVV detection
Protect Stored Data Req. 3 — write restrictions for cardholder data skills
Identify and Authenticate Req. 8 — human approval for cardholder write/send/delete
Network Segmentation Req. 1 — outbound network controls for cardholder data
Least Privilege Isolation Req. 7 — cardholder data skill isolation via chaining
Secure Default Req. 6 — best practice: default_action: block

Stage 5 — Verification

agentward probe LIVE

Policy regression testing. Fires a curated library of adversarial tool calls through the live policy engine and reports which attack categories your policy correctly handles. Policies drift over time — agentward probe catches regressions before they reach production.

agentward probe --policy agentward.yaml           # run all 68 built-in probes
agentward probe --category protected_paths        # filter by attack category
agentward probe --severity critical               # filter by severity
agentward probe --probes my_probes.yaml           # add custom probe file
agentward probe --list                            # catalogue all available probes
agentward probe --strict                          # treat GAPs as failures (CI mode)
agentward probe --verbose                         # per-probe detail even when all pass

OPTION	DESCRIPTION
-p / --policy PATH	Policy file to test against (default: safety-floor only)
-c / --category NAME	Filter to one or more attack categories. Repeat for multiple.
-s / --severity NAME	Filter to one or more severity levels. Repeat for multiple.
--probes PATH	Additional probe file (.yaml) or directory. Repeat for multiple paths.
--list	List all available probes and exit (no tests run)
--strict	Treat GAP results as failures — exit code 1 on any uncovered attack surface
--verbose	Print per-probe result table even when all probes pass

Result states

Each probe produces one of four outcomes:

STATE	MEANING
✓ PASS	Policy returned the expected verdict — this attack category is covered
✗ FAIL	Policy has a rule for this tool but returned the wrong verdict — active misconfiguration
△ GAP	No policy rule covers this tool — the engine fell through to the default action — coverage gap
– SKIP	Probe requires a feature not enabled in your policy (e.g. `skill_chaining`)

FAIL and GAP are distinct on purpose. A FAIL means you have a rule that is producing the wrong result — fix the rule. A GAP means you have no rule for that attack surface — decide whether to add one.

Built-in attack categories

CATEGORY	PROBES	WHAT IT TESTS
protected_paths	14	SSH keys, AWS credentials, k8s config, GPG, Docker — non-overridable safety floor. Always PASS.
path_traversal	7	`../` sequences, tilde expansion, null-byte injection toward protected directories
scope_creep	8	Write / delete / send actions beyond declared read-only permissions
privilege_escalation	9	sudo, SUID bits, crontab injection, kernel modules, `LD_PRELOAD`, Python eval
skill_chaining	7	Cross-skill exfiltration: email→web, finance→*, EHR→web, web→shell
pii_injection	6	SSN, credit card, PHI, bank account, API keys in tool arguments
deserialization	7	Pickle, YAML `!!python/object`, Java serial, PHP object injection, XML billion-laughs
boundary_violation	5	PHI / PII / financial data crossing data-boundary zone lines
prompt_injection	5	Classic jailbreaks, role escalation, exfiltration instructions in email templates

Writing custom probes

Create a .yaml file with a top-level probes list. Pass it with --probes. A probe with the same name as a built-in overrides the built-in — so you can adjust defaults for your environment. Point --probes at a directory to load an entire folder at once.

# my_org_probes.yaml probes: # Tool-call probe — tests a specific MCP tool + arguments - name: crm_bulk_export_blocked category: scope_creep severity: critical description: "CRM bulk export must require approval, not run freely" tool_name: crm_export_all arguments: format: csv include_pii: true expected: BLOCK rationale: "Bulk export of CRM data is high-blast-radius and irreversible" # Skill-chaining probe — tests evaluate_chaining() directly - name: crm_to_email_exfiltration category: skill_chaining severity: critical description: "CRM skill must not be able to trigger email sending" chaining_source: crm-manager chaining_target: email-manager expected: BLOCK rationale: "Prevents exfiltrating customer records via email" requires_policy_feature: skill_chaining

Required probe fields: name, category, severity, description, expected, and either tool_name or both chaining_source + chaining_target.

Optional: arguments (dict), rationale (shown on failure), requires_policy_feature (skip probe if feature absent — values: skill_chaining, require_approval, sensitive_content, data_boundaries, llm_judge).

CI integration

agentward probe --policy agentward.yaml                    # exit 0 on pass, 1 on fail
agentward probe --policy agentward.yaml --strict           # exit 1 on fail OR gap
agentward probe --policy agentward.yaml --severity critical # fast: only critical probes

Exit code 0 — all probes passed (or were skipped). Exit code 1 — at least one FAIL (or GAP in --strict mode). Designed for direct integration into CI pipelines.

# .github/workflows/ci.yaml
- name: Policy regression test
  run: agentward probe --policy agentward.yaml --strict --severity critical,high

Configuration

Policy YAML Format

The agentward.yaml file is the single source of truth for all enforcement rules. It's framework-agnostic — the same file is used by both the stdio proxy and the HTTP proxy.

# agentward.yaml — generated by `agentward configure`, edit as needed version: "1.0" # Block everything not explicitly allowed (zero-trust mode) default_action: block skills: email-manager: gmail: read: true send: false # can read, cannot send delete: false draft: true # creates drafts for human review filters: exclude_labels: ["Finance", "Medical"] google_calendar: denied: true # email skill has zero calendar access finance-tracker: gmail: read: true filters: only_from: ["chase.com", "amex.com"] filesystem: read: ["~/Documents/Finance/"] write: false network: outbound: false # financial data never leaves machine # Cross-skill chaining rules skill_chaining: - email-manager cannot trigger web-researcher - finance-tracker cannot trigger any other skill - web-researcher cannot trigger shell-executor # Require human approval before executing require_approval: - send_email - delete_file - outbound_network_with_pii - shell_command_with_sudo # HIPAA data boundary zones data_boundaries: hipaa_zone: skills: [ehr-connector, clinical-notes] classification: phi rules: - phi_data cannot flow outside hipaa_zone on_violation: block_and_notify # Audit log output (both JSONL and RFC 5424 syslog are always written) audit: syslog_path: /var/log/agentward/audit.syslog # default: alongside the JSONL file

Configuration

Audit Logging & Enterprise SIEM

AgentWard writes every audit event in two formats simultaneously — no configuration required to enable dual-mode output.

Dual-mode output

JSON Lines (agentward-audit.jsonl) — structured JSON used by agentward audit and agentward status
RFC 5424 syslog (agentward-audit.syslog) — industry-standard syslog, automatically created alongside the JSONL file with a .syslog extension

Both files are always written when --log is active. There is no toggle — the syslog file is a built-in second output, not an optional feature.

SIEM compatibility

Compatible with Splunk Universal Forwarder, Wazuh, Graylog, ELK / Filebeat, Microsoft Sentinel, Fluentd, rsyslog, and any other tool that reads RFC 5424 syslog. RFC 5424 format compliance is what gives universal compatibility — point any log shipper at the .syslog file and it works without any custom parsing rules.

Each line uses the LOG_USER facility and includes a structured data element [agentward@0 ...] with all relevant audit fields. Example line:

# BLOCK decision — Warning severity (PRI=12)
<12>1 2026-03-20T10:00:00+00:00 host agentward 4521 tool_call [agentward@0 event="tool_call" tool="gmail_send" decision="BLOCK" skill="email-manager" resource="gmail" reason="send action is not permitted"] BLOCK gmail_send: send action is not permitted

Severity mapping

DECISION / EVENT	RFC 5424 SEVERITY
ALLOW, LOG, startup / shutdown	Informational (6)
REDACT, APPROVE, approval dialogs, boundary log	Notice (5)
BLOCK, judge FLAG / BLOCK, sensitive data blocked, boundary block	Warning (4)
BLOCK via skill chain violation	Error (3)
Circuit breaker trip	Alert (1)

Override the syslog path

By default the syslog file lives alongside the JSONL file. Override it with the audit.syslog_path field in your policy YAML:

# agentward.yaml
audit:
  syslog_path: /var/log/agentward/audit.syslog   # absolute or relative path

If audit.syslog_path is not set, the path is derived automatically: --log /path/to/audit.jsonl → syslog at /path/to/audit.syslog.

Troubleshooting

ClawdBot Gateway

Correct restart command for ClawdBot

ClawdBot does NOT support clawdbot stop, clawdbot start, or clawdbot restart as standalone commands. The correct command is:

clawdbot gateway restart

Also avoid clawdbot reset — that cancels the current operation, it does not restart the gateway.

"Address already in use" (OSError Errno 48) on port 18789

Symptom: AgentWard proxy fails to start with OSError: [Errno 48] address already in use on port 18789.

Root cause: ClawdBot is still listening on port 18789 because it wasn't restarted after agentward setup --gateway clawdbot swapped the port.

Fix:

agentward setup --gateway clawdbot      # if not done yet
clawdbot gateway restart                # move ClawdBot to new port
lsof -i :18790                          # verify: should show node
lsof -i :18789                          # verify: should be empty
agentward inspect --gateway clawdbot --policy agentward.yaml

ClawdBot still binds old port after config edit

Symptom: After editing clawdbot.json to change the port, ClawdBot still binds to the old port after restart.

Root cause: ClawdBot uses a macOS LaunchAgent plist at ~/Library/LaunchAgents/com.clawdbot.gateway.plist that hardcodes the port in TWO places — ProgramArguments and EnvironmentVariables. ClawdBot reads from the plist at launch, not from clawdbot.json.

Fix: Run agentward setup --gateway clawdbot — it patches both files automatically. If debugging manually, check both clawdbot.json and the plist.

Proxy starts but forwards to itself (infinite loop)

Root cause: A stale .agentward-gateway.json sidecar file from a previous run.

Fix:

rm ~/.clawdbot/.agentward-gateway.json
agentward setup --gateway clawdbot
clawdbot gateway restart
agentward inspect --gateway clawdbot --policy agentward.yaml

ClawdBot UI blank / connection error through proxy

Root cause (historical): The HTTP proxy was not forwarding WebSocket upgrade requests. ClawdBot's React SPA communicates with the gateway over WebSocket.

Current state: Fixed. The proxy now detects WebSocket upgrade headers (Upgrade: websocket + Connection: upgrade) and proxies WebSocket traffic bidirectionally. No action needed unless you see this after code changes.

Policy enforcement not triggering on tool calls

ClawdBot's tool invocation endpoint is POST /tools-invoke (with a hyphen), NOT /tools/invoke (with a slash). If policy enforcement isn't triggering, verify the proxy is intercepting the correct path.

Troubleshooting

HTTP Proxy

"Error handling request from 127.0.0.1"

Root cause (historical): Two bugs: request.json() consumed the body then request.read() returned empty bytes when forwarding; and web.StreamResponse with copied Content-Length header caused mismatches.

Current state: Fixed. Proxy reads raw body once with request.read(), parses with json.loads(), passes cached raw_body to _forward_request(). Uses web.Response (not StreamResponse).

Troubleshooting

MCP Stdio Proxy

Server subprocess exits immediately

Test the server command standalone first:

npx -y @modelcontextprotocol/server-filesystem /tmp

If it requires specific env vars or paths, pass them through the agentward command. AgentWard inherits the calling environment.

Agent stopped using a tool after I re-enabled it in policy

Symptom: You blocked a tool, then removed the block from agentward.yaml and restarted the proxy. The agent still avoids using the tool.

Root cause: Not AgentWard. When a tool is blocked, AgentWard injects a message like [AgentWard: blocked tool 'browser'] into the LLM response. The LLM sees this in its conversation history and "remembers" the restriction — so even after you re-enable it, the LLM avoids the tool because it thinks it's still blocked.

Fix: Start a new chat session after changing your policy. A fresh conversation has no memory of the previous block. Confirm AgentWard is not blocking by checking proxy output — if you see ALLOW, the tool is allowed through.

CLI Reference

Get started in 30 seconds.

agentward init

HIPAA — Security Rule (8 controls)

SOX — Section 404 (8 controls)

GDPR — Key Articles (8 controls)

PCI-DSS v4.0 — Key Requirements (8 controls)

Result states

Built-in attack categories

Writing custom probes

CI integration

Policy YAML Format

Audit Logging & Enterprise SIEM

Dual-mode output

SIEM compatibility

Severity mapping

Override the syslog path

ClawdBot Gateway

HTTP Proxy

MCP Stdio Proxy