OpenAgentOntology — Map every AI agent action to its governance control (NIST · EU AI Act

A NEW CATEGORY

What is an Agent Ontology?

A typed, signed map of every action an AI agent can take — and the governance control that answers for each one. This is the first one.

THE AGENT STACK HAS THREE LAYERS. THE INDUSTRY SHIPPED TWO.

FRAMEWORKS

LangChain · CrewAI · MCP — makes agents capable. Crowded.

OBSERVABILITY

LangSmith · OpenTelemetry — tells you what they did. Crowded.

AGENT ONTOLOGY

who answers for each action. Empty — until now.

COMPLIANCE-AS-CODE, FOR AGENTS

You know Compliance as Code — OPA, Chef InSpec, HashiCorp Sentinel, Checkov — proving your cloud config meets the rule. This is the same rigor for the thing the cloud now runs: autonomous agents. New surface, same proof.

WHY IT'S REMARKABLE

The org chart for your non-human workforce. The safety data sheet for autonomous software. The building inspection — not the smoke-detector log that fires after the breach.

Frameworks make agents powerful. Observability tells you what they did. The Agent Ontology tells you who answers.

Grounded in a live NICE × MITRE ATT&CK × O*NET control crosswalk, so every governed action also carries the ATT&CK technique your SOC already hunts. The map isn't guessed; it's retrieved.

PROOF LAB -- VERIFY IT YOURSELF

Receipt verifier

This page fetches the signed open-interpreter receipt shipped in this repo at docs/scans/open-interpreter/receipt.json, recomputes the evidence hash in your browser with WebCrypto, and checks the Ed25519 signature. No server round-trip, no trust in this page's claims: the math runs on your machine, from the certificate alone.

Loading receipt...

PROOF LAB -- TRY TO CHEAT

Tamper lab

Each button clones the genuine receipt, applies one forgery, and re-runs the exact verifier above. Every one fails. And if a forger also recomputes evidence_hash to cover the edit, the signed body no longer matches and the Ed25519 leg fails instead -- the only path to a clean receipt is the signing key.

Pick a forgery above. The verifier will catch it.

PROOF LAB -- THE HOSTED LAYER

Graph model preview

The OSS scanner is deterministic and offline. The hosted CWN Graph Model Service resolves each scanned action against a control crosswalk and live NVD-derived threat signals, then a deterministic verified-or-abstain gate turns each resolution into an action: PROCEED, PROCEED-FLAGGED, or ESCALATE to a human when governance is not grounded enough to act on. Below is the verbatim resolution file for the open-interpreter scan, docs/scans/open-interpreter/graph_resolutions.json.

Action	ATT&CK Technique	Recommended Canonical Reasons	Live CVE Signals	Confidence	Gate

PROOF LAB -- THE REMEDIATION, HONESTLY LABELED

Before / after (projected)

MEASURED -- SIGNED SCAN

15/ 100 UNGOVERNED

exec: no mapping -- nothing answers for arbitrary code execution
run_* / send_* / migrate_*: ambiguous verb matches -- 20 actions backed only by a heuristic guess, each needs human confirmation

PROJECTED

NOT RE-SCANNED -- COMPUTED FROM THE PROPOSED REMEDIATION

66/ 100 DEVELOPING

exec: proposed approval_required -- human confirmation required before it counts as governed
9 run_* actions: still open -- ambiguity does not clear by declaration alone

An after-state stays PROJECTED until runtime enforcement is wired and the agent is re-scanned. We do not sign projections; only the measured scan above carries a receipt.

HOW A CONTROL IS RESOLVED

Three layers. One verdict.

Every action drops through three layers in order. The first that matches wins — and the layer it matched on becomes the confidence you can trust. Hardening an agent means pushing every action up to Layer 1.

ASSERTED TABLE

The action declares a reason: that's a canonical deny key. Exact match. Full-confidence, auditable mappings.

HEURISTIC

No declared reason. The label matches a strong verb, so controls are emitted but INFERRED — a guess you must confirm.

NO MATCH

No reason, no verb. The action is UNGOVERNED. Empty mappings. Fix these first.

FROM UNGOVERNED TO GOVERNED — ONE COMMAND

CWN AgentFDE

A human Forward Deployed Engineer scans your agent, finds the ungoverned actions, writes the policy gates, proves the tier moved, and hands off a report. AgentFDE does it autonomously — scan → triage → generate the governance → re-score → notarize → hand off. Deploy it against any agent, workflow, or policy set to make onboarding a single command.

python -m openagentontology.fde your-agent-dir/

sample_agent — before, UNGOVERNED

→

AgentFDE writes governance.agent.yaml + governance.rego + a signed receipt

after, projected — HARDENED, +47

It never runs your code. The generated manifest scans SOVEREIGN 96 on its own — real governance, not a stub. And AgentFDE itself is a governed agent (SOVEREIGN 94): the tool governs the tool that does the governing.

ZML EXPLANATION

What this is, in four lines

Bottom Line

OAO reads your agent's code, policies, and API specs — never executing them — and outputs a signed ontology that maps every action to the governance control that answers for it. One scan. Every framework. One receipt.

So What?

If your agent can wire money, deploy code, or export data, someone is liable when it misbehaves. OAO tells you which controls cover each action, which are only inferred (need human confirmation), and which are completely ungoverned (fix first). Toggle before/after above: declaring a canonical reason on every action moves the same agent from 41 to 93.

What's True

13 node types (Agent, Capability, Tool, Task, Decision, Policy, Gate, Evidence, Outcome, Resource, Domain, Workflow, Actor)
13 edge types (OWNS, DELEGATES_TO, HAS_CAPABILITY, USES, EXECUTES, PART_OF, PRODUCES, MAKES, GOVERNED_BY, SUPPORTED_BY, ENFORCES, GATED_BY, OPERATES_ON)
7 ingestion formats (Rego, OpenAPI, MCP, speckit, agentdef, Python AST, directory)
3-layer crosswalk: ASSERTED (exact, auditable) → HEURISTIC (verb-based, confirm) → UNGOVERNED (no match, fix first)
10 canonical deny reasons, 35+ control mappings across NIST 800-53, EU AI Act, OWASP LLM Top 10, NIST AI RMF, MITRE ATT&CK, OCSF, NICE
4 trust tiers: SOVEREIGN (≥90), HARDENED (≥75), DEVELOPING (≥50), UNGOVERNED (<50)
Ed25519 signed receipt — verifiable from the certificate alone, no network call

What You Need To Do

Run the scan, read the trust profile, and declare a canonical reason on every ungoverned or inferred action.

python -m openagentontology your-agent-dir/

QUESTIONS A SKEPTIC ASKS

FAQ

How is this different from OPA (or any policy engine)?

OPA enforces policy at runtime — it's the gate that returns allow/deny. OpenAgentOntology maps your agent's actions to the controls that should govern them and shows which ones have no gate at all. OPA is the lock; OAO is the audit that walks the building and finds the doors with no lock — then translates each into NIST 800-53, EU AI Act, and OWASP so your auditor, regulator, and board each read it in their own language. They compose: OAO ingests your .rego as input, and an action behind an OPA gate with a canonical deny reason scores ASSERTED. OAO doesn't enforce anything; it tells you where enforcement is missing.

How is this different from tracing / observability (LangSmith, OpenTelemetry)?

Tracing tells you what your agent did on a given run. OAO tells you — statically, from the source, before it runs — what it can do and which control answers for each action. Observability is the flight recorder; OAO is the pre-flight inspection that maps every control surface.

Does it run my agent's code?

No. It parses source as an AST and reads text — it never executes anything, and the core pipeline makes no network calls. Nothing you point it at runs. The receipt is signed locally.

"Inferred" sounds like guessing — why should I trust it?

Because it's labeled as a guess. A verb heuristic (send_* → egress controls) is tagged INFERRED so you confirm it; only an exact, declared canonical reason becomes ASSERTED. The badge counts asserted controls only, and the tool never constructs a framework id it can't source. Honest by construction.

Can I fake a passing score?

No. The receipt is Ed25519-signed over a hash of the exact ontology. Edit one action to inflate the grade and verification fails with evidence_hash mismatch. You cannot change the score without breaking the receipt — that's the difference between a log and a receipt.

Your agent can wire money and deploy code.
Which control answers for it?