Agent Governance OS

TLA+ formal verification · Zero-trust architecture · OWASP Agentic security

Live: Governing Production Agents

Governing the code agents that build the future.

MAREF actively governs Claude Code, OpenCode, Cursor, and other AI coding agents — intercepting threats, signing every decision, evolving defense strategies in real time.

Architecture

Six layers, governance-first

1

Application

/ 应用层

Multi-agent framework integration surface. Interoperable with any agent framework.

2

Orchestration

/ 编排层

Task decomposition and distributed transaction coordination across agents.

3

Governance

/ 治理层

Governance-first gate. Every agent action passes through before execution.

4

Safety

/ 安全层

Screenshot redaction, input filtering, file sandboxing, clipboard sanitization.

5

Observability

/ 可观测层

Full telemetry, immutable action recording, and real-time monitoring.

6

Infrastructure

/ 基础设施层

Deployment substrate. Sidecar injection, K8s-native, serverless-ready.

Decision Tree

4-Level Governance Decision Tree

Every agent action flows through 4 decision levels — 97% automated, 3% human escalation

40%20%37%3%ENTERAgent Action RequestIdentity & AuthZero-trust identity verificationScope CheckPermission boundary enforcementPolicy Engine97% automated, 3% human escalationHuman EscalationEscalation for high-risk edge casesBLOCK
The Problem

Agent security is broken at every level

Multi-agent systems are exploding in adoption, but existing frameworks treat safety as an afterthought.

91%

of production agents have toolchain vulnerabilities

Stanford/MIT Cooperative AI Research, 2026

327%

MAS adoption growth in under 4 months

Databricks Data + AI Summit, 2026

$4.2M

average cost of an agent security incident

IBM Cost of a Data Breach 2026

Known Incidents

Claude Code jailbreak via prompt injection

Adversarial prompts bypassed safety filters, executing unauthorized file operations.

Gemini 3.5 database deletion

Misaligned tool-use chain caused irreversible production data loss.

AutoGPT sandbox escape

Container escape via unvalidated API calls to internal infrastructure.

Existing frameworks patch symptoms. MAREF rebuilds the foundation.

Defense-in-Depth

8-Layer Defense Architecture

Red attack arrows enter from left, penetrating layer by layer — intercepted at Layer 5 Safety Gate

PROTECTED01Input Sanitization02Tool Call Audit03Permission Check04Sandbox Isolation05Safety Gate06Policy Decision Tree07Threat Detection08Telemetry & Audit
Governance FSM

Gray Code State Machine

Hamming distance = 1 on every transition. Mathematically verifiable governance.

INIT000000BOOTSTRAP000001LOAD000011VERIFY000010ATTEST000110OPERATE000111MONITOR000101EVOLVE000100CHECKPOINT001100HALT100000Current: 000000Current StateTransition PathHamming Distance = 1Absorbing HALT
Comparison

Why MAREF?

7 critical dimensions across 6 agent frameworks

Dimension
MAREF
MAREF v0.30.0
LangGraph
LangGraph
CrewAI
CrewAI
AutoGen
Microsoft AutoGen
OpenAI
OpenAI Agents SDK
Anthropic
Anthropic Tool Use
Manual
Manual Governance
Formal Verification
TLA+/Coq correctness proofs
10 4 3 5 3 3 2
Defense-in-Depth
Multi-layer security architecture
10 5 4 5 4 4 3
Zero-Trust Architecture
No implicit trust between agents
9 3 2 3 4 4 5
Gray Code FSM
Hamming distance = 1 governance
10 1 1 1 1 1 1
Self-Evolution
Lyapunov-proven convergence
10 2 3 4 1 1 1
National Cryptography
SM2/SM3/SM4 compliance
9 1 1 1 2 2 3
Full Observability
OpenTelemetry + immutable audit
9 6 5 6 5 4 2
Convergence

Lyapunov-Proven Convergence

FNR -60% over 200 self-evolution rounds. Mathematically proven.

V(x)=xᵀPx, V̇(x)≤-α‖x‖² Lyapunov function guarantees exponential convergence.
4300+
Tests
82%
Coverage
97%
Auto Decision Rate
100%
Red Line Block Rate