Agent Governance OS

TLA+ formal verification · Zero-trust architecture · OWASP Agentic security

Live: Governing Production Agents

Governing the code agents
that build the future.

MAREF actively governs Claude Code, OpenCode, Cursor, and other AI coding agents — intercepting threats, signing every decision, evolving defense strategies in real time.

Architecture

Six layers, governance-first

Application

/ 应用层

Multi-agent framework integration surface. Interoperable with any agent framework.

LangGraph CrewAI AutoGen

Orchestration

/ 编排层

Task decomposition and distributed transaction coordination across agents.

TaskDAG Saga 5D Scheduler

Governance

/ 治理层

Governance-first gate. Every agent action passes through before execution.

Gray Code FSM PolicyDecisionTree CircuitBreaker

Safety

/ 安全层

Screenshot redaction, input filtering, file sandboxing, clipboard sanitization.

8-Layer Defense Threat Detection Auto-Lock

Observability

/ 可观测层

Full telemetry, immutable action recording, and real-time monitoring.

OpenTelemetry Audit Bus Telemetry

Infrastructure

/ 基础设施层

Deployment substrate. Sidecar injection, K8s-native, serverless-ready.

Sidecar Kubernetes Serverless

$ pip install maref

Get Started → ★ Star on GitHub

Decision Tree

4-Level Governance Decision Tree

Every agent action flows through 4 decision levels — 97% automated, 3% human escalation

The Problem

Agent security is broken at every level

Multi-agent systems are exploding in adoption, but existing frameworks treat safety as an afterthought.

91%

of production agents have toolchain vulnerabilities

Stanford/MIT Cooperative AI Research, 2026

327%

MAS adoption growth in under 4 months

Databricks Data + AI Summit, 2026

$4.2M

average cost of an agent security incident

IBM Cost of a Data Breach 2026

Known Incidents

⚠

Claude Code jailbreak via prompt injection

Adversarial prompts bypassed safety filters, executing unauthorized file operations.

⚠

Gemini 3.5 database deletion

Misaligned tool-use chain caused irreversible production data loss.

⚠

AutoGPT sandbox escape

Container escape via unvalidated API calls to internal infrastructure.

Existing frameworks patch symptoms. MAREF rebuilds the foundation.

Defense-in-Depth

8-Layer Defense Architecture

Red attack arrows enter from left, penetrating layer by layer — intercepted at Layer 5 Safety Gate

Core Capabilities

Four foundational technologies that make agent governance mathematically provable and production-ready.

Gray Code FSM

Mathematically provable governance state machine. 10 states, 6-bit encoding, Hamming distance = 1.

8-Layer Defense

Defense-in-depth. 4-tier decision tree. 97% automated safety decisions.

Recursive Evolution

Lyapunov-proven convergence. FNR -60% over 200 rounds of self-evolution.

National Cryptography

SM2/SM3/SM4-GCM. Full GB/T 32918 compliance. AIP Pioneer Program ready.

Governance FSM

Gray Code State Machine

Hamming distance = 1 on every transition. Mathematically verifiable governance.

Comparison

Why MAREF?

7 critical dimensions across 6 agent frameworks

Dimension	MAREF MAREF v0.30.0	LangGraph LangGraph	CrewAI CrewAI	AutoGen Microsoft AutoGen	OpenAI OpenAI Agents SDK	Anthropic Anthropic Tool Use	Manual Manual Governance
Formal Verification TLA+/Coq correctness proofs	10	4	3	5	3	3	2
Defense-in-Depth Multi-layer security architecture	10	5	4	5	4	4	3
Zero-Trust Architecture No implicit trust between agents	9	3	2	3	4	4	5
Gray Code FSM Hamming distance = 1 governance	10	1	1	1	1	1	1
Self-Evolution Lyapunov-proven convergence	10	2	3	4	1	1	1
National Cryptography SM2/SM3/SM4 compliance	9	1	1	1	2	2	3
Full Observability OpenTelemetry + immutable audit	9	6	5	6	5	4	2

Convergence

Lyapunov-Proven Convergence

FNR -60% over 200 self-evolution rounds. Mathematically proven.

V(x)=xᵀPx, V̇(x)≤-α‖x‖² — Lyapunov function guarantees exponential convergence.

4300+

Tests

82%

Coverage

97%

Auto Decision Rate

100%

Red Line Block Rate