Twelve components. All production-engineered. The gateway ships first and delivers the core value. Everything else layers on.
Cloud API Gateway
Core · Ships first
Go/Rust reverse proxy on Cloudflare Workers. OpenAI-compatible API surface. mTLS per enrolled agent. AES-256-GCM + Ed25519 lineage inline. Multi-region auto-failover. Sub-50ms overhead. Health check + real-time telemetry. The entire product in one URL.
Go / RustCloudflare Workersaws-lc-rs FIPSmTLS
DLP Scanning Engine
Core · Ships first
Inline payload scan before AES encryption and before the LLM sees the content. 40+ pattern types: SSN, PHI, credit card PANs, API keys, credentials, source code. ML classifier for secrets. Block | Redact | Warn dispatch in <5ms. Violation logged without storing content.
Regex + MLHIPAA patterns<5ms overhead
Key Management System
Core · Ships first
ECDH P-384 key delivery (Citadel tier). Per-channel key derivation with 38/38 stress tests passed. Rotation cascade: one API call rotates entire fleet. Ed25519 per-agent signing keypair provisioned at enrollment. Shamir 5-of-3 threshold for Cosmos tier.
ECDH P-384Ed25519Shamir
Threat Detection Engine
Core · Ships first
Prompt injection pattern library. Goal hijacking and indirect injection detection. Behavioral fingerprint baseline per agent. Excessive agency scope validator. Data exfiltration anomaly scoring. Agent impersonation via Ed25519 verification. Auto-quarantine on HIGH severity in <30 seconds.
OWASP Top 10Behavioral MLAuto-quarantine
Audit Log Pipeline
Core · Ships first
In-process buffer → async flush → Ed25519 signed NDJSON → S3 / SIEM. Every interaction logged: agent ID, model version, token counts, trust score, DLP result, lineage hash. Immutable hash-chained. Queryable from portal. SIEM export to Splunk, PagerDuty, custom webhook.
NDJSONS3 / SIEMHash-chained
Incident Response Automation
Q2 2026
Auto-quarantine on threat detection. Signed PDF incident report in <30 seconds. PagerDuty + Slack webhooks. 4 default playbooks: injection, exfiltration, DLP, budget exceeded. Admin releases agent via portal — logged action. Mean time to contain target: <60 seconds end-to-end.
PagerDutySlackSigned PDF
Cost Attribution + Budget Controls
Q2 2026
Token spend by team, agent, and project in real time. Configurable budget limits with three enforcement tiers: soft alert at 80% (email + Slack), hard alert at 95% (PagerDuty), hard stop at 100% (gateway returns 429, all calls blocked). Finance and IT both get what they need.
Real-time attributionHard budget stopsPer-agent granularity
Compliance Report Generator
Q2 2026
Queries audit log → auto-generates structured compliance documentation. EU AI Act Article 11 technical documentation. HIPAA §164.312 audit controls. SOC 2 CC6.1 evidence package. GDPR Article 30 record of processing. Export PDF or structured JSON. No manual assembly required.
EU AI ActHIPAASOC 2GDPR
Model Governance + AUP Enforcement
Q2 2026
Model version pinning per agent. Change notifications when providers update underlying weights. Approval workflow before updates go live. Acceptable Use Policy acknowledgment — gateway holds first call until employee acknowledges. Every acknowledgment logged to immutable audit chain.
Version pinningAUP enforcementApproval workflow
DoH Resolver — Layer 2
Q2 2026
DNS-over-HTTPS profile extends gateway coverage to browser sessions and shadow AI. Intercepts AI provider domains (api.anthropic.com, api.openai.com, 30+ more). MDM push for managed devices. Android: Private DNS setting. iOS: MDM profile. Chrome: Group Policy. Zero user action on managed devices.
DoHMDM / Intune / JamfiOS · Android · Chrome
Integration Wrappers
Q3 2026
@membrain/langchain — one-line drop-in around any LangChain LLM. CrewAI agent-to-agent interception. MCP server wrapper for tool call governance. LangGraph, AutoGen, custom agent support. Most frameworks require only the gateway URL change — no wrapper needed at all.
LangChainCrewAIMCPAutoGen
npx membrain verify CLI
Q3 2026
One command confirms the integration is working: encrypted message sent, trust score returned, lineage chain verified, dashboard shows agent live. The “you’re done” moment. Removes all uncertainty from the first-time onboarding experience. Target: under 10 seconds from zero to green.
Node.js CLInpx zero-installDashboard auto-open