memledger — Open-source trust layer for multi-agent AI

memledger ~ demo

# Install the OSS extra (Postgres + pgvector + local embeddings)

$ pip install memledger[oss]

# Start a local pgvector

$ docker run -d -p 5432:5432 -e POSTGRES_PASSWORD=ml ankane/pgvector

$ python

>>> import asyncio

>>> from memledger import Memledger

>>> ml = await Memledger.create(

... connection_string="postgresql://postgres:ml@localhost:5432/postgres",

... confidence_policy={"min_threshold": 0.5, "flag_threshold": 0.7})

>>> await ml.add(content="api-gateway p99 SLO is 250ms", source="runbook.platform", confidence=0.95)

>>> await ml.add(content="api-gateway p99 SLO is 400ms", source="slack.oncall", confidence=0.6)

>>> hits = await ml.search("api-gateway SLO", top_k=3)

>>> [(h.content, h.metadata["_effective_confidence"], h.confidence_flag) for h in hits]

[('api-gateway p99 SLO is 250ms', 0.95, 'PASS'), ('api-gateway p99 SLO is 400ms', 0.60, 'FLAG: conflict with chain://runbook.platform')]

From pip install to a governed memory in a few lines.

Built for agent trust

Six guarantees you can audit

Multi-agent systems read each other’s beliefs. memledger is what keeps that from quietly failing at scale.

Provenance Chains

Every memory tracks its derivation; chains span agents and sessions.

Learn more →

Weakest-Link Confidence

Effective confidence bounded by min(declared, chain.min). High-conf claims can’t outscore their weakest ancestor.

Learn more →

Conflict Detection

Near-duplicates flagged at write; CONFLICTS edges visible in the trust graph.

Learn more →

Memory Quality Score

Composite 0–100 score with explainable decomposition (confidence, usage, access, reliability).

Learn more →

MAI Rubric

Memory Attribution Integrity runs in RAGAS today. DeepEval, Phoenix Evals, LangSmith, OpenAI Evals on the roadmap.

Learn more →

RTBF Cascades

Right-to-be-forgotten propagates through derivatives — compliance-grade audit trail.

Learn more →

Architecture

One trust layer. Any storage. Any eval framework

memledger sits between your agents and storage, applying provenance, confidence, conflict, and audit guarantees on every operation. Eval frameworks score MAI from above; backends serve from below.

memledger in action

See trust at a glance

The Console UI surfaces every memory’s chain, conflicts, and Memory Quality Score breakdown — live, on the same memory the agent just used.

memledger UI Console — chain, conflicts, and MQS breakdown for a single memory

Use cases

Three problems memledger solves directly

Multi-agent contamination

One agent writes a low-confidence guess. Another reads it and treats it as ground truth. The bad belief propagates silently.

✦memledger gates retrieval on chain-bounded effective confidence. Below threshold, the memory is filtered or flagged before your agent ever sees it.

hits = await ml.search(
  query="connection pool fix",
  namespace="/ops/payment-svc",
  top_k=5,
  confidence_policy={"min_threshold": 0.5},
)

Compliance & RTBF

A user invokes GDPR Art. 17. The memory you wrote is fine to delete — but ten derivatives across three agents reference it.

✦memledger plans the cascade across derivatives, shows you exactly what will be deleted, and executes with a signed audit record.

plan = await ml.plan_cascade(memory_id=incident_id)
result = await ml.execute_cascade(
  plan_id=plan.id,
  reason="GDPR Art. 17",
  requester_id="dpo@org",
)

Continuous trust scoring

You ship an agent. It works in eval. Three weeks later, retrieval drift has degraded answers and you have no signal.

✦memledger’s MAI rubric runs in your existing eval stack. Score Memory Attribution Integrity on every release; alert when it slips.

from evaluators.attribution_integrity_ragas import evaluate_mai_ragas

report = await evaluate_mai_ragas(
  ledger=ml,
  dataset="calibration-30",
)
print(report.summary)

Who memledger is for

Built for the people who get paged

SRE & platform engineers

Diagnose how agent recommendations went wrong — chain, conflicts, and confidence in one trace.

ML platform leads

Gate retrieval on chain-bounded effective confidence. Sane defaults, tunable per workload, no lock-in.

Security & compliance teams

Audit every memory’s provenance and run RTBF cascades through derivatives. Apache 2.0; deploy in your VPC.

Agent framework builders

Drop memledger under LangGraph, OpenAI Agents SDK, or kagent. The trust layer is framework-agnostic.

Researchers & students

A reference implementation of provenance, weakest-link confidence, and MAI you can read end to end.

Anyone shipping multi-agent AI

If two agents share memory, you need attribution. memledger gives you that without rewriting your stack.

Track, attribute, and gate every agent memory

Six guarantees you can audit

Provenance Chains

Weakest-Link Confidence

Conflict Detection

Memory Quality Score

MAI Rubric

RTBF Cascades

One trust layer. Any storage. Any eval framework

See trust at a glance

Three problems memledger solves directly

Multi-agent contamination

Compliance & RTBF

Continuous trust scoring

Built for the people who get paged

SRE & platform engineers

ML platform leads

Security & compliance teams

Agent framework builders

Researchers & students

Anyone shipping multi-agent AI

Get started in 60 seconds