Operate a durable knowledge vault
Capture, distill, and synthesize raw inputs into governed, citable markdown you own — no cloud, no lock-in.
See how it works →A local-first, markdown-native knowledge operating system that turns raw inputs into governed, citable, compounding thinking assets for agents.
# the compounding loop
raw sources → markdown wiki
markdown wiki → schemas
schemas → skills
skills → evals
evals → governed agent operations
# useful answers file back into the wiki
answers ↺ markdown wiki
Capture, distill, and synthesize raw inputs into governed, citable markdown you own — no cloud, no lock-in.
See how it works →Ingest, query, brief, and evaluate over a markdown vault — schema-governed, eval-checked, public/private-bounded.
Read the agent guide →The problem
Strategic knowledge work needs memory that can be inspected, cited, corrected, evaluated, and compounded. Most systems on offer are none of those things:
“Strategic knowledge work needs memory that can be inspected, cited, corrected, evaluated, and compounded.”
What it is
Memex is not a dumping ground, an Obsidian theme, or a vector database demo. It is a harnessed memory operating system: a compounding knowledge system an agent can read, cite, evaluate, govern, and operate.
It applies harness engineering to memory — reliability comes from the scaffolding around the model (context, tools, verification, observability, governance), not the prompt. The model is swappable; the harness is the product.
Ingested text is preserved verbatim as the reproducibility anchor. Nothing rewrites the record.
The canonical layer is atomic, linked markdown — portable plain text you own, with no lock-in.
Every transformation is a named, reviewable operation — not an opaque end-to-end guess.
Frontmatter schemas make briefs, memos, and notes structured and validatable.
Rule-based lint and evaluation checks catch broken links, missing provenance, and drift.
Read, write, share, and promote are explicit actions — bounded by a public/private vault boundary.
How it works
Inputs books, essays, conversations, research notes, market observations, strategic questions.
Output immutable raw sources.
Action the agent extracts claims, concepts, questions, references, decisions, and open loops.
Output structured markdown notes.
Action the wiki is incrementally maintained.
Output durable thinking assets — maps, briefs, memos, decision records.
Action schemas, permissions, validation, and public/private boundaries regulate agent actions.
Output safe, agent-operable memory.
Action useful answers and decisions are filed back into the wiki.
Output memory that improves instead of being re-derived.
Memex skills
A skill is a named operation the agent performs over the vault. Each reads from immutable sources and the markdown wiki, writes a schema-shaped artifact, and leaves a reviewable trail. Skill availability and maturity vary in this preview — see the preview status.
Promotes captured material into immutable raw sources with citation metadata.
Artifact a raw source note with provenance fields.
Why every downstream claim can be traced to where it came from.
Cuts a long source into candidate atomic notes — one claim or concept each.
Artifact proposed atomic markdown notes for review.
Why atomicity makes knowledge re-linkable without cascading rewrites.
Builds a reading order or concept map across a topic from existing notes.
Artifact a structured index linking related notes.
Why surfaces the shape of a domain and the gaps still open in it.
Assembles a position on a topic from the canonical layer, with citations.
Artifact a brief that links back to its source notes.
Why a position developed over months is not re-derived from scratch.
Captures a decision, its rationale, alternatives, and the open loops it leaves.
Artifact a durable decision record in the vault.
Why decisions become inspectable history, not lost context.
Answers a question by retrieving over the wiki and citing the notes used.
Artifact a cited Q&A note, fileable back into memory.
Why answers are auditable and compound instead of evaporating.
Logs a market or domain observation as a dated, linkable note.
Artifact an observation note tied to relevant entities.
Why weak signals accumulate into a durable, searchable record.
Enforces what may be read, written, shared, or promoted across the vault boundary.
Artifact governed promotion and publishing actions.
Why private material is not leaked into shareable output by accident.
Attaches and checks source links and cited-slug references on every artifact.
Artifact provenance fields the agent and human can verify.
Why citable memory is the difference between knowledge and a guess.
Runs rule-based checks for schema conformance, broken links, and missing provenance.
Artifact a validation report; failures block the run.
Why quality is enforced by the system, not by good intentions.
The OpenClaw layer
OpenClaw provides the agent runtime and assistant surface. Memex provides the durable memory and knowledge operating layer: a markdown-native vault an agent can read, cite, evaluate, govern, and improve.
Preview OpenClaw skill packaging is experimental in this release. The integration is described here at the product level; concrete packaging and runtime bindings are being validated and may change before a stable release.
Architecture
Information flows top-to-bottom: raw sources stay immutable, skills read from sources and the vault, schemas define what counts as a valid artifact, evaluators check quality, and governance controls access and promotion. The agent runtime operates over all of it — and answers can be filed back into the vault.
Markdown Vault
Use cases
Turn books and essays into a linked, citable body of knowledge that compounds.
Synthesize observations and sources into positions you can defend and revisit.
Keep decisions, rationale, and open loops in one inspectable, governed place.
Answer hard questions over your own corpus, with citations, not a generic model.
Bring structure to ambiguous strategic questions with evidence and provenance.
A vault that improves over a decade instead of being re-derived each year.
Move from private notes to shareable artifacts through a governed boundary.
Open-source showcase
The Showcase is a community gallery of real-world Memex developments and best cases from around the world. Browse the patterns others have built, then submit yours via a GitHub issue or pull request — no backend, no gatekeeping.
Candid scope
This is a harnessed memory operating system for agents that need durable, inspectable, governed knowledge — with a human as the final reviewer.
Intellectual lineage
This project stands on ideas others developed and shared. It is not affiliated with, endorsed, or sponsored by any of the people or projects below unless otherwise stated — they are inspirations and references.
The foundational pattern: raw sources remain immutable, an LLM incrementally maintains a persistent markdown wiki, and useful answers are filed back into the wiki so knowledge compounds instead of being re-derived from scratch.
Karpathy connects the pattern back to Bush's 1945 vision of the Memex: a private, actively curated knowledge store organized around associative trails between documents — closer to this project than to what the web became. The missing maintenance layer is what the LLM now supplies.
Thanks to Garry Tan for developing and sharing GStack and GBrain, which are major references for the agent-workflow and brain-layer aspects of this project.
Status
This is an early harness preview. It is not production-stable. The release is focused on validating the foundations: