[{"localId": "https://terraphim.ai/releases/", "https://atomicdata.dev/properties/name" : "Releases", "https://terraphim.ai/properties/url" : "https://terraphim.ai/releases/", "https://atomicdata.dev/properties/description" : "
Stay up-to-date with the latest Terraphim AI releases.
\nReleased: 7 April 2026
\nDownload from GitHub | GitHub Releases
\ncurl -fsSL https://raw.githubusercontent.com/terraphim/terraphim-ai/main/scripts/install.sh | bash\n\nv1.16.31 ships pre-built binaries for 7 platforms:
\nThree binaries in each release:
\nterraphim-agent — Interactive TUI/REPL with knowledge graph search\nterraphim-cli — CLI for automation, scripting, and JSON output\nterraphim_server — HTTP REST API server\n\nChoose your preferred method:
\n# Universal installer (recommended)\ncurl -fsSL https://raw.githubusercontent.com/terraphim/terraphim-ai/main/scripts/install.sh | bash\n\n# Homebrew\nbrew tap terraphim/terraphim && brew install terraphim-ai\n\n# Cargo\ncargo install terraphim-agent\ncargo install terraphim-cli\n\n\nView complete release history on GitHub Releases.
\nStable releases are recommended for production use. They have been thoroughly tested and are the most reliable version.
\nLatest Stable: v1.16.31
\nDevelopment releases contain the latest features and improvements but may have more bugs. Use these for testing new features.
\nCheck the main branch for development builds.
\n# Universal installer (recommended)\ncurl -fsSL https://raw.githubusercontent.com/terraphim/terraphim-ai/main/scripts/install.sh | bash\n\n# Cargo\ncargo install terraphim-agent --force\ncargo install terraphim-cli --force\n\nTerraphim maintains backward compatibility for configuration files across minor versions. Major version bumps (e.g., 1.x to 2.0) may require configuration updates.
\nAfter installation or upgrade, verify your version:
\nterraphim-agent --version\nterraphim-cli --version\n\nWant to test new features before they're released?
\nJoin our Discord server and look for the #beta-testing channel. Beta testers get early access to new features and help shape the product.
\nSecurity updates are released as soon as they're available. Stay informed by:
\nIf you encounter issues with a release:
\nThis guide shows how to use terraphim-agent to rewrite shell commands before\nexecution — for example npm install -> bun add or pip install -> uv add — by plugging a knowledge-graph-backed thesaurus into your AI coding\nagent's tool-execution hook.
The mechanism composes three pieces that already exist in terraphim-agent:
\nKG markdown files in ~/.config/terraphim/docs/src/kg/ (or any role-configured path).\nterraphim-agent replace — Aho-Corasick replacement that rewrites text using a role's compiled thesaurus.\nA hook that pipes the tool's command through replace, and writes the result back into the tool's args.\n\nPrerequisites:\n\nterraphim-agent on PATH (any recent release; 1.16.33 or later).\nTerraphim Engineer role pointing at ~/.config/terraphim/docs/src/kg/.\nAn agent with a tool.execute.before style plugin API (OpenCode has one; Claude Code exposes equivalent hooks via shell scripts).\n\nEach concept is one markdown file. The filename stem becomes the concept key; the H1 heading provides the display name used as the replacement; the synonyms:: line lists terms that should be rewritten to it.
Example ~/.config/terraphim/docs/src/kg/bun install.md:
# bun add\n\nInstall dependencies using Bun package manager.\n\nsynonyms:: npm install, yarn install, pnpm install, npm i, yarn add, pnpm add\n\nConventions that matter in practice:
\nFilenames may contain spaces and match the phrase they cover (e.g. bun install).\nMulti-word synonyms are matched as whole phrases: python -m pip install is a valid synonym; the Aho-Corasick automaton uses LeftmostLongest, so the longer phrase wins when a shorter one would also match.\nDo not let two files claim the same synonym: if both uv.md and uv add.md claim pip install, the behaviour becomes non-deterministic at rebuild time. Keep single-token synonyms in the short file (pip -> uv) and multi-token phrases in the specific file (pip install -> uv add).\n\nThe Terraphim Engineer role ships the following mappings:\n\n| File | Maps to | Covers |
|---|---|---|
| bun.md | bun | npm, yarn, pnpm |
| bun install.md | bun add | npm install, yarn install, pnpm install, npm i, yarn add, pnpm add |
| bun run.md | bun run | npm run, yarn run, pnpm run |
| bunx.md | bunx | npx, pnpx, yarn dlx |
| uv.md | uv | pip, pip3, pipx |
| uv add.md | uv add | pip install, pip3 install, pip add, pipx install, python -m pip install |
| uv sync.md | uv sync | pip install -r requirements.txt |
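Each table row is an ordinary markdown file, so one can be materialised by hand with a heredoc. A sketch for uv add.md, following the conventions above (the description sentence and the KG_DIR default are assumptions; point KG_DIR at your role's configured path):

```shell
# Create the "uv add.md" mapping under the role's KG directory.
# The KG_DIR default is an assumption; override it to match your role config.
KG_DIR="${KG_DIR:-$HOME/.config/terraphim/docs/src/kg}"
mkdir -p "$KG_DIR"
cat > "$KG_DIR/uv add.md" <<'EOF'
# uv add

Install dependencies using the uv package manager.

synonyms:: pip install, pip3 install, pip add, pipx install, python -m pip install
EOF
grep 'synonyms::' "$KG_DIR/uv add.md"
```

Remember to flush the thesaurus cache afterwards (see below), or replace will keep serving the old mapping.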
printf "npm install express" \\\n | terraphim-agent replace --role "Terraphim Engineer" --fail-open --json\n\nExpected output:
\n{"result":"bun add express","original":"npm install express","replacements":1,"changed":true}\n\nFlags worth knowing:
\n--fail-open — on any error, emits the input unchanged. Mandatory in hooks so a misconfigured terraphim-agent never wedges the agent.\n--json — structured output with result, changed, replacements. Use this if the hook needs to branch on whether anything changed.\n--format plain|markdown|wiki|html — how the replacement is wrapped. Hooks want plain.\n\nTerraphim caches compiled thesauri in a SQLite database at /tmp/terraphim_sqlite/terraphim.db (path configured by crates/terraphim_settings/default/settings.toml). Editing a KG markdown file does not invalidate this cache; replace keeps returning the old mapping until you flush it.
sqlite3 /tmp/terraphim_sqlite/terraphim.db \\\n "DELETE FROM terraphim_kv WHERE key LIKE 'thesaurus_%' OR key LIKE 'document_ripgrep_%';"\n\nBecause /tmp/ is wiped on reboot, a fresh boot always gives the\nup-to-date thesaurus.
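The flush is easy to forget, so it can be wrapped in a small function. This is a convenience sketch, not a shipped command; the database path is the default from settings.toml quoted above:

```shell
# Flush Terraphim's compiled-thesaurus cache; a no-op when the cache is absent.
tflush() {
  local db=/tmp/terraphim_sqlite/terraphim.db
  if [ ! -f "$db" ]; then
    echo "no cache to flush"
    return 0
  fi
  sqlite3 "$db" \
    "DELETE FROM terraphim_kv WHERE key LIKE 'thesaurus_%' OR key LIKE 'document_ripgrep_%';"
}
tflush
```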
OpenCode plugins expose tool.execute.before(input, output) where\noutput.args.command is the mutable shell command about to run. The same\npattern works in Claude Code via the PreToolUse hook script, just with\nshell-stdin instead of a JS closure.
// ~/.config/opencode/plugin/terraphim-hooks.js\nconst REWRITE_MODE = process.env.TERRAPHIM_REWRITE_MODE || "suggest"\nconst REWRITE_ROLE = process.env.TERRAPHIM_REWRITE_ROLE || "Terraphim Engineer"\nconst AUDIT_LOG = `${process.env.HOME}/Library/Application Support/terraphim/rewrites.log`\n\n// Narrow whitelist of commands whose argument grammar survives a synonym swap.\nconst REWRITEABLE_HEADS =\n /^\\s*(npm|yarn|pnpm|npx|pnpx|pip|pip3|pipx|python\\s+-m\\s+pip|python3\\s+-m\\s+pip)\\b/i\n\nexport const TerraphimHooks = async ({ $ }) => ({\n "tool.execute.before": async (input, output) => {\n if (input.tool !== "Bash" || !output.args?.command) return\n const command = output.args.command\n\n const agent = `${process.env.HOME}/.cargo/bin/terraphim-agent`\n\n // Always run the destructive-command guard first.\n const g = await $`${agent} guard ${command} --json --fail-open 2>/dev/null || echo '{"decision":"allow"}'`\n const guard = JSON.parse(g.stdout)\n if (guard.decision === "block") {\n throw new Error(`BLOCKED: ${guard.reason}`)\n }\n\n const isGitCommit = /git\\s+(-C\\s+\\S+\\s+)?commit/i.test(command)\n const isRewriteable = REWRITEABLE_HEADS.test(command)\n if (!isGitCommit && !isRewriteable) return\n\n const res = await $`echo ${command} | ${agent} replace --role ${REWRITE_ROLE} --fail-open --json 2>/dev/null`\n const parsed = JSON.parse(res.stdout)\n const rewrite = (parsed.result || "").trim()\n if (!parsed.changed || !rewrite || rewrite === command) return\n\n const line = [\n new Date().toISOString(), REWRITE_MODE,\n isGitCommit ? "git-commit" : "pkg-mgr",\n command.replace(/[\\t\\n\\r]/g, " "),\n rewrite.replace(/[\\t\\n\\r]/g, " "),\n ].join("\\t") + "\\n"\n await $`mkdir -p "$(dirname ${AUDIT_LOG})" < /dev/null && printf %s ${line} >> ${AUDIT_LOG}`\n\n if (REWRITE_MODE === "apply" || isGitCommit) {\n output.args.command = rewrite\n }\n },\n})\n\nDesign notes:
\nOnly commands matching REWRITEABLE_HEADS are candidates.\nThe default mode is suggest; set TERRAPHIM_REWRITE_MODE=apply once you trust the diffs. Git commit rewriting always applies because commit messages are prose, not syntax.\nEvery rewrite is logged to ~/Library/Application Support/terraphim/rewrites.log so you can diff before flipping modes.\nEvery external call is guarded with --fail-open and || fallbacks. If terraphim-agent is missing, commands pass through unchanged.\n\nWith the hook installed and the cache flushed, open your agent, ask it to run npm install express, and inspect the audit log:
tail -n 5 ~/Library/Application\\ Support/terraphim/rewrites.log\n\nYou should see a line like:
\n2026-04-15T11:32:51.129Z suggest pkg-mgr npm install express bun add express\n\nIn suggest mode the command still executes as npm install express; in\napply mode the agent actually runs bun add express.
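For Claude Code's shell-stdin hooks, the branching the JS plugin performs can be sketched as a shell function. The role name and agent path mirror the plugin above; this is a sketch, not the shipped hook, and it fails open (passes the command through) when terraphim-agent or jq is unavailable:

```shell
# Rewrite a command via `terraphim-agent replace --json`, branching on .changed.
# Falls back to the original command when the agent or jq is missing (fail open).
rewrite_cmd() {
  local cmd="$1"
  local agent="$HOME/.cargo/bin/terraphim-agent"
  if [ -x "$agent" ] && command -v jq >/dev/null 2>&1; then
    local out changed
    out=$(printf '%s' "$cmd" | "$agent" replace --role "Terraphim Engineer" --fail-open --json)
    changed=$(printf '%s' "$out" | jq -r '.changed')
    if [ "$changed" = "true" ]; then
      printf '%s\n' "$out" | jq -r '.result'
      return 0
    fi
  fi
  printf '%s\n' "$cmd"   # unchanged, or agent/jq missing: pass through
}
rewrite_cmd "npm install express"
```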
terraphim-agent learn hook --format <claude|codex|opencode> has three\nmodes driven by --learn-hook-type:
post-tool-use — the default, captures failed Bash commands as learnings. This is already wired into the OpenCode plugin's tool.execute.after callback.\npre-tool-use — checks if the command matches a past failure pattern and stashes the hint to ~/.local/share/terraphim/session-hints.txt for LLM consumption. Does not block and does not print to the user terminal.\nuser-prompt-submit — scans the user's prompt for patterns like \"use X instead of Y\" or \"prefer X over Y\" and records a ToolPreference correction under ~/Library/Application Support/terraphim/learnings/correction-*.md.\n\nAt present these corrections are stored but not yet fed back into the replacement thesaurus. Closing that loop is tracked as future work — see the accompanying GitHub issue \"Learning-driven command correction: Phase 2 & 3\".
\nreplace returns the original unchanged.\nRun terraphim-agent search \"<synonym>\" --role \"<role>\" — if the concept\nappears, the KG is loaded but the synonym is not. Confirm the synonym is on\nthe synonyms:: line (case-insensitive; commas separate entries). Flush\nthe cache (section 3) and retry.
Failed to load thesaurus: NotFound(\"thesaurus_...\") in stderr.\nCosmetic. The agent looked for a pre-compiled JSON thesaurus first, didn't\nfind one, and fell back to building from markdown. Expected on first run.
Hook does nothing in OpenCode.\nCheck the plugin loaded: grep terraphim-hooks ~/.local/share/opencode/log/$(ls -t ~/.local/share/opencode/log/ | head -1).\nYou should see a line like service=plugin path=...terraphim-hooks.js loading plugin. If absent, the plugin file is in the wrong directory —\nOpenCode autoloads from ~/.config/opencode/plugin/ and\n~/.config/opencode/plugins/.
Commands get double-rewritten on retry.\nThe hook only touches tool.execute.before; the agent does not loop back\nthrough the hook on its own retries. If you see double rewrites, check\nwhether input.tool === \"Bash\" is spelt exactly — OpenCode passes\n\"Bash\", not \"bash\".
Terraphim AI is a modular Rust workspace comprising 54 crates. Each crate has a single responsibility and can be used independently or composed into larger systems. All crates are available in the terraphim-ai monorepo.
\nThe foundational crates that power Terraphim's deterministic knowledge graph search.
\n| Crate | Description |
|---|---|
| terraphim_automata | Aho-Corasick automata for searching and processing knowledge graphs. The core matching engine. |
| terraphim_rolegraph | Role-based knowledge graph module. Maps search roles to domain-specific graph views. |
| terraphim_types | Core types crate shared across the entire workspace. |
| terraphim_config | Configuration loading and management for all Terraphim components. |
| terraphim_settings | Settings handling library for runtime preferences and defaults. |
| terraphim_service | Service layer handling user requests and responses for the Terraphim core. |
| terraphim_middleware | Middleware for searching haystacks (pluggable data source backends). |
| terraphim-markdown-parser | Markdown parser for extracting structured content from knowledge base files. |
| terraphim_persistence | Persistence layer with Persistable trait and DeviceStorage backends (memory, SQLite, redb). |
| terraphim_build_args | Build argument management for compile-time feature configuration. |
| terraphim_test_utils | Shared test utilities and fixtures for all Terraphim crates. |
User-facing executables and command-line tools.
\n| Crate | Description |
|---|---|
| terraphim_agent | Terraphim AI Agent CLI with interactive REPL, session search, learning capture, and ASCII graph visualisation. |
| terraphim-cli | CLI tool for semantic knowledge graph search with JSON output for automation and scripting. |
| terraphim_server | HTTP server handling the core logic of Terraphim AI. Provides REST API and knowledge graph backend. |
| terraphim_update | Shared auto-update functionality for all Terraphim AI binaries. |
| terraphim_validation | Release validation system ensuring binary and asset integrity before publishing. |
OTP-inspired agent management system for running autonomous AI coding agents.
\n| Crate | Description |
|---|---|
| terraphim_orchestrator | AI Dark Factory orchestrator wiring spawner, router, and supervisor into a reconciliation loop. |
| terraphim_spawner | Agent spawner with health checking, output capture, and lifecycle management. |
| terraphim_router | Unified routing engine for LLM and agent providers (keyword routing, tier selection). |
| terraphim_agent_supervisor | OTP-inspired supervision trees for fault-tolerant AI agent management. |
| terraphim_agent_application | OTP-style application behaviour for the Terraphim agent system. |
| terraphim_agent_messaging | Erlang-style asynchronous message passing system for AI agents. |
| terraphim_agent_registry | Knowledge graph-based agent registry for intelligent agent discovery and capability matching. |
| terraphim_agent_evolution | Agent evolution and self-improvement tracking. |
| terraphim_workspace | Workspace management for agent execution including lifecycle, hooks, and isolation. |
| terraphim_multi_agent | Multi-agent system built on roles with rust-genai integration. |
Advanced crates for KG-powered reasoning, task planning, and goal management.
\n| Crate | Description |
|---|---|
| terraphim_kg_orchestration | Knowledge graph-based agent orchestration engine for coordinating multi-agent workflows. |
| terraphim_kg_agents | Specialised knowledge graph-based agent implementations. |
| terraphim_kg_linter | Linter for markdown-based Terraphim KG schemas (commands, types, permissions). |
| terraphim_goal_alignment | Knowledge graph-based goal alignment system for multi-level goal management and conflict resolution. |
| terraphim_task_decomposition | Knowledge graph-based task decomposition for intelligent task analysis and execution planning. |
| terraphim_rlm | Recursive Language Model (RLM) orchestration for structured reasoning chains. |
| terraphim_hooks | Unified hooks infrastructure for knowledge graph-based text replacement and validation. |
| terraphim_file_search | Knowledge-graph scored file search integration. |
| terraphim_codebase_eval | Codebase evaluation system with manifest types and metrics aggregation. |
| terraphim_negative_contribution | Negative contribution analysis for identifying anti-patterns and risks. |
Pluggable data source connectors for searching external systems.
\n| Crate | Description |
|---|---|
| haystack_core | Core traits and types for all Terraphim haystack integrations. |
| haystack_atlassian | Atlassian (Confluence, Jira) integration for searching enterprise knowledge bases. |
| haystack_discourse | Discourse forum integration for fetching posts and messages. |
| haystack_grepapp | Grep.app integration for searching code across GitHub repositories. |
| haystack_jmap | JMAP email protocol integration for searching email (Fastmail, etc.). |
Tools for analysing AI coding assistant sessions and tracking usage.
\n| Crate | Description |
|---|---|
| terraphim_sessions | Session management for AI coding assistant history. Search across Claude Code, Cursor, and Aider sessions. |
| terraphim-session-analyzer | Analyse AI coding assistant session logs to identify agent usage patterns. |
| terraphim_ccusage | Claude Code usage tracking and cost analysis. |
| terraphim_usage | General usage telemetry and analytics. |
Deployment, CI/CD, and infrastructure management.
\n| Crate | Description |
|---|---|
| terraphim_symphony | Symphony orchestration service. Reads issues from trackers and dispatches coding agent sessions. |
| terraphim_tracker | Issue tracker abstraction for Gitea and Linear with PageRank-based prioritisation. |
| terraphim_github_runner | GitHub Actions runner with Firecracker sandbox integration. |
| terraphim_github_runner_server | HTTP server for the GitHub Actions runner service. |
| terraphim-firecracker | Sub-2-second VM boot optimisation system for sandboxed agent execution. |
| terraphim_mcp_server | Model Context Protocol (MCP) server exposing Terraphim tools to AI assistants. |
| terraphim_onepassword_cli | 1Password CLI integration for secret management. |
| terraphim_atomic_client | Atomic Data Server client for managing stores and agents. |
Multi-channel AI assistant interfaces.
\n| Crate | Description |
|---|---|
| terraphim_tinyclaw | Multi-channel AI assistant for Telegram, Discord, and CLI. |
Cross-language bindings for using Terraphim from Python, Node.js, and WebAssembly.
\n| Crate | Description |
|---|---|
| terraphim_automata_py | Python (PyO3) bindings for terraphim_automata. Fast autocomplete and text processing for knowledge graphs. |
| terraphim_rolegraph_py | Python bindings for terraphim_rolegraph. Knowledge graph operations for AI agents. |
| terraphim-automata-node-rs | Node.js (NAPI) bindings for Terraphim's Aho-Corasick matcher. |
| terraphim-automata-wasm | WebAssembly bindings for terraphim_automata. Runs in the browser. |
| Crate | Description |
|---|---|
| terraphim-automata-wasm (extension) | WASM core for the Terraphim browser extensions (Sidebar and Autocomplete). |
# Install the agent (interactive REPL + session search)\ncargo install terraphim-agent\n\n# Install the CLI (JSON output for automation)\ncargo install terraphim-cli\n\nOr use the universal installer:
\ncurl -fsSL https://raw.githubusercontent.com/terraphim/terraphim-ai/main/scripts/install.sh | bash\n\nThe crate dependency graph follows a layered architecture:
\nterraphim_types, terraphim_config, terraphim_settings\nterraphim_automata, terraphim_rolegraph, terraphim_persistence\nterraphim_service, terraphim_middleware, haystack integrations\nterraphim_spawner, terraphim_router, terraphim_agent_supervisor\nterraphim_orchestrator, terraphim_kg_orchestration, terraphim_symphony\nterraphim_agent, terraphim-cli, terraphim_server, terraphim_tinyclaw\n\nEach crate has its own README.md with specific build instructions and examples. See the Contribution Guide for the overall workflow.
Source: github.com/terraphim/terraphim-ai
\n", "https://terraphim.ai/properties/date" : "2026-04-12", "https://atomicdata.dev/properties/tags" : {}, "https://atomicdata.dev/properties/isA": [ "https://atomicdata.dev/classes/Article" ] },{"localId": "https://terraphim.ai/docs/graph-embeddings/", "https://atomicdata.dev/properties/name" : "Graph Embeddings", "https://terraphim.ai/properties/url" : "https://terraphim.ai/docs/graph-embeddings/", "https://atomicdata.dev/properties/description" : "Terraphim uses a fundamentally different approach to semantic search compared to traditional vector embeddings. Instead of dense numerical vectors, Terraphim leverages graph structure embeddings where ranking is determined by the number of synonyms and related concepts connected to a query term in the knowledge graph.
\nUnlike vector embeddings that represent concepts as points in a high-dimensional semantic space, Terraphim represents concepts as nodes in a knowledge graph. Each node is a normalized term, and edges represent co-occurrence relationships between terms found in documents.
\nThe key insight is that rank is defined by the number of synonyms connected to a concept. When you search for a term, Terraphim expands your query to include all synonyms and related concepts from the knowledge graph, then traverses the graph to find documents that mention these connected concepts.
\n "raft" ----(edge)---- "consensus"\n | |\n (edge) (edge)\n | |\n "leader" ----(edge)---- "election"\n\nWhen you search for \"consensus algorithms\", the graph traverses from the matched node to connected nodes, finding documents that mention related concepts like \"raft\", \"leader election\", and so on.
\nThe Terraphim Graph (scorer) uses unique graph embeddings with the following ranking algorithm:
\ntotal_rank = node.rank + edge.rank + document_rank\n\nWhere:
\nWhen you search, Terraphim:
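The additive formula can be sanity-checked with toy numbers (the values below are illustrative only, not taken from a real index):

```shell
# total_rank = node.rank + edge.rank + document_rank, with made-up inputs
node_rank=3       # rank of the matched concept node
edge_rank=2       # rank of the traversed co-occurrence edge
document_rank=5   # rank contribution of the document itself
echo $(( node_rank + edge_rank + document_rank ))
```

A document hanging off a heavily connected node therefore outranks one reachable only through a sparse node, even when both contain the literal query term.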
\nThe graph embedding system is implemented in crates/terraphim_rolegraph/src/lib.rs:
For domain-specific embeddings (e.g., medical), the SymbolicEmbeddingIndex in crates/terraphim_rolegraph/src/medical.rs builds embeddings from IS-A hierarchies, allowing for hierarchical concept relationships.
The system is configured via config/atomic_graph_embeddings_config.json:
{\n "roles": {\n "Atomic Graph Embeddings": {\n "relevance_function": "terraphim-graph"\n }\n }\n}\n\nThe terraphim-graph relevance function enables graph-based ranking.
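With that config in place, the role can be exercised from the CLI. The search subcommand and --role flag appear elsewhere in these docs; the guard below just keeps the sketch runnable when the binary is absent:

```shell
# Query using the graph-embeddings role defined in the config above.
if command -v terraphim-agent >/dev/null 2>&1; then
  terraphim-agent search "consensus" --role "Atomic Graph Embeddings"
else
  echo "terraphim-agent not installed"
fi
```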
Graph embeddings excel when content relationships matter more than simple keyword matching:
\nWith a learning knowledge graph:
\n| Feature | Vector Embeddings | Graph Embeddings |
|---|---|---|
| Representation | Dense vectors | Graph structure |
| Explainability | Black box | Full traceability |
| Queries expand | Implicit via distance | Explicit via synonyms |
| Relationship capture | Learns patterns | Encodes relationships |
| Domain adaptation | Requires retraining | Add to thesaurus |
Unlike vector embeddings where you don't know WHY a document matched, Terraphim's graph embeddings show:
\nThis makes results fully auditable and debuggable.
\n# Search with the Engineer role\nterraphim-agent search "graph embeddings" --role engineer\n\nThis returns results for:
\nuse terraphim_rolegraph::RoleGraph;\nuse terraphim_types::{Thesaurus, RoleName};\n\n// Create rolegraph with domain knowledge\nlet thesaurus = build_domain_thesaurus();\nlet role_name = RoleName::new("Engineer");\nlet mut graph = RoleGraph::new(role_name, thesaurus).await?;\n\n// Index documents\nfor doc in documents {\n graph.insert_document(&doc.id, doc);\n}\n\n// Query - automatically expands to synonyms\nlet results = graph.query_graph("distributed systems", None, Some(10))?;\n\nEnable graph embeddings in your config:
\n{\n "roles": {\n "Your Role": {\n "relevance_function": "terraphim-graph",\n "kg": {\n "knowledge_graph_local": {\n "path": "./docs/your-domain"\n }\n }\n }\n }\n}\n\nChoose the installation method that best suits your needs and platform.
\nThe universal installer automatically detects your platform and installs the appropriate version.
\ncurl -fsSL https://raw.githubusercontent.com/terraphim/terraphim-ai/main/scripts/install.sh | bash\n\nbrew tap terraphim/terraphim && brew install terraphim-ai\n\nThis installs terraphim-agent, terraphim-cli, and terraphim_server.
Install using Cargo, Rust's package manager.
\n# Install agent with interactive TUI\ncargo install terraphim-agent\n\n# Install CLI for automation\ncargo install terraphim-cli\n\nDownload the .deb package from the latest release:
curl -LO https://github.com/terraphim/terraphim-ai/releases/latest/download/terraphim-agent_1.16.32-1_amd64.deb\nsudo dpkg -i terraphim-agent_1.16.32-1_amd64.deb\n\nDownload the latest release from GitHub:
\n# x86_64 (GNU)\ncurl -LO https://github.com/terraphim/terraphim-ai/releases/latest/download/terraphim-agent-1.16.32-x86_64-unknown-linux-gnu.tar.gz\ntar -xzf terraphim-agent-1.16.32-x86_64-unknown-linux-gnu.tar.gz\nsudo mv terraphim-agent terraphim-cli terraphim_server /usr/local/bin/\n\n# x86_64 (MUSL / static)\ncurl -LO https://github.com/terraphim/terraphim-ai/releases/latest/download/terraphim-agent-1.16.32-x86_64-unknown-linux-musl.tar.gz\n\n# ARM64 (MUSL)\ncurl -LO https://github.com/terraphim/terraphim-ai/releases/latest/download/terraphim-agent-1.16.32-aarch64-unknown-linux-musl.tar.gz\n\n# ARMv7 (MUSL)\ncurl -LO https://github.com/terraphim/terraphim-ai/releases/latest/download/terraphim-agent-1.16.32-armv7-unknown-linux-musleabihf.tar.gz\n\n# Clone the repository\ngit clone https://github.com/terraphim/terraphim-ai.git\ncd terraphim-ai\n\n# Build all binaries\ncargo build --release\n\n# Install\nsudo cp target/release/terraphim_server /usr/local/bin/\nsudo cp target/release/terraphim-agent /usr/local/bin/\nsudo cp target/release/terraphim-cli /usr/local/bin/\n\n# Apple Silicon (ARM64)\ncurl -LO https://github.com/terraphim/terraphim-ai/releases/latest/download/terraphim-agent-1.16.32-aarch64-apple-darwin.tar.gz\ntar -xzf terraphim-agent-1.16.32-aarch64-apple-darwin.tar.gz\nsudo mv terraphim-agent terraphim-cli terraphim_server /usr/local/bin/\n\n# Intel (x86_64)\ncurl -LO https://github.com/terraphim/terraphim-ai/releases/latest/download/terraphim-agent-1.16.32-x86_64-apple-darwin.tar.gz\n\n# Universal (Fat binary)\ncurl -LO https://github.com/terraphim/terraphim-ai/releases/latest/download/terraphim-agent-1.16.32-universal-apple-darwin.tar.gz\n\nRequires Xcode command line tools.
\ngit clone https://github.com/terraphim/terraphim-ai.git\ncd terraphim-ai\ncargo build --release\nsudo cp target/release/terraphim_server /usr/local/bin/\nsudo cp target/release/terraphim-agent /usr/local/bin/\nsudo cp target/release/terraphim-cli /usr/local/bin/\n\n# Download and extract\ncurl -LO https://github.com/terraphim/terraphim-ai/releases/latest/download/terraphim-agent-1.16.32-x86_64-pc-windows-msvc.zip\n\nExtract the zip and add the directory to your PATH.
\n\nRequires Rust for Windows.
\ngit clone https://github.com/terraphim/terraphim-ai.git\ncd terraphim-ai\ncargo build --release\n# Binaries will be in target\\release\\\n\nThe @terraphim/autocomplete package provides NAPI bindings for autocomplete and knowledge graph functions.
npm install @terraphim/autocomplete\n\nThe terraphim-automata package provides PyO3 bindings for text matching and autocomplete.
pip install terraphim-automata\n\nTwo browser extensions are available for developer-mode installation:
\nInstall from source:
\ngit clone https://github.com/terraphim/terraphim-ai.git\ncd terraphim-ai/browser_extensions\n\nThen load unpacked in Chrome at chrome://extensions (enable Developer Mode). Coming to the Chrome Web Store soon.
See browser_extensions/INSTALL.md for detailed instructions.
\nAfter installation, verify that Terraphim is working:
\n# Check version\nterraphim-agent --version\n# terraphim-agent 1.16.32\n\nterraphim-cli --version\n# terraphim-cli 1.16.32\n\n# Start the server\nterraphim_server\n\n# In another terminal, use the agent\nterraphim-agent\n\nIf you get a permission denied error, make the binary executable:
\nchmod +x /usr/local/bin/terraphim_server\nchmod +x /usr/local/bin/terraphim-agent\nchmod +x /usr/local/bin/terraphim-cli\n\nEnsure that the installation directory is in your PATH:
\n# For bash\necho 'export PATH=$PATH:/usr/local/bin' >> ~/.bashrc\nsource ~/.bashrc\n\n# For zsh\necho 'export PATH=$PATH:/usr/local/bin' >> ~/.zshrc\nsource ~/.zshrc\n\nEnsure that you have a recent Rust version:
\nrustc --version # Should be 1.75.0 or later\nrustup update stable\n\nGet up and running with Terraphim AI in just 5 minutes.
\nChoose your preferred installation method:
\n# Single command installation with platform detection\ncurl -fsSL https://raw.githubusercontent.com/terraphim/terraphim-ai/main/scripts/install.sh | bash\n\nbrew tap terraphim/terraphim && brew install terraphim-ai\n\n# Install agent with interactive TUI\ncargo install terraphim-agent\n\n# Install CLI for automation\ncargo install terraphim-cli\n\n\nTerraphim server provides HTTP API and knowledge graph backend.
\nterraphim_server\n\nBy default, server runs on http://localhost:8080.
You should see output like:
\n[INFO] Terraphim Server v1.16.32 starting...\n[INFO] Server listening on http://localhost:8080\n[INFO] Knowledge graph initialized\n\nIn a new terminal, start the interactive agent:
\nterraphim-agent\n\nYou'll see a welcome message and can start typing commands:
\nTerraphim AI Agent v1.16.32\nType 'help' for available commands\n\n> search rust async\nFound 12 results for 'rust async'\n\n> roles select engineer\nRole set to: Engineer (optimising for technical depth)\n\n> search patterns\nFound 8 results for 'patterns'\n\nHere are the most useful commands to get started:
\n> search <query> # Search knowledge graph\n> roles select <name> # Set search role (engineer, architect, etc.)\n> connect <term1> <term2> # Link two terms in knowledge graph\n> import <file> # Import markdown file into knowledge graph\n> export <format> # Export knowledge graph (json, csv)\n> status # Show server status and statistics\n> help # Show all available commands\n\nImport your markdown files or documentation:
\n# Import a single file\nimport ~/notes/project-a.md\n\n# Import entire directory\nimport ~/Documents/knowledge-base/\n\nConfigure Terraphim to search different sources:
\n# Search GitHub repositories\nsource add github https://github.com/terraphim/terraphim-ai\n\n# Search StackOverflow\nsource add stackoverflow rust tokio\n\n# Search local filesystem\nsource add filesystem ~/code/ --recursive\n\n> search how to implement async channels in rust\n\n> roles select architect\n> search system design patterns\n\n> connect tokio async\n> show tokio\n\nFor automation and scripting, use the CLI instead of REPL:
\n# Search and get JSON output\nterraphim-cli search "async patterns" --format json\n\n# Import files programmatically\nterraphim-cli import ~/notes/*.md --recursive\n\n# Set role and search\nterraphim-cli search "rust error handling" --role engineer\n\nHere's a complete example workflow:
\n# 1. Start the server (in one terminal)\nterraphim_server &\n\n# 2. Import your codebase (in another terminal)\nterraphim-agent\n> import ~/my-project/src/\n\n# 3. Search for information\n> search error handling patterns\n\n# 4. Set role for better results\n> roles select senior-engineer\n\n# 5. Search again with role context\n> search error handling patterns\n\n# 6. Export results\n> export json > search-results.json\n\nIf you run into issues:
\nMost of the functionality is driven from the config file.
\nsection for global parameters - like global shortcuts
\n[[roles]]\n\nFor example, I can be an engineer, architect, father, or gamer. In each of those roles I have different concerns, which drive different relevance/scoring and UX requirements.
\nRoles are separate abstract layers that define the behaviour of search for a particular role. This roughly follows the role definitions from ISO 42010 and other systems-engineering materials: at different points in time one can wear a different hat (a different role).
\nEach role has a
\nThe powers roughly follow:
\n[[Skill]]\n\nParameters:
\nA haystack is a data source: PubMed, GitHub, Coda.io, Notion.so, etc.\nHaystack arguments
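Putting the pieces together, a minimal role-plus-haystack sketch. The relevance_function and kg fields follow the JSON config shape shown in the graph-embeddings documentation; the haystacks block and its field names are assumptions for illustration, not the verified schema:

```json
{
  "roles": {
    "Engineer": {
      "relevance_function": "terraphim-graph",
      "kg": { "knowledge_graph_local": { "path": "./docs/engineering" } },
      "haystacks": [ { "service": "GitHub" } ]
    }
  }
}
```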
\n", "https://terraphim.ai/properties/date" : "2022-02-21", "https://atomicdata.dev/properties/tags" : {"categories":["Documentation"],"tags":["terraphim","config","plugins"]}, "https://atomicdata.dev/properties/isA": [ "https://atomicdata.dev/classes/Article" ] },{"localId": "https://terraphim.ai/docs/contribution/", "https://atomicdata.dev/properties/name" : "Contribution Guidelines", "https://terraphim.ai/properties/url" : "https://terraphim.ai/docs/contribution/", "https://atomicdata.dev/properties/description" : "General guidelines for contributing to the project.
\nEven for conceptual topics such as ethics or enterprise architecture, there should be a path to implementation in the real world: a student's head or software running in production both count as the real world.
\n", "https://terraphim.ai/properties/date" : "2021-12-15", "https://atomicdata.dev/properties/tags" : {"categories":["Documentation"],"tags":["contribute","zola"]}, "https://atomicdata.dev/properties/isA": [ "https://atomicdata.dev/classes/Article" ] },{"localId": "https://terraphim.ai/docs/donate/", "https://atomicdata.dev/properties/name" : "Support project by contributing", "https://terraphim.ai/properties/url" : "https://terraphim.ai/docs/donate/", "https://atomicdata.dev/properties/description" : "This is the beginning of an exciting great new journey: support open-source projects by donating or contributing.
\nTerraphim AI is partially supported by Innovate UK via the Eureka funding program up to 60% of costs, under grant 600594, \"ATOMIC\", jointly with our collaborators Ontola - the team behind Atomic Data Server and Protocol.\nBut for the rest of the costs, we need your help.
\nAfter several months of agonising over which license to release Terraphim AI under, I created a new repository under the two most liberal open-source licenses: MIT and Apache 2.0.\nWe may build a commercial service on top of Terraphim Core (Cortex), but the core is open source, and our promise is that it stays that way so you can all build on top of Terraphim AI. This is privacy-first AI tooling built the right way around: you don't need to move your data. You codify your knowledge and move it where it needs to be, where it helps you get things done.
\nTo support the project's development, we introduce a \"donation-driven roadmap\": if you want a feature, vote for it not only with your thumb but with your money.\nIn exchange:
\nSponsor the features on our GitHub page or propose a new idea in GitHub discussions
\n", "https://terraphim.ai/properties/date" : "2020-08-31", "https://atomicdata.dev/properties/tags" : {"categories":["donations","open-source"],"tags":["donate","support"]}, "https://atomicdata.dev/properties/isA": [ "https://atomicdata.dev/classes/Article" ] },{"localId": "https://terraphim.ai/posts/sentrux-vs-terraphim-eval-comparison/", "https://atomicdata.dev/properties/name" : "Sentrux vs Terraphim: Two Approaches to AI Code Quality Evaluation", "https://terraphim.ai/properties/url" : "https://terraphim.ai/posts/sentrux-vs-terraphim-eval-comparison/", "https://atomicdata.dev/properties/description" : "As AI agents write more and more production code, the question shifts from \"can the agent write code?\" to \"can we verify the code is good?\" Two tools tackle this from opposite ends: Sentrux measures structural health, Terraphim measures semantic completeness. Neither is complete without the other.
\nAn AI agent that generates 500 lines of syntactically correct Rust can still degrade a codebase. It might introduce tight coupling, leave behind unimplemented stubs, create circular dependencies, or raise the cyclomatic complexity past any reasonable threshold. Standard CI pipelines catch compilation errors and failing tests. They do not catch architectural erosion.
\nBoth Sentrux and the Terraphim evaluation toolkit are designed to close that gap. They just do so from fundamentally different perspectives.
\nSentrux (v0.5.7, MIT, 27.7k lines of Rust) is a real-time structural analysis engine. It parses source code into an AST using tree-sitter, builds a dependency graph, and computes a quality signal from 0 to 10,000.
\nSentrux computes a geometric mean across five orthogonal dimensions:
\n| Metric | What it captures |
|---|---|
| Modularity | Fan-in and fan-out between modules; god file detection |
| Acyclicity | Circular dependency count and which files participate |
| Depth | Maximum dependency chain length; instability score |
| Equality | Cyclomatic complexity distribution (Gini coefficient); large files |
| Redundancy | Dead functions; duplicate code groups |
These are structural facts derived from the AST and dependency graph, not text patterns. Sentrux does not care what the code says; it cares how it connects.
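Because the five dimensions combine multiplicatively, a single weak dimension drags the whole signal down. A minimal stdlib-only sketch of that idea (the dimension names, the 0-to-1 scaling, and the function shape are illustrative, not Sentrux's actual internals):

```rust
/// Combine per-dimension scores in [0.0, 1.0] into a 0..=10_000 signal via a
/// geometric mean, and report the weakest dimension as the bottleneck.
/// Illustrative sketch only; Sentrux's real computation may differ.
fn quality_signal(dims: &[(&str, f64)]) -> (u32, String) {
    let n = dims.len() as f64;
    let product: f64 = dims.iter().map(|(_, v)| v).product();
    let geometric_mean = product.powf(1.0 / n);
    let bottleneck = dims
        .iter()
        .min_by(|a, b| a.1.partial_cmp(&b.1).unwrap())
        .map(|(name, _)| name.to_string())
        .unwrap();
    ((geometric_mean * 10_000.0).round() as u32, bottleneck)
}

fn main() {
    let dims = [
        ("modularity", 0.62),
        ("acyclicity", 0.95),
        ("depth", 0.80),
        ("equality", 0.74),
        ("redundancy", 0.88),
    ];
    let (signal, bottleneck) = quality_signal(&dims);
    println!("signal: {signal}, bottleneck: {bottleneck}");
}
```

With a geometric rather than arithmetic mean, fixing the named bottleneck is the fastest way to raise the overall signal.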
\nSentrux ships a native MCP server with nine tools. The intended agent workflow is:
\nsentrux.scan("/path/to/project")\n -> { quality_signal: 7342, files: 139, bottleneck: "modularity" }\n\nsentrux.session_start()\n -> baseline saved\n\n... agent writes code ...\n\nsentrux.session_end()\n -> { pass: false, signal_before: 7342, signal_after: 6891,\n summary: "Quality degraded during this session" }\n\nThe agent gets a precise numeric verdict with a named bottleneck. It can iterate: check the bottleneck, fix it, call rescan, repeat.
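The session verdict in the transcript above reduces to a comparison of the two snapshots. A hypothetical sketch of that check — the "no drop allowed" pass rule is an assumption inferred from the example output, not Sentrux's documented rule:

```rust
/// Pass only if the quality signal did not drop while the agent worked.
/// Hypothetical rule inferred from the example session output above.
fn session_verdict(signal_before: u32, signal_after: u32) -> (bool, i64) {
    let delta = signal_after as i64 - signal_before as i64;
    (delta >= 0, delta)
}

fn main() {
    // The degraded session from the transcript: 7342 -> 6891.
    let (pass, delta) = session_verdict(7342, 6891);
    println!("pass: {pass}, delta: {delta}");
}
```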
Sentrux includes a live treemap built with egui and wgpu. Files are sized by metric contribution and glow when modified. Dependency edges are drawn between coupled files. A file system watcher (notify) feeds changes over crossbeam-channel to the renderer in real time. The same process that runs the GUI hosts the MCP server, so you can watch architectural changes happen as the agent works.
\n52 languages via tree-sitter plugins. Each plugin is a plugin.toml and a tags.scm query file. No Rust required to add a language.
The Terraphim approach lives across two production crates in terraphim-ai: terraphim_codebase_eval and terraphim_negative_contribution. These are not a conceptual framework or a shell script wrapper; they are typed Rust libraries integrated into the agent review pipeline.
The manifest evaluator (terraphim_codebase_eval): the evaluation manifest is a TOML file that describes a before/after comparison at the git SHA level:
\n[[haystacks]]\nid = "baseline"\npath = "/srv/repo"\ncommit_sha = "abc123"\nstate = "baseline"\n\n[[haystacks]]\nid = "candidate"\npath = "/srv/repo"\ncommit_sha = "def456"\nstate = "candidate"\n\n[[roles]]\nrole_id = "code-reviewer"\ndescription = "Reviews for bugs and maintainability"\nterm_sets = ["bug-patterns", "code-smells"]\n\n[roles.scoring_weights]\nsearch_score = 1.0\ngraph_density = 0.8\nentity_count = 1.0\n\n[[queries]]\nquery_text = "highlight potential bugs"\nrole_id = "code-reviewer"\nexpected_signal = "increase"\nconfidence_threshold = 0.6\n\n[thresholds]\nimproved_pct = 10.0\ndegraded_pct = 5.0\ncritical_test_failures = 0\n\nRoles define the evaluation perspective. Each role carries named term sets (Aho-Corasick dictionaries built from domain knowledge graphs) and per-dimension scoring weights. Queries specify what to search for and in which direction the score should move. The manifest is validated for referential integrity before execution: any query that references a non-existent role is rejected at load time, not at runtime.
\nThe verdict logic is explicit: if the weighted score increases by more than improved_pct, the contribution is classified as Improved; if it drops by more than degraded_pct, it is Degraded; any new test failure triggers immediate Degraded regardless of scores.
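That decision rule is small enough to state directly in code. A sketch under the assumption that the delta is computed as a percentage of the baseline score (parameter names mirror the manifest fields above):

```rust
#[derive(Debug, PartialEq)]
enum Verdict {
    Improved,
    Degraded,
    Neutral,
}

/// Classify a contribution from the weighted scores of the two haystacks.
/// Sketch only: assumes the delta is a percentage of the baseline score.
fn classify(
    baseline_score: f64,
    candidate_score: f64,
    improved_pct: f64,
    degraded_pct: f64,
    new_test_failures: u32,
) -> Verdict {
    // Any new test failure is an immediate Degraded, regardless of scores.
    if new_test_failures > 0 {
        return Verdict::Degraded;
    }
    let delta_pct = (candidate_score - baseline_score) / baseline_score * 100.0;
    if delta_pct > improved_pct {
        Verdict::Improved
    } else if delta_pct < -degraded_pct {
        Verdict::Degraded
    } else {
        Verdict::Neutral
    }
}

fn main() {
    // Thresholds from the manifest above: improved_pct = 10.0, degraded_pct = 5.0.
    println!("{:?}", classify(100.0, 115.0, 10.0, 5.0, 0)); // Improved
    println!("{:?}", classify(100.0, 103.0, 10.0, 5.0, 1)); // Degraded: new test failure
}
```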
The stub scanner (terraphim_negative_contribution): the Explicit Deferral Marker (EDM) scanner answers a specific and critical question: did the agent ship stubs as production code?
\nIt uses an Aho-Corasick automaton to detect markers in Rust source that indicate deferred implementation:
\ntodo!(), unimplemented!(), panic!(\"not implemented\"), and panic!(\"TODO\").

The scanner is production-only by design. It automatically skips:
\nthe tests/, examples/, and benches/ directories; build.rs; files ending in _test.rs; and code under #[test] or #[cfg(test)].

Suppression is available per line: // terraphim: allow(stub) silences the finding on that line only. Every finding carries a file path, line number, severity, category, confidence (0.95), and a suggestion drawn from the thesaurus URL metadata.
The scanner outputs a ReviewAgentOutput struct consumed directly by the Terraphim review pipeline. An agent that ships one todo!() in production code fails the gate.
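The scanner's core check can be approximated in a few lines. The production crate builds an Aho-Corasick automaton over the pattern thesaurus; this stdlib-only sketch uses plain substring search to show the same semantics, including the per-line suppression comment:

```rust
const EDM_PATTERNS: [&str; 4] = [
    "todo!()",
    "unimplemented!()",
    "panic!(\"not implemented\")",
    "panic!(\"TODO\")",
];

/// Return (1-based line number, matched pattern) for every unsuppressed EDM
/// hit. Sketch of the scanner's core check, not the production crate.
fn edm_findings(source: &str) -> Vec<(usize, &'static str)> {
    let mut findings = Vec::new();
    for (idx, line) in source.lines().enumerate() {
        // Per-line suppression: `// terraphim: allow(stub)` silences the line.
        if line.contains("terraphim: allow(stub)") {
            continue;
        }
        for pat in EDM_PATTERNS {
            if line.contains(pat) {
                findings.push((idx + 1, pat));
            }
        }
    }
    findings
}

fn main() {
    let src = "fn parse() { todo!() }\nfn skip() { todo!() } // terraphim: allow(stub)";
    // Only the unsuppressed hit on line 1 is reported.
    println!("{:?}", edm_findings(src));
}
```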
| Dimension | Sentrux | Terraphim EDM Scanner | Terraphim Manifest Eval |
|---|---|---|---|
| Core mechanism | tree-sitter AST + dependency graph | Aho-Corasick on EDM pattern thesaurus | Aho-Corasick on domain KG + weighted scoring |
| What it measures | Architecture: coupling, cycles, complexity | Incomplete implementations in production code | Semantic quality delta between two git SHAs |
| Language scope | 52 languages | Rust only (production files) | Any language with KG term sets |
| Unit of comparison | Quality signal delta within a session | Pass/fail per file | Before/after manifest with role-weighted scores |
| Agent integration | MCP server (9 tools, session lifecycle) | ReviewAgentOutput struct in review pipeline | EvaluationManifest loaded per evaluation run |
| False positive risk | Low (graph-structural, not text) | Very low (exact stub patterns, test exclusions) | Medium (depends on KG quality) |
| Customisation | rules.toml (layer boundaries, thresholds) | // terraphim: allow(stub) suppression | KG term sets, role weights, query direction |
| Live feedback | Yes (file watcher, treemap, MCP) | No (batch scan) | No (batch manifest) |
| Verdict granularity | Named bottleneck + per-metric breakdown | Finding list with file:line and suggestion | Improved / Degraded / Neutral with percentage |
These tools are not alternatives. They operate at different layers of the quality stack:
\nSentrux answers: Is the architecture getting worse?
\nTerraphim EDM Scanner answers: Did the agent leave stubs in production code?
\nTerraphim Manifest Eval answers: Did the agent improve or degrade domain-specific semantic quality across these two commits?
\nA complete agent quality gate combines all three:
\nsentrux session_start before the agent begins work; the EDM scanner and the manifest evaluation on the finished change; sentrux session_end to verify no structural degradation.

If any gate fails, the contribution is blocked. The agent gets specific, actionable feedback: a named structural bottleneck from Sentrux, a file and line number from the EDM scanner, or a percentage degradation from the manifest evaluator.
\nIf you are instrumenting an AI agent pipeline today, start with the Terraphim EDM scanner. It requires no configuration beyond pointing it at your Rust source. It has a binary pass/fail verdict, zero false positives on well-written production code, and integrates directly into the existing ReviewAgentOutput pipeline.
Add Sentrux when you want continuous architectural visibility. The MCP integration means the agent can self-correct in real time rather than discovering structural problems only at gate check.
\nAdd the Terraphim manifest evaluation when you have a domain knowledge graph and want to verify that the agent's changes improve semantic coverage in your specific domain, not just compile and pass tests.
\nTogether, they give you three independent quality signals that a capable agent must satisfy simultaneously: structural soundness, implementation completeness, and semantic improvement.
\nVector embedding calls are the hidden tax on every RAG pipeline. You pay latency, you pay API cost, you get probabilistic results that vary run to run. There is a class of problem where none of that is acceptable. This post shows how Terraphim replaces embedding calls with Aho-Corasick finite-state automata -- deterministic, auditable, and under one millisecond for 1.4 million patterns.
\nWhen you call an embedding API to retrieve context for an LLM prompt, three things happen:
\nFor general-purpose assistants, these tradeoffs are acceptable. For domain-specific systems -- medical, legal, engineering -- they are not. A system that cannot explain its retrieval decisions cannot be trusted.
\nAho-Corasick is a classical multi-pattern string matching algorithm. Given a dictionary of N patterns, it builds a finite-state automaton at construction time and then scans any input text in O(n) time regardless of how many patterns are in the dictionary.
\nTerraphim builds knowledge graph automata on top of this: each node in the automaton is a domain concept, edges encode synonyms and related terms, and matching returns not just a span but a structured entity with graph position.
\nInput text: "patient presents with BRAF V600E mutation"\n |\n Aho-Corasick scan\n |\n Match: "BRAF V600E" -> node: Gene:BRAF, variant: V600E\n |\n Graph traversal\n |\n Edges: BRAF -> Treats <- Vemurafenib\n BRAF -> TestedBy <- Cobas 4800 assay\n BRAF -> Contraindicated <- Sorafenib (paradoxical activation)\n\nThe match is deterministic. The traversal is deterministic. The context injected into the prompt is always the same for the same input.
\n┌─────────────────────────────────────────────────────┐\n│ Input Text │\n└───────────────────────┬─────────────────────────────┘\n │\n ┌─────────▼──────────┐\n │ Aho-Corasick FSM │ < 1ms for 1.4M patterns\n │ (terraphim_automata│\n └─────────┬──────────┘\n │ matched spans + normalized terms\n ┌─────────▼──────────┐\n │ Thesaurus Layer │ synonym expansion,\n │ (terraphim_types) │ canonical form resolution\n └─────────┬──────────┘\n │ NormalizedTerm { url, rank, ... }\n ┌─────────▼──────────┐\n │ Role Graph │ 27 node types, 65 edge types\n │ (terraphim_ │ Jaccard + path distance scoring\n │ rolegraph) │\n └─────────┬──────────┘\n │ ranked context passages\n ┌─────────▼──────────┐\n │ Prompt Builder │ inject into LLM prompt\n └────────────────────┘\n\nEach stage is a Rust crate with a stable public API. You can use any layer independently.
\ncargo add terraphim_automata terraphim_types terraphim_rolegraph\n\nFor Python (via PyO3 bindings):
\npip install terraphim-automata\n\nFor JavaScript/TypeScript (via WASM):
\nnpm install @terraphim/automata\n\nThe thesaurus is a JSON file mapping term strings to NormalizedTerm records. Terraphim ships pre-built thesauri for SNOMED CT, UMLS, and software engineering domains. You can also build your own from markdown knowledge graph files.
use terraphim_automata::{load_thesaurus_from_json, find_matches};\n\nlet thesaurus_json = std::fs::read_to_string("snomed-tier1.json")?;\nlet thesaurus = load_thesaurus_from_json(&thesaurus_json)?;\n\nlet text = "Patient presents with BRAF V600E mutation and melanoma stage IV.";\nlet matches = find_matches(text, thesaurus, true)?; // true = leftmost-longest\n\nfor m in &matches {\n println!("{:?} at {:?}", m.term, m.pos);\n}\n\nfind_matches is O(n) in the input length. On an M2 MacBook Pro, matching 1.4 million SNOMED patterns against a 500-word clinical note completes in 0.3ms.
Once you have matched entities, traverse the role graph to collect supporting context:
\nuse terraphim_rolegraph::RoleGraph;\n\nlet graph = RoleGraph::from_kg_path("~/.config/terraphim/kg/medical/")?;\n\nfor m in &matches {\n    let context = graph.traverse(&m.normalized_term, 2)?; // depth = 2\n    // context contains related concepts, evidence paths, ranked passages\n}\n\nThe traversal depth controls how far to expand from each matched entity. Depth 1 gives direct neighbours (synonyms, treatments, contraindications). Depth 2 adds second-degree connections (clinical trials, guidelines, variants).
\nWe ran identical clinical cases through Google's MedGemma model with and without Terraphim knowledge graph grounding:
\n| Case | Raw MedGemma (no KG) | With Terraphim KG Grounding |
|---|---|---|
| BRAF V600E Melanoma | \"BRAF inhibitor (e.g., Dabrafenib + Trametinib)\" -- vague class suggestion | Vemurafenib 450mg orally once daily -- specific drug and dose |
| CYP2D6 Codeine Sensitivity | Oxycodone 5 mg/mL -- wrong drug entirely | Codeine 60mg every 6h -- correct drug from KG context |
| EGFR NSCLC | Osimertinib 80mg (correct on this run; prior run hallucinated 800mg -- a 10x overdose) | Osimertinib 80mg -- consistently correct per FLAURA trial |
Without graph grounding, the LLM gives vague class-level suggestions, recommends the wrong drug, or produces dosing errors that vary between runs. With Terraphim KG grounding, every recommendation is specific, correct, and reproducible.
\n| Run | Pass Rate | Safety Gate | KG Grounding | Avg Latency |
|---|---|---|---|---|
| CPU | 18/18 (100%) | 100% | 83.3% | 165.3s |
| GPU #1 | 18/18 (100%) | 100% | 77.8% | 23.5s |
| GPU #2 | 18/18 (100%) | 100% | 83.3% | 24.8s |
36 total inference calls. Zero safety failures. No mocked responses.
\nThe LLM latency dominates (23-165 seconds depending on hardware). The Terraphim matching and graph traversal contributes under 1ms to each call.
\nThe medical case is the hardest version of the problem: the stakes are high, the domain is large (1.4M SNOMED terms), and incorrect context causes real harm. The same architecture applies anywhere you need deterministic, auditable retrieval:
\nThe full stack -- MedGemma 4B + SNOMED automata + role graph -- runs on a single machine in under 4GB RAM. There is no vector database daemon to operate, no embedding API to call, and no GPU required for the retrieval layer.
\nterraphim_automata (Aho-Corasick FSM, 1.4M patterns): ~800MB RAM\nMedGemma 4B (quantised): ~2.8GB RAM\nRole graph (27 node types, 65 edge types): ~120MB RAM\nTotal: ~3.7GB\n\nCompare this to a typical embedding-based RAG stack: embedding model (~500MB) + vector database process (~1-2GB) + embedding API latency per call.
\nThe theory behind automata-based context injection, when to use it versus embeddings, and how to compose it with LLM routing is the subject of Chapters 5-7 of Context Engineering with Knowledge Graphs (launching 2026).
\nSource: github.com/terraphim/terraphim-ai -- 42 Rust crates, WASM-ready, MCP-native, Apache 2.0.
\n", "https://terraphim.ai/properties/date" : "2026-04-29", "https://atomicdata.dev/properties/tags" : {"categories":["Technical"],"tags":["Terraphim","knowledge-graph","Aho-Corasick","FST","context-engineering","tutorial"]}, "https://atomicdata.dev/properties/isA": [ "https://atomicdata.dev/classes/Article" ] },{"localId": "https://terraphim.ai/posts/frontend-developer-agent-walkthrough/", "https://atomicdata.dev/properties/name" : "Building a Front-End Developer Agent with Knowledge Graphs and Code Search", "https://terraphim.ai/properties/url" : "https://terraphim.ai/posts/frontend-developer-agent-walkthrough/", "https://atomicdata.dev/properties/description" : "We have published a comprehensive walkthrough showing how to build a specialised front-end developer agent using Terraphim's knowledge graph system, dual haystacks, and deterministic search. This post walks through the key concepts and links to the full guide.
\nFront-end development spans a vast surface area: CSS layout, accessibility, TypeScript types, Svelte reactivity, build tooling, performance, testing, and more. When you search for \"how do I make this accessible?\" or \"what's the SvelteKit pattern for form validation?\", you want answers that understand your domain -- not generic web search results.
\nTerraphim's deterministic knowledge graph approach means every search result is grounded in concepts you define, ranked by relevance, and fully reproducible. No hallucination, no non-deterministic LLM output.
\nThe front-end developer agent combines three capabilities:
\nKnowledge Graph: 18 concept files covering responsive design, accessibility, Svelte/SvelteKit patterns, TypeScript, CSS layout, state management, build tools, testing, performance, and more. Each concept has synonyms that resolve deterministically via Aho-Corasick matching.
\nLocal Search (Ripgrep): Searches your project files -- .svelte, .ts, .css, .md -- using the knowledge graph to boost conceptually relevant files.
Global Code Search (GrepApp): Searches millions of public GitHub repositories for TypeScript code patterns via the grep.app API, filtered to TypeScript for modern front-end relevance.
\nResults from both haystacks are merged and ranked using TerraphimGraph, a hybrid scoring algorithm that combines knowledge graph concept matching with TF-IDF rescoring. In testing, a query for \"svelte component\" returned 13 results with TerraphimGraph versus 1 result with the simpler BM25Plus scorer. The 18 concept files with 358 synonyms actively influence ranking, not just display.
\nEach concept is a Markdown file with a heading, description, and synonym list:
\n# Svelte Patterns\n\nSvelte-specific patterns for building reactive, compiled frontend\napplications using runes, stores, and SvelteKit conventions.\n\nsynonyms:: Svelte, SvelteKit, rune, $state, $derived, $effect,\n$props, bind, each block, await block, load function, +page.svelte\n\nWhen you search for $derived or +page.svelte, the Aho-Corasick automaton matches it to the \"Svelte Patterns\" concept in O(n+m) time. The matching is case-insensitive and leftmost-longest, so \"CSS grid\" matches as one term rather than two separate words.
If no exact match exists, a TF-IDF fallback kicks in using trigger:: directives for semantic similarity.
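The leftmost-longest behaviour is easy to see in isolation. The real matcher is an Aho-Corasick automaton over the full synonym set; this toy stdlib-only version scans position by position and always prefers the longest pattern, which is enough to reproduce the "CSS grid" example (ASCII input assumed, since lowercasing can change byte lengths otherwise):

```rust
/// Toy leftmost-longest, case-insensitive matcher. At each position the
/// longest matching pattern wins and the cursor jumps past it, so "CSS grid"
/// is reported once rather than as "CSS" plus "grid". Illustrative only.
fn leftmost_longest(text: &str, patterns: &[&str]) -> Vec<(usize, String)> {
    let haystack = text.to_lowercase();
    let needles: Vec<String> = patterns.iter().map(|p| p.to_lowercase()).collect();
    let mut matches = Vec::new();
    let mut i = 0;
    while i < haystack.len() {
        // Prefer the longest needle that starts at this position.
        let best = needles
            .iter()
            .filter(|n| haystack[i..].starts_with(n.as_str()))
            .max_by_key(|n| n.len());
        match best {
            Some(n) => {
                matches.push((i, n.clone()));
                i += n.len(); // jump past the whole match
            }
            None => i += 1,
        }
    }
    matches
}

fn main() {
    let hits = leftmost_longest("Use CSS grid here", &["CSS", "grid", "CSS grid"]);
    // One match for "css grid", not two separate words.
    println!("{hits:?}");
}
```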
Terraphim offers multiple relevance functions. For a knowledge-graph-backed agent, terraphim-graph is strictly superior to the simpler bm25plus:
| Aspect | BM25Plus | TerraphimGraph |
|---|---|---|
| KG concepts affect ranking | No (display only) | Yes (graph + TF-IDF hybrid) |
| Term co-occurrence | Not used | Boosts related documents |
| KG link insertion | Disabled | Enabled in results |
| TF-IDF rescoring | Not applied | 30% weight boost |
TerraphimGraph uses a two-pass scoring system. Pass 1 builds a co-occurrence graph from Aho-Corasick matches and ranks documents by total_rank = node_rank + edge_rank + document_rank. Pass 2 applies TF-IDF rescoring at 30% weight. Documents containing co-occurring concepts (e.g., \"svelte\" + \"component\" + \"state management\") score higher than those matching only one term.
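One plausible reading of that two-pass blend in code. The exact weighting inside terraphim_rolegraph may differ; the 70/30 split below is an assumption based on the "30% weight" figure above, and the struct fields are illustrative:

```rust
/// Pass-1 graph rank plus pass-2 TF-IDF rescoring at 30% weight.
/// Hypothetical sketch of the blend, not the crate's actual formula.
struct DocScore {
    node_rank: f64,
    edge_rank: f64,
    document_rank: f64,
    tfidf: f64,
}

fn hybrid_score(d: &DocScore) -> f64 {
    // Pass 1: co-occurrence graph ranking.
    let total_rank = d.node_rank + d.edge_rank + d.document_rank;
    // Pass 2: mix in TF-IDF at 30% weight.
    0.7 * total_rank + 0.3 * d.tfidf
}

fn main() {
    let doc = DocScore { node_rank: 4.0, edge_rank: 3.0, document_rank: 3.0, tfidf: 10.0 };
    println!("{}", hybrid_score(&doc));
}
```

A document matching several co-occurring concepts accumulates edge rank in pass 1, so it outranks a document that matches only one term even if their TF-IDF scores are similar.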
The knowledge graph is tuned for Svelte/SvelteKit development with TypeScript:
\nSvelte runes ($state, $derived, $effect), stores, components, and transitions; SvelteKit routes (+page.svelte, +page.ts, +layout.svelte), form actions, and load functions; and TypeScript's satisfies operator.

The GrepApp haystack is filtered to language: "typescript" rather than JavaScript, reflecting the modern Svelte/SvelteKit ecosystem.
# Build the agent\ngit clone https://github.com/terraphim/terraphim-ai.git\ncd terraphim-ai\ncargo build --release\n# Enable GrepApp haystack support\ncargo build --release -p terraphim_middleware --features grepapp\ncargo install --path crates/terraphim_agent\n\n# Set up the front-end developer role\nterraphim-agent setup --template frontend-engineer --path ~/projects\n\n# Search across local files and GitHub\nterraphim-agent search "flexbox responsive layout"\n\nQuery: "flexbox responsive layout"\n |\n v\n[Auto-route] -> Front-End Developer role\n |\n v\n[Aho-Corasick] -> CSS Layout + Responsive Design concepts\n |\n v\n[Ripgrep] [GrepApp (TypeScript)]\n | |\n v v\n[TerraphimGraph hybrid scoring:\n Pass 1: KG graph ranking (node + edge co-occurrence)\n Pass 2: TF-IDF rescoring (30% weight boost)]\n |\n v\n Ranked results\n\nThe terraphim_mcp_server binary exposes all knowledge graph tools via the Model Context Protocol, so any MCP-compatible AI coding agent can use your front-end developer KG during coding sessions.
opencode (~/.config/opencode/opencode.json):
{\n "mcp": {\n "terraphim": {\n "type": "local",\n "command": ["~/.cargo/bin/terraphim_mcp_server"],\n "environment": { "TERRAPHIM_DATA_PATH": "~/.terraphim" }\n }\n }\n}\n\nClaude Code (~/.claude.json):
{\n "mcpServers": {\n "terraphim": {\n "type": "stdio",\n "command": "~/.cargo/bin/terraphim_mcp_server",\n "env": { "RUST_LOG": "error" }\n }\n }\n}\n\nAfter configuration, the AI agent gains access to 18 MCP tools: search, autocomplete_terms, replace_matches, terraphim_find_files, terraphim_grep, and more. Queries auto-route to the Front-End Developer role when front-end terms are detected.
For Cursor, Windsurf, or any HTTP-based MCP client, start the SSE server: terraphim_mcp_server --sse --bind 127.0.0.1:8000.
The complete step-by-step guide covers:
\nRead it at: docs/walkthroughs/frontend-developer-agent.md
This walkthrough demonstrates the deterministic, KG-first approach. Natural extensions include:
\nThe knowledge graph pattern is universal: define concepts, add synonyms, point at haystacks, and search. No training, no fine-tuning, no API keys for the deterministic path.
\n", "https://terraphim.ai/properties/date" : "2026-04-23", "https://atomicdata.dev/properties/tags" : {"tags":["Terraphim","walkthrough","knowledge-graph","grepapp","Svelte","SvelteKit","TypeScript","agent"],"categories":["Technical"]}, "https://atomicdata.dev/properties/isA": [ "https://atomicdata.dev/classes/Article" ] },{"localId": "https://terraphim.ai/posts/why-graph-embeddings-matter/", "https://atomicdata.dev/properties/name" : "Why Graph Embeddings Matter", "https://terraphim.ai/properties/url" : "https://terraphim.ai/posts/why-graph-embeddings-matter/", "https://atomicdata.dev/properties/description" : "Vector databases are probabilistic and slow. Graph embeddings are deterministic and sub-millisecond. If you are building context for an AI coding agent — or any system where you need to know why a result came back — the difference is not academic. It changes what your application is allowed to promise.
\nTerraphim represents concepts as nodes in a knowledge graph and ranks them by how many synonyms and edges connect them. There is no embedding model, no GPU, no per-query distance computation in a 1024-dimensional space. There is an Aho-Corasick automaton built once, queried in O(n+m+z) time over the input length plus the number of matches. The mechanism is described in detail on the Graph Embeddings reference page; this post is about why it matters.
\nThree numbers carry the argument. Each is reproducible on a laptop.
\nFor comparison, a typical vector-DB nearest-neighbour query lands in the 5–50 ms range after you have paid the embedding API call (50–500 ms) and the network round-trip. We are not in the same regime.
\nThe numbers are interesting on their own. The reason they matter is what they let you build.
\nEvery match in Terraphim traces back to a specific edge in the knowledge graph and a specific synonym in a specific role. There is no \"the model said so.\" When a search returns a document, you can show the user exactly which terms matched, which role's graph supplied the synonym, and which edges connected them. That is not a debugging nicety — it is a regulatory requirement in any domain where you have to defend a decision after the fact. Healthcare, legal, finance, government. Vector search by construction cannot do this.
\nAdding a new concept is a text edit. You write the synonym down, you point Terraphim at the file, the graph rebuilds in 20 ms. There is no training run, no GPU bill, no \"we need to schedule a retrain on the new corpus.\" This collapses the loop between noticing a gap and fixing the gap from days or weeks to seconds. For an AI coding agent that needs to learn a project's vocabulary as you onboard, this is the difference between a working tool and a stalled rollout.
\nBecause matching is done on normalised terms — synonyms you supply explicitly — the same node in the graph can carry English, French, Russian, and Mandarin labels at no extra cost. There is no language-detection step, no per-language embedding model, no separate index. The query \"consensus\" and the query \"консенсус\" both reach the same node if you have told the graph they are synonyms. Stop-word lists become irrelevant: if a word is not in the graph, it does not match, full stop.
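A sketch of what "same node, many labels" looks like as data. The node identifier and the Mandarin label are illustrative; the document's own example pair is consensus/консенсус:

```rust
use std::collections::HashMap;

/// Many surface forms, one concept node. Lookup is identical for every
/// language because matching happens on the normalised terms you supplied.
/// The node id and the extra labels are illustrative, not Terraphim data.
fn synonym_index() -> HashMap<&'static str, &'static str> {
    HashMap::from([
        ("consensus", "node:consensus"),
        ("консенсус", "node:consensus"),
        ("共识", "node:consensus"),
    ])
}

fn main() {
    let index = synonym_index();
    // Both queries reach the same node; a term missing from the graph
    // simply does not match, so no stop-word list is needed.
    println!("{:?}", index.get("консенсус"));
    println!("{:?}", index.get("the")); // None
}
```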
\nThe pieces above are infrastructure. The story arc continues:
\nnpm install, intercept it via a graph-embeddings match and replace it with bun install. We wrote this up at Teaching AI Coding Agents with Knowledge Graph Hooks — that post is the demo of what this engine enables.If you want to wire this into your own project, the Command Rewriting How-to walks through the moving parts: where to put your synonyms, how the role graph is built, how hooks call the matcher.
\nThe mechanism — automata, ranking formula, ASCII walk-through — is on the Graph Embeddings reference page. Read that next if you want the data structures.
\nThe current default in the AI tooling ecosystem is to reach for a vector database the moment anyone mentions \"semantic search.\" It is the path of least resistance because the tools are well-marketed and the API surface is familiar. But for a large class of problems — explainability-first systems, on-device agents, anywhere you need a hard latency budget or a hard explainability guarantee — graph embeddings are the better-engineered answer. Not the only answer; the better one for that class.
\nThe promotion campaign over the next few weeks goes deeper: a sub-millisecond context article walks through the FST/Aho-Corasick implementation, and the Context Engineering with Knowledge Graphs book (launching in May) puts it in the wider context of moving from RAG to context graphs.
\nUntil then: read the reference, try the how-to, and let us know in Discourse what you build with it.
\n", "https://terraphim.ai/properties/date" : "2026-04-22", "https://atomicdata.dev/properties/tags" : {"categories":["Technical"],"tags":["Terraphim","graph-embeddings","knowledge-graph","aho-corasick","context-engineering"]}, "https://atomicdata.dev/properties/isA": [ "https://atomicdata.dev/classes/Article" ] },{"localId": "https://terraphim.ai/posts/terraphim-search-in-claude-code-and-opencode/", "https://atomicdata.dev/properties/name" : "Plug Terraphim Search into Claude Code and opencode (CLI First, MCP When You Need It)", "https://terraphim.ai/properties/url" : "https://terraphim.ai/posts/terraphim-search-in-claude-code-and-opencode/", "https://atomicdata.dev/properties/description" : "Your AI coding agent already has a knowledge graph. It is just not yours yet. The model knows GitHub, Stack Overflow, and the public training corpus -- it has no idea that in your project npm should be bun, that RFP is shorthand for acquisition need, or that the email about the Stripe receipt for the Obsidian licence lives in your Fastmail mailbox. This post shows the smallest path to fixing that for both Claude Code and opencode, using Terraphim and the three roles we have published over the last week (Terraphim Engineer, Personal Assistant, System Operator).
Two paths. CLI first.
\nThe host -- Claude Code or opencode -- needs a way to ask your role-aware Terraphim setup a question and get back ranked, source-attributed results. The model decides when to ask. The role decides which haystacks to search. Terraphim's terraphim-graph ranker decides which results come back first.
Concrete example. You are working in opencode and you type:
\n/tsearch "System Operator" RFP\n\nThe slash command runs against the System Operator role. The role's knowledge graph normalises RFP to its INCOSE-canonical form acquisition need. The Aho-Corasick matcher walks the role's haystack (1,347 Logseq pages from the terraphim/system-operator repository). The top hit comes back ranked 13 -- Acquisition need.md -- with the synonyms:: line that mapped your query to it visible in the snippet. The model now has the right page in its context window and can answer your follow-up without a hallucinated INCOSE handbook reference.
This works in both hosts because both speak the same two integration languages: shell-out slash commands and MCP servers. We are going to use both.
\nterraphim-agent already exists, takes --role and --limit, and writes ranked results to stdout. There is nothing to build. Both Claude Code and opencode let slash commands shell out via Bash. So a two-line command file is the entire integration.
Drop this at ~/.claude/commands/tsearch.md (and an identical copy at ~/.config/opencode/command/tsearch.md -- both hosts read the same frontmatter shape):
---\ndescription: Terraphim search across configured roles. Usage: /tsearch [role] <query>\nallowed-tools: Bash(terraphim-agent search:*), Bash(terraphim-agent-pa search:*)\n---\nRun `terraphim-agent search --role "<role>" --limit 5 "<query>"` (or\n`terraphim-agent-pa search ...` if the role is "Personal Assistant" and\nthe query needs the JMAP haystack). Return the top results as a numbered\nlist with title, source path/URL, and a 120-char snippet.\n\nThat is it. The allowed-tools line auto-approves the two CLI invocations so the model does not have to ask permission per call. Restart the host (or reload commands) and /tsearch is live.
terraphim-agent reads its persisted role state at start (low milliseconds), runs the query against the role's haystacks, and returns. For a typical knowledge-graph query against the Terraphim Engineer role on a laptop, the round trip from slash command to formatted output is well under a second. The agent already has the typed CLI -- --role, --limit, --format json -- so there is nothing the MCP layer adds for the search-only flow.
/tsearch "Terraphim Engineer" rolegraph\n/tsearch "System Operator" RFP\n/tsearch "Personal Assistant" invoice # uses terraphim-agent-pa wrapper for JMAP\n\nThe Personal Assistant case is the most interesting because it crosses surfaces -- Obsidian notes interleave with jmap:///email/<id> URLs from your Fastmail mailbox, ranked by the same terraphim-graph scoring. The wrapper script injects JMAP_ACCESS_TOKEN from 1Password at call time so the secret never lands on disk; the bare terraphim-agent continues to work for the other five roles without paying the unlock cost.
The CLI path is enough for search. If you want the model to call search as a first-class tool with structured JSON parameters -- alongside autocomplete_terms, autocomplete_with_snippets, four flavours of fuzzy autocomplete, build_autocomplete_index, and update_config_tool -- that is what terraphim_mcp_server exposes. It reads the same ~/.config/terraphim/embedded_config.json, so the role list is identical.
cd ~/projects/terraphim/terraphim-ai\ncargo build --release -p terraphim_mcp_server --features jmap\ncp target/release/terraphim_mcp_server ~/.cargo/bin/terraphim_mcp_server\n\nFor the Personal Assistant role, mirror the existing terraphim-agent-pa wrapper at ~/bin/terraphim_mcp_server-pa so the JMAP token flows through op run instead of being baked into config.
opencode -- add to ~/.config/opencode/opencode.json under mcp:
"terraphim": { "type": "local", "command": ["/Users/alex/.cargo/bin/terraphim_mcp_server"] },\n"terraphim-pa": { "type": "local", "command": ["/Users/alex/bin/terraphim_mcp_server-pa"] }\n\nClaude Code -- one shell command per server:
\nclaude mcp add terraphim /Users/alex/.cargo/bin/terraphim_mcp_server\nclaude mcp add terraphim-pa /Users/alex/bin/terraphim_mcp_server-pa\nclaude mcp list # both should show as Connected\n\nThe model now sees mcp__terraphim__search and mcp__terraphim_pa__search (plus the autocomplete tools) in its tool list.
Slash commands and MCP tools are useless if the model does not know the roles exist. Extend the SessionStart hook in ~/.claude/settings.json to print a one-screen role index when each session starts:
printf '\\n--- Terraphim search via /tsearch [role] <query> ---\\n'\nprintf ' Terraphim Engineer (Rust/agent KG)\\n'\nprintf ' Personal Assistant (Obsidian + Fastmail JMAP, use terraphim-agent-pa for email)\\n'\nprintf ' System Operator (INCOSE/MBSE Logseq KG)\\n'\nprintf ' Context Engineering Author, Rust Engineer, Default\\n'\n\nEquivalent hook in opencode. Cost: one screen of context per session. Benefit: the model picks the right role on the first try instead of guessing.
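If the hook is not yet registered, the SessionStart entry in ~/.claude/settings.json follows the same shape Claude Code uses for its other hooks. A minimal sketch -- the script path is an assumption; point it at wherever you saved the printf block:

```json
{
  "hooks": {
    "SessionStart": [{
      "hooks": [{
        "type": "command",
        "command": "~/.claude/hooks/terraphim_roles.sh"
      }]
    }]
  }
}
```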
\n| | CLI (Path A) | MCP (Path B) |
|---|---|---|
| New binaries | None | terraphim_mcp_server plus wrapper |
| Cold start | ~50-200 ms per call | ~10-50 ms per call (long-lived process) |
| Tools exposed | search only | search + 4 autocomplete + build_autocomplete_index + update_config_tool |
| Works in any host | Yes -- anything that runs a slash command | Only hosts that speak MCP |
| Token handling | terraphim-agent-pa wrapper | terraphim_mcp_server-pa wrapper |
For the search-across-roles flow, CLI is enough. Add MCP when the model needs autocomplete-as-you-type, when you want it to manage role configuration without leaving the conversation, or when you are using a host where the typed-tool surface matters more than the cold-start cost.
\nYou do not have to choose. Wire both. The slash command above defaults to CLI and falls back to MCP if the binary is missing -- the two paths coexist cleanly because they read the same role config.
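The fallback itself is one conditional in the command's shell step. A minimal sketch -- search_backend is a hypothetical helper; only the binary name comes from the post:

```shell
# Sketch of the CLI-first, MCP-fallback decision the slash command can make.
# search_backend is a hypothetical helper name, not part of terraphim-agent.
search_backend() {
  if command -v terraphim-agent >/dev/null 2>&1; then
    echo "cli"   # fast path: shell out to terraphim-agent search
  else
    echo "mcp"   # binary missing: fall back to the MCP search tool
  fi
}
search_backend
```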
\nMost \"AI assistant + knowledge base\" integrations end up tightly coupled to a specific host. Vendor X's plugin marketplace, Vendor Y's tool format. Terraphim takes the opposite stance: the role configuration lives in your filesystem, the haystacks live in your filesystem (or your mailbox), the ranker runs in a process you own, and the integration with the AI host is the thinnest possible shim -- a slash command or an MCP server, both of which are commodity surfaces.
\nYesterday the Personal Assistant role was a private setup on one laptop. Today it is callable from inside two different AI coding hosts via a one-file slash command. Tomorrow you can add Cursor or Aider with the same two-line wrapper because the integration surface is terraphim-agent search, not vendor-specific-tool-protocol-v3.
The expensive part of context engineering is not the ranker. It is the vocabulary in the knowledge graph and the haystacks the role can reach. The integration layer should not be allowed to compete for that budget. CLI-first keeps it small.
\n# Build (or install the published crate when JMAP feature lands on crates.io)\ncd ~/projects/terraphim/terraphim-ai\ncargo install --path crates/terraphim_agent --features jmap\n\n# Configure roles -- copy the snippets from the how-tos linked below\n$EDITOR ~/.config/terraphim/embedded_config.json\n\n# Install the slash command\nmkdir -p ~/.claude/commands ~/.config/opencode/command\ncp ~/projects/terraphim/terraphim-ai/docs/src/howto/mcp-integration-claude-opencode.md \\\n /tmp/tsearch.md # adapt to your slash command file shape\n\n# Reload roles\nterraphim-agent config reload\n\n# Try\nterraphim-agent search --role "Terraphim Engineer" --limit 3 "rolegraph"\n\nStep-by-step in the docs: Plug Terraphim Search into Claude Code and opencode.
\nFor the underlying engine, start with Why Graph Embeddings Matter. For the two roles this integration most cleanly exposes, see Personal Assistant and System Operator.
\n", "https://terraphim.ai/properties/date" : "2026-04-18", "https://atomicdata.dev/properties/tags" : {"tags":["Terraphim","claude-code","opencode","mcp","knowledge-graph","developer-tools","integration"],"categories":["Technical"]}, "https://atomicdata.dev/properties/isA": [ "https://atomicdata.dev/classes/Article" ] },{"localId": "https://terraphim.ai/posts/disciplined-engineering-ai-systems/", "https://atomicdata.dev/properties/name" : "Disciplined Engineering: How We Build AI Systems That Actually Work", "https://terraphim.ai/properties/url" : "https://terraphim.ai/posts/disciplined-engineering-ai-systems/", "https://atomicdata.dev/properties/description" : "AI coding agents are making us worse engineers, unless we add discipline back. Here is what we do instead of vibe coding, and how you can do it too in 30 seconds.
\nEvery AI-generated pull request we review has the same pattern:
\nThe agent shipped code. It even passed the tests. But the tests were written by the same agent that wrote the code, optimising for the metric rather than understanding the problem.
\nThe missing piece is not better models. It is engineering discipline. AI agents need the same rigour humans use: understand the problem before coding, verify against the design, validate against requirements. We encoded this as executable skills that any AI coding agent can follow.
\n\n\nThe research evidence behind this framework, including language-specific scaling laws and the 30% adoption gap between code intelligence research and production harnesses, is the subject of our next article.
\n
We built a V-model for AI agents. The left side asks \"what should we build?\" The right side asks \"did we build it correctly?\"
\n
Before writing any code, the agent must understand the problem space:
\nCreate a specification before implementation:
\nWrite code with tests from the start:
\nVerify against the design, not just tests:
\nValidate against the original requirements:
\nWe packaged the V-model as executable skills you can add to any AI agent:
\nnpx skills add terraphim/terraphim-skills\n\nThis installs skills that enforce:
\nEach skill is a self-contained prompt that guides the agent through the phase's inputs, outputs, and quality gates.
\nOur judge system (Kimi K2.5) catches what humans miss:
\nAutomated quality gates, zero manual overhead. Every PR reviewed before it merges.
\nAI agents do not type commands into a terminal. They invoke tools programmatically, and they do not always get it right. \"Cleaning up build artefacts\" becomes rm -rf ./src (one-character typo). \"Resetting to last commit\" becomes git reset --hard (uncommitted work gone). You need a safety net that operates between the agent and your shell.
We use two layers of guard rails:
\nLayer 1: git-safety-guard (a terraphim-skill that runs as a PreToolUse hook):
\nBlocks git reset --hard, git push --force, rm -rf and similar destructive commands before they execute.\nLayer 2: Destructive Command Guard (DCG) by Jeffrey Emanuel, integrated via tool hooks:
\nThe architecture is simple: the agent calls a bash tool, the hook pipes the command to DCG as JSON, DCG pattern-matches against known destructive commands, and blocks execution before damage occurs. The agent receives an error explaining why, and can adjust its approach.
\nWe run 12+ AI agents overnight on a single machine, coordinated by a Rust orchestrator. Each agent follows the V-model:
\nEvery agent's output passes through the judge system before merge. The morning routine is reviewing verdicts, not debugging overnight chaos. When an agent produces a NO-GO verdict, the PR is flagged with the specific issues: missing test coverage, undocumented API changes, or security concerns.
\nThis is disciplined engineering at scale: not process overhead, but automated quality gates that catch problems before they compound.
\nThe gap between what AI agents can do and what they should do is real. It is not a technology gap: it is a discipline gap. The V-model and 32+ executable skills we built are available today:
\nnpx skills add terraphim/terraphim-skills\n\nAdd discipline back. Your future self will thank you.
\nDeeper dive: The V-model and quality gates we use are detailed in Chapters 3-4 of \"Context Engineering with Knowledge Graphs\". Coming soon.
\nRelated posts:
\nAI coding assistants are fast, productive, and occasionally catastrophic. One misplaced rm -rf, one accidental git reset --hard, and hours of uncommitted work vanish.
Jeffrey Emanuel (@Dicklesworthstone) built Destructive Command Guard (dcg): a Rust binary with SIMD-accelerated pattern matching, 49+ security packs, and a fail-open design. It is one of the best tools to come out of the AI agent safety space, and it solved a problem we had been fighting with regex hacks.
\nThis post shows how we integrated dcg with OpenCode using its plugin hook system, so destructive commands are intercepted before they run.
\nAI agents do not type commands into a terminal. They invoke tools programmatically, and they do not always get it right:
\nrm -rf ./src (one-character typo)git reset --hard (uncommitted work gone)git push --force (team history destroyed)You need a safety net that operates between the agent and your shell.
\nOpenCode v1.4+ exposes a plugin hook system. Hooks are top-level keys on the Hooks interface:
interface Hooks {\n "tool.execute.before"?: (input, output) => Promise<void>;\n "tool.execute.after"?: (input, output) => Promise<void>;\n}\n\nThe \"tool.execute.before\" hook fires before every tool call. It receives the tool name and arguments, and can throw an error to abort execution. This is exactly where a command guard belongs.
DCG is Jeffrey's Rust binary that reads a JSON payload from stdin and exits 0 (allow) or 2 (block). Our contribution was the plugin that wires dcg into OpenCode's hook system:
OpenCode agent\n |\n | calls bash tool: "rm -rf ./build"\n v\n"tool.execute.before" hook\n |\n | spawns: echo '{"tool":"bash","args":{"command":"rm -rf ./build"}}' | dcg\n v\ndcg (Rust, SIMD-accelerated pattern matching)\n |\n | exit code 2 + reason on stderr\n v\nhook throws Error --> command never executes\n\nThe complete plugin is roughly 60 lines:
\nimport { spawn } from 'child_process';\n\nconst callDcgHook = (toolCall) => {\n return new Promise((resolve, reject) => {\n const dcg = spawn('dcg', [], {\n env: { ...process.env, DCG_FORMAT: 'json' }\n });\n\n let stdout = '';\n let stderr = '';\n\n dcg.stdout.on('data', (data) => { stdout += data.toString(); });\n dcg.stderr.on('data', (data) => { stderr += data.toString(); });\n\n dcg.on('close', (code) => {\n if (code === 0) {\n try { resolve(JSON.parse(stdout)); }\n catch { resolve({ allowed: true }); }\n } else {\n reject(new Error(stderr || 'dcg blocked command'));\n }\n });\n\n dcg.stdin.write(JSON.stringify(toolCall));\n dcg.stdin.end();\n });\n};\n\nexport const DcgGuard = async ({ client }) => {\n return {\n "tool.execute.before": async (input, output) => {\n if (input.tool !== 'bash') return;\n\n const toolCall = {\n tool: 'bash',\n args: { command: output.args.command }\n };\n\n try {\n await callDcgHook(toolCall);\n } catch (error) {\n throw new Error(\n `dcg blocked destructive command: ${output.args.command}\\n\\n` +\n `${error.message}\\n\\n` +\n `This command was blocked to protect your system.`\n );\n }\n }\n };\n};\n\nThe original plugin used a nested structure:
\nreturn {\n tool: {\n execute: {\n before: async (input, output) => { ... }\n }\n }\n};\n\nThis looks intuitive but is wrong. In OpenCode's plugin API, tool is reserved for registering new tools (each needing a description, args, and execute function). The hook \"tool.execute.before\" is a top-level dotted key on the Hooks object, not a nested path.
The fix:
\nreturn {\n "tool.execute.before": async (input, output) => { ... }\n};\n\nThis distinction matters. OpenCode iterates over tool entries expecting ToolDefinition objects. When it found { before: ... } instead, it called .execute(args, ctx) on it, which was undefined. Hence the error: def.execute is not a function.
| Category | Examples |
|---|---|
| Git history destruction | git reset --hard, git push --force, git branch -D |
| Uncommitted work loss | git checkout -- ., git restore file, git clean -f |
| Stash destruction | git stash drop, git stash clear |
| Filesystem damage | rm -rf outside /tmp |
| Database operations | DROP TABLE, FLUSHALL (via packs) |
| Container destruction | docker system prune, docker-compose down --volumes |
| Infrastructure | terraform destroy, kubectl delete namespace |
Safe operations pass through silently: git status, git add, git commit, git push (without --force), git stash, git checkout -b, and all non-destructive commands.
Default-allow. Unrecognised commands pass through. DCG blocks only known dangerous patterns. This prevents false positives from blocking legitimate work.
\nWhitelist-first. Safe patterns (like git checkout -b) are checked before destructive patterns. Explicitly safe commands are never accidentally blocked.
Sub-millisecond latency. Jeffrey's implementation uses SIMD-accelerated substring search via Rust's memchr crate. Commands without \"git\" or \"rm\" bypass regex matching entirely. The guard adds no perceptible delay.
Fail-open. If dcg crashes or produces unexpected output, the plugin catches the error and defaults to allowing the command. A broken guard should never break your workflow. A safety system that slows down the developer will be disabled by the third day. A safety system that is invisible will run forever.
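The fail-open contract fits in a few lines of shell. A sketch, not the plugin's actual code -- guard_allows is a hypothetical wrapper, while the 0/2 exit contract and the JSON payload shape are the ones described above:

```shell
# Fail-open sketch: a missing guard must never block work.
# guard_allows is a hypothetical wrapper; dcg exits 0 (allow) or 2 (block).
guard_allows() {
  command -v dcg >/dev/null 2>&1 || return 0   # guard absent: allow (fail-open)
  printf '%s' "$1" | dcg >/dev/null 2>&1       # guard present: let it decide
}
guard_allows '{"tool":"bash","args":{"command":"git status"}}' && echo "allowed"
```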
\nDCG ships with a modular pack system. Enable additional protection categories in ~/.config/dcg/config.toml:
[packs]\nenabled = [\n "database.postgresql",\n "containers.docker",\n "kubernetes",\n "cloud.aws",\n]\n\nOr via environment variable:
\nexport DCG_PACKS="containers.docker,kubernetes"\n\n# 1. Install dcg\ncurl -fsSL "https://raw.githubusercontent.com/Dicklesworthstone/destructive_command_guard/master/install.sh" | bash\n\n# 2. Install the plugin\nmkdir -p ~/.config/opencode/plugin\ncurl -fsSL https://raw.githubusercontent.com/jms830/opencode-dcg-plugin/main/plugin/dcg-guard.js \\\n -o ~/.config/opencode/plugin/dcg-guard.js\n\n# 3. Restart OpenCode\n\nThe OpenCode plugin API's \"tool.execute.before\" hook is a clean interception point for safety guards. Combined with Jeffrey Emanuel's dcg and its fast pattern matching, you get protection against destructive commands with zero workflow friction. The plugin is small, the guard is fast, and the safety net catches the mistakes that matter.
Instructions are suggestions. Guards are guarantees.
\nDCG is built by Jeffrey Emanuel — see his destructive_command_guard repository and the broader agentic coding flywheel ecosystem for more agent safety tooling.
\nThis post is part of the Disciplined Engineering series. See also: Teaching AI Agents with Knowledge Graph Hooks and Teaching AI Agents to Learn from Their Mistakes.
\nSource: DCG plugin gist and opencode-dcg-plugin repository
\n", "https://terraphim.ai/properties/date" : "2026-04-17", "https://atomicdata.dev/properties/tags" : {"categories":["Technical"],"tags":["Terraphim","ai","opencode","developer-tools","rust","safety","coding-agents"]}, "https://atomicdata.dev/properties/isA": [ "https://atomicdata.dev/classes/Article" ] },{"localId": "https://terraphim.ai/posts/personal-assistant-role-jmap-obsidian/", "https://atomicdata.dev/properties/name" : "Personal Assistant Role: One Search Across Email and Notes", "https://terraphim.ai/properties/url" : "https://terraphim.ai/posts/personal-assistant-role-jmap-obsidian/", "https://atomicdata.dev/properties/description" : "Most \"personal AI\" tools split your context across silos: one search box for email, another for notes, a third for your chat history. Terraphim treats every source as a haystack on the same role, so a single query crosses all of them. This post shows how to wire up the two most common personal sources -- email via JMAP and notes in an Obsidian vault -- under a new Personal Assistant role.
\nThe mental tax of personal search is not the typing. It is the deciding. \"Did I read that in an email or write it in a note?\" Each silo you have to visit is a context switch with no useful payload. Once Terraphim is the front door for both surfaces, the question collapses to \"where is the thing about X\" and the role's terraphim-graph ranking serves whichever source actually has the strongest signal.
The Personal Assistant role combines two haystacks:
\n- Obsidian vault: Ripgrep service. Plain markdown, sub-millisecond local search, no daemon.\n- Fastmail mailbox: Jmap service (RFC 8620/8621). One HTTPS round trip per query, server-side full-text against your real mailbox, results returned with jmap:///email/<id> URLs you can paste back into a mail client.\nRanking is the same terraphim-graph scoring as every other Terraphim role: an Aho-Corasick automaton built from the Obsidian vault contributes synonyms specific to your project vocabulary, then both haystacks share the same rank ladder. Notes and email interleave by relevance, not by source.
terraphim-agent-pa search \"<query>\" returns mixed hits ordered by rank.~/.config/terraphim/embedded_config.json. Add another haystack tomorrow (calendar, contacts, browser history) and the same query sweeps it too.A 4 GB process on your laptop holds the whole working set; queries return in single-digit milliseconds for the local side and a few hundred for the remote JMAP round trip.
\nThe role config is roughly thirty lines of JSON: two haystacks, one knowledge-graph pointer, no LLM. The Fastmail token is not in the config -- it is injected at runtime via op run from 1Password into the JMAP_ACCESS_TOKEN environment variable, so the secret never lands on disk:
exec op run --account my.1password.com \\\n --env-file=<(echo 'JMAP_ACCESS_TOKEN=op://VAULT/ITEM/credential') \\\n -- /Users/alex/.cargo/bin/terraphim-agent "$@"\n\nWrap that in ~/bin/terraphim-agent-pa, chmod +x, and the JMAP haystack lights up only for queries that ask for it. The other roles keep using the bare terraphim-agent and never pay for the 1Password unlock.
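For orientation, the shape of such a role entry -- field names and paths here are illustrative assumptions, not the exact schema; the two service types and the in-vault kg/ folder are the real moving parts:

```json
{
  "name": "Personal Assistant",
  "relevance_function": "terraphim-graph",
  "haystacks": [
    { "service": "Ripgrep", "location": "/Users/alex/Obsidian/vault" },
    { "service": "Jmap", "location": "https://api.fastmail.com/jmap/session" }
  ],
  "knowledge_graph": { "path": "/Users/alex/Obsidian/vault/kg" }
}
```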
The reason a unified role works at all -- not just for two haystacks but for any reasonable number -- is that Terraphim's graph-embeddings layer is sub-millisecond and deterministic. There is no per-query embedding API call to amortise across sources, no vector database to keep in sync, no opaque ranker that has to be retrained when you add a new haystack. The matching is byte-level Aho-Corasick traversal of an automaton built once at role-load time. We wrote up the engine in detail at Why Graph Embeddings Matter; this Personal Assistant role is one application of that engine.
\nThe end-to-end how-to is in the docs: install the prerequisites, add the JSON snippet, write the wrapper, run three verification queries.
\nRead the how-to: Personal Assistant Role on docs.terraphim.ai
\nOne caveat worth surfacing up front: the published terraphim-agent on crates.io does not yet ship with the JMAP haystack (the haystack_jmap dependency is not published either). For email search you need to build from local source with cargo build --release -p terraphim_agent --features jmap. The how-to walks through the two Cargo.toml edits required.
Personal Assistant is the smallest useful instance of \"Terraphim as the front door for everything I read.\" Calendar (CalDAV), contacts (CardDAV), browser bookmarks, RSS, and AI session logs are all natural follow-ups -- each is a single haystack entry on the same role. The pattern composes; the cost stays linear in haystacks, not quadratic in cross-source queries.
\nIf you want the underlying engine, start with Why Graph Embeddings Matter. If you want to wire knowledge-graph hooks into your AI coding agent on the same machine, Teaching AI Coding Agents with Knowledge Graph Hooks covers that side of the same engine.
\n", "https://terraphim.ai/properties/date" : "2026-04-17", "https://atomicdata.dev/properties/tags" : {"categories":["Technical"],"tags":["Terraphim","personal-assistant","jmap","obsidian","knowledge-graph","fastmail"]}, "https://atomicdata.dev/properties/isA": [ "https://atomicdata.dev/classes/Article" ] },{"localId": "https://terraphim.ai/posts/system-operator-logseq-knowledge-graph/", "https://atomicdata.dev/properties/name" : "System Operator Demo: A Logseq Knowledge Graph Drives Enterprise MBSE Search", "https://terraphim.ai/properties/url" : "https://terraphim.ai/posts/system-operator-logseq-knowledge-graph/", "https://atomicdata.dev/properties/description" : "Terraphim's System Operator role is the demo we point people at when they want to see a real Logseq knowledge graph drive search. 1,347 Logseq pages, 52 of them carrying explicit synonyms:: lines, covering Model-Based Systems Engineering vocabulary -- requirements, architecture, verification, validation, life cycle concepts. This post walks the demo end-to-end and shows the piece people miss: the KG is doing real work, not just re-ranking text matches.
The terraphim/system-operator repository on GitHub is a Logseq vault -- flat folder of markdown files under pages/, one page per concept, with Logseq's bullet-tree syntax for structure and Terraphim-format synonyms:: lines for the knowledge-graph layer. Two things make it a useful demo rather than a toy:
Search for RFP, for example, and the automaton normalises it to acquisition need, because that is what the handbook calls it.\nThere is an automated setup script in the repo. As of today it clones to a durable path under ~/.config/terraphim/system_operator instead of /tmp, so the vault survives a reboot:
./scripts/setup_system_operator.sh\n\nThen either drive the role via the server --
\ncargo run --bin terraphim_server -- \\\n --config terraphim_server/default/system_operator_config.json\ncurl "http://127.0.0.1:8000/documents/search?q=RFP&role=System%20Operator&limit=5"\n\n-- or via the terraphim-agent CLI after adding the role entry to ~/.config/terraphim/embedded_config.json:
terraphim-agent config reload\nterraphim-agent search --role "System Operator" --limit 5 "RFP"\n\nThe full config snippet and the embedded_config.json entry are in the README_SYSTEM_OPERATOR.md.
Search-over-notes tools usually describe ranking in terms of \"it uses a knowledge graph\". That sentence hides a lot. Is the graph actually consulted at query time? Is it just a post-hoc re-ranker on top of BM25? Does it expand synonyms? On what vocabulary?
\nTerraphim exposes the answer directly. validate --connectivity prints which words in your query the automaton matched and what canonical terms they normalised to:
$ terraphim-agent validate --role "System Operator" --connectivity \\\n "RFP business analysis life cycle model business requirements documentation tree"\n\nConnectivity Check for role 'System Operator':\n Connected: false\n Matched terms: ["acquisition need", "business or mission analysis",\n "business requirements", "documentation tree",\n "life cycle concepts"]\n\nFive query fragments, five canonical matches. RFP collapsed to acquisition need (its synonym, from Acquisition need.md in the vault). business analysis collapsed to business or mission analysis (INCOSE terminology). life cycle model collapsed to life cycle concepts. None of this is text matching -- the word RFP does not appear in the canonical page body; it lives in the synonyms:: line.
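The mapping is just a property line in the vault page. In Acquisition need.md it is one line of Logseq metadata -- the exact synonym list below is illustrative:

```
synonyms:: RFP, request for proposal
```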
Once a query is normalised, the ranker walks the graph. A document that mentions acquisition need directly outranks one that mentions it through three synonym hops, and both outrank a document that mentions none of the canonical terms at all. Ranks come back with concrete integer scores -- [13] on a top result, not an opaque 0.87 cosine.
We wrote up the Personal Assistant role yesterday: a private per-user role that indexes a Fastmail mailbox plus an Obsidian vault. Same engine, same ranker, different haystacks. The knowledge graph there is a small kg/ folder inside the user's vault with 14 synonym files covering personal vocabulary (bun with npm/yarn/pnpm synonyms, odilo, invoice, meeting).
The two roles expose the same pattern at two scales:
\n| | System Operator | Personal Assistant |
|---|---|---|
| KG size | 52 synonym files, 1,300-concept vocabulary | 14 synonym files, ~30-concept personal vocabulary |
| Haystacks | 1 (Logseq repo) | 2 (Obsidian vault + Fastmail JMAP) |
| Source | Public GitHub repo | Private user files and mailbox |
| Audience | Demos, onboarding, public showcase | One user |
| Lifetime | Frozen per release | Edited daily, rebuilt in 20 ms per edit |
Both use terraphim-graph ranking. Both build an Aho-Corasick automaton once at role-load time. Both run in a 4 GB process on a laptop with no cloud round-trip. The only interesting difference is the vocabulary, which is exactly the separation of concerns a knowledge-graph-first design is supposed to deliver.
If you are evaluating Terraphim for a systems engineering group, the System Operator role is the honest starting point. It runs on a laptop against a public vault; you can check that every synonym mapping traces back to a concrete page; you can diff the pages/ folder against the INCOSE handbook and argue about terminology. When your team's own vocabulary diverges (every organisation's does), you clone the repo, edit synonyms:: lines, and the graph rebuilds in 20 milliseconds without a retraining step.
The expensive part of enterprise search is not the ranker. It is the vocabulary. A deterministic graph makes the vocabulary an asset you curate, not a black box you tune.
\ngit clone https://github.com/terraphim/terraphim-ai\ncd terraphim-ai\n./scripts/setup_system_operator.sh\ncargo run --bin terraphim_server -- \\\n --config terraphim_server/default/system_operator_config.json\n\nOr cut to the CLI if you already have terraphim-agent installed -- the embedded_config.json snippet is in the README.
For the underlying engine, start with Why Graph Embeddings Matter. For the personal-productivity analogue, see the Personal Assistant role post.
\n", "https://terraphim.ai/properties/date" : "2026-04-17", "https://atomicdata.dev/properties/tags" : {"tags":["Terraphim","system-operator","logseq","knowledge-graph","mbse","systems-engineering"],"categories":["Technical"]}, "https://atomicdata.dev/properties/isA": [ "https://atomicdata.dev/classes/Article" ] },{"localId": "https://terraphim.ai/posts/teaching-ai-agents-to-learn-from-mistakes/", "https://atomicdata.dev/properties/name" : "Teaching AI Agents to Learn from Their Mistakes", "https://terraphim.ai/properties/url" : "https://terraphim.ai/posts/teaching-ai-agents-to-learn-from-mistakes/", "https://atomicdata.dev/properties/description" : "AI coding agents make the same mistakes over and over. We built a learning system that captures failures, stores corrections, and feeds them back into future sessions — turning every error into institutional memory.
\n\n\nBuilds on Why Graph Embeddings Matter — the deterministic engine that makes \"remember this correction forever\" cheap. Apply the pattern in your own project via the Command Rewriting How-to.
\n
Every AI coding agent session starts from zero. Claude Code runs npm install, gets corrected, switches to bun — and tomorrow does it again. A force-push to main gets blocked, the agent learns why, then forgets by next session.
In our previous post on Knowledge Graph Hooks, we showed how Aho-Corasick automata can intercept and transform agent commands in real time. But interception is reactive. What if the agent could remember its past failures and avoid repeating them?
\nTerraphim's learning system operates as a closed loop with three stages:
\nFailed Command --> PostToolUse Hook --> Learning Store --> Query at Session Start\n\nA PostToolUse hook fires after every tool execution in Claude Code. When a command exits with a non-zero status, the hook calls terraphim-agent learn hook, which persists the failure as a structured markdown file:
---\nid: 6b99a8924fad4f2aaeadf5450e76730c\ncommand: npm install react\nexit_code: 127\nsource: Global\ncaptured_at: 2026-04-04T18:43:07+00:00\n---\n\nThe hook is fail-open — if terraphim-agent is unavailable, it exits silently. Development is never blocked.
Raw failures are useful, but corrections make them actionable. Any developer (or the agent itself) can attach a correction:
\n$ terraphim-agent learn correct <id> --correction "Use 'bun add react' instead"\n\nThe learning file is updated in place:
\n---\nid: 6b99a8924fad4f2aaeadf5450e76730c\ncommand: npm install react\nexit_code: 127\ncorrection: "Use 'bun add react' instead"\n---\n\nAt the start of a session, or when encountering a familiar error, the agent queries the learning store:
\n$ terraphim-agent learn query "npm"\n\nLearnings matching 'npm'.\n [G] [cmd] npm install lodash (exit: 1)\n Correction: Use 'bun add lodash' instead. Terraphim hooks enforce bun over npm.\n [G] [cmd] npm install react (exit: 127)\n Correction: Use bun instead of npm/yarn/pnpm. 'bun add react' or 'bun install'.\n\nThe agent sees past mistakes with their corrections before taking action. No more Groundhog Day.
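Surfacing these at session start can itself be a hook. A sketch with the same fail-open contract as the capture hook -- show_learnings is a hypothetical wrapper; learn list is the subcommand from the CLI reference:

```shell
# SessionStart sketch: print stored learnings when a session opens,
# and fail open (do nothing) if terraphim-agent is not installed.
show_learnings() {
  local agent="$HOME/.cargo/bin/terraphim-agent"
  [ -x "$agent" ] || return 0              # fail-open: never block a session
  "$agent" learn list 2>/dev/null || true  # best-effort: errors are swallowed
}
show_learnings && echo "session ready"
```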
\nThe PostToolUse hook is a shell script that receives JSON from Claude Code on stdin after every Bash tool call:
\n#!/bin/bash\nset -euo pipefail\n\n# Find terraphim-agent binary\nAGENT="$HOME/.cargo/bin/terraphim-agent"\n[ -x "$AGENT" ] || exit 0 # Fail-open\n\n# Read tool result from stdin\nINPUT=$(cat)\n\n# Capture failures as learnings\n$AGENT learn hook --format claude <<< "$INPUT" 2>/dev/null || true\n\nThe learn hook subcommand parses the Claude Code tool result format, extracts the command and exit code, and writes a learning file only when the exit code is non-zero.
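For reference, the stdin payload is JSON along these lines -- treat the exact field names as an assumption and check your Claude Code version's hook documentation:

```json
{
  "hook_event_name": "PostToolUse",
  "tool_name": "Bash",
  "tool_input": { "command": "npm install react" },
  "tool_response": { "stderr": "bash: npm: command not found", "exit_code": 127 }
}
```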
| Signal | Captured? | Example |
|---|---|---|
| Non-zero exit code | Yes | npm install (exit 127) |
| Force push attempts | Yes | git push --force origin main (exit 1) |
| Compilation errors | Yes | cargo build (exit 101) |
| Successful commands | No | Only failures are stored |
| Duplicate failures | Deduplicated | Same command + exit code within a session |
Here is actual data from our production learning store, accumulated across weeks of development sessions:
\nPackage manager enforcement — 3 npm entries, all corrected to bun:
\n[G] [cmd] npm install lodash (exit: 1)\n Correction: Use 'bun add lodash' instead.\n[G] [cmd] npm install react (exit: 127)\n Correction: Use bun instead of npm/yarn/pnpm.\n\nGit safety — 29 push failures captured, including one critical correction:
\n[G] [cmd] git push --force origin main (exit: 1)\n Correction: NEVER force push to main. Use feature branches and PRs.\n\nThese corrections are not just documentation. They are queryable institutional memory that agents consult before repeating the same mistakes.
\nThe learning system complements the Knowledge Graph Hooks we described previously. Together they form two layers of defence:
\n| Layer | Mechanism | Timing |
|---|---|---|
| Prevention | KG hooks intercept npm install and replace with bun install | Before execution |
| Learning | PostToolUse hook captures failures that slip through | After execution |
If a new pattern appears that the knowledge graph does not cover, the learning system captures it. A developer adds a correction. Optionally, the pattern gets promoted to a knowledge graph entry for permanent interception.
\nNew failure captured --> Correction added --> Pattern promoted to KG --> Hook intercepts automatically\n\nLearning files are stored as markdown in ~/Library/Application Support/terraphim/learnings/ (macOS) or ~/.local/share/terraphim/learnings/ (Linux). Each file is a standalone document:
---\nid: 1c8a4548-1434-a346-cd641131202a\ncommand: git push --force origin main\nexit_code: 1\nsource: Global\ncaptured_at: 2026-04-02T15:31:20+00:00\ncorrection: NEVER force push to main. Use feature branches and PRs.\n---\n\n## Command\n\n`git push --force origin main`\n\n## Error Output\n\n\nrejected
\n\n## Suggested Correction\n\n`NEVER force push to main. Use feature branches and PRs.`\n\nBeing plain markdown, these files are human-readable, grep-able, and easy to keep under version control.
\n# List recent learnings\nterraphim-agent learn list\n\n# Query by pattern (full-text search)\nterraphim-agent learn query "pattern"\n\n# Add correction to a learning\nterraphim-agent learn correct <id> --correction "what to do instead"\n\n# Hook mode (called by PostToolUse, reads JSON from stdin)\nterraphim-agent learn hook --format claude\n\nCreate ~/.claude/hooks/post_tool_use.sh:
#!/bin/bash\nset -euo pipefail\nAGENT="$HOME/.cargo/bin/terraphim-agent"\n[ -x "$AGENT" ] || exit 0\nINPUT=$(cat)\n$AGENT learn hook --format claude <<< "$INPUT" 2>/dev/null || true\n\nAdd to .claude/settings.json:
{\n "hooks": {\n "PostToolUse": [{\n "matcher": "Bash",\n "hooks": [{\n "type": "command",\n "command": "~/.claude/hooks/post_tool_use.sh"\n }]\n }]\n }\n}\n\nFailed commands are captured automatically. Query them with terraphim-agent learn query.
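The learning-file layout shown earlier (YAML frontmatter plus markdown sections, in a platform-specific directory) can be sketched roughly as follows. This is a hypothetical rendering function, not the real terraphim-agent code; field names mirror the example file above.

```rust
use std::path::PathBuf;

/// Resolve the learnings directory described in the post:
/// ~/Library/Application Support/terraphim/learnings on macOS,
/// ~/.local/share/terraphim/learnings on Linux.
/// (Illustrative only; the real binary may resolve this differently.)
fn learnings_dir() -> PathBuf {
    let home = std::env::var("HOME").unwrap_or_else(|_| ".".into());
    if cfg!(target_os = "macos") {
        PathBuf::from(home).join("Library/Application Support/terraphim/learnings")
    } else {
        PathBuf::from(home).join(".local/share/terraphim/learnings")
    }
}

/// Render one learning as a standalone markdown document with YAML
/// frontmatter, matching the file layout shown above.
fn render_learning(id: &str, command: &str, exit_code: i32, correction: &str) -> String {
    format!(
        "---\nid: {id}\ncommand: {command}\nexit_code: {exit_code}\nsource: Global\ncorrection: {correction}\n---\n\n## Command\n\n`{command}`\n\n## Suggested Correction\n\n`{correction}`\n"
    )
}

fn main() {
    let md = render_learning(
        "demo-id",
        "git push --force origin main",
        1,
        "NEVER force push to main. Use feature branches and PRs.",
    );
    assert!(md.starts_with("---\nid: demo-id"));
    assert!(md.contains("exit_code: 1"));
    println!("would write to {:?}:\n{md}", learnings_dir());
}
```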
The learning system is the foundation for richer agent memory:
\nterraphim-agent sessions search already indexes learnings alongside session transcripts.\n\nThe goal is not to make agents perfect on the first try. It is to make them incapable of making the same mistake twice.
\nAI coding agents are powerful but amnesiac. Every session starts fresh, every mistake is rediscovered. Terraphim's learning capture system closes this gap with a simple, fail-open hook that turns failures into institutional memory.
\nThe pattern is straightforward: capture failures automatically, attach corrections manually, query before acting. No training required, no model fine-tuning, no prompt engineering beyond what you already do.
\nYour agents will still make mistakes. They just will not make the same mistakes.
\nYou know what is embarrassing? Making the same mistake for the tenth time. Last week, I typed docker-compose up instead of docker compose up. The command failed. I sighed. I corrected it. Three days later? Same thing. Same sigh. Same correction.
\n\nBuilds on Why Graph Embeddings Matter — the deterministic engine that lets Terraphim store and replay corrections in microseconds. Apply the pattern in your own project via the Command Rewriting How-to.
\n
This is not just about typos. Developers repeat the same failed patterns constantly:
\ngit push -f when they should use git push --force-with-lease\ncargo run when cargo build would catch the error faster\nnpm install instead of yarn install (or vice versa, depending on your project)\napt-get commands without sudo\nps aux | grep patterns that return too many results\n\nThe AI agents we use? They are even worse. Claude Code, Codex, Cursor: they all make the same mistakes, over and over, because they have no long-term memory of what went wrong.
\nWe are not learning from our failures. We are just repeating them.
\nWhat if your terminal learned from every failed command?
\nThat is exactly what Terraphim's Learning via Negativa system does. It captures every failed command, extracts the mistake pattern, and builds a knowledge graph that corrects you in real-time.
\nThe name comes from the Latin \"via negativa\": learning by knowing what is wrong. It is the pedagogical equivalent of \"do not touch the hot stove\" after you have already touched it.
\nHere is how it works:
\nYou type "docker-compose up"\n |\nCommand fails (docker-compose is deprecated)\n |\nHook captures: command + error + context\n |\nKnowledge graph maps: "docker-compose" -> "docker compose"\n |\nNext time: Terraphim auto-replaces and suggests the correct command\n\nThis is not a wrapper script or a hack. It is a native Rust system built into terraphim-agent that captures, stores, and corrects command mistakes.
The hook intercepts failed commands from your AI agent:
\n// crates/terraphim_agent/src/learnings/capture.rs\n\nuse chrono::{DateTime, Utc};\nuse serde::{Deserialize, Serialize};\n\n#[derive(Debug, Clone, Serialize, Deserialize)]\npub struct FailedCommand {\n pub command: String,\n pub exit_code: i32,\n pub stderr: String,\n pub working_directory: String,\n pub timestamp: DateTime<Utc>,\n pub tags: Vec<String>,\n}\n\n/// Capture a failed command and extract the mistake pattern\npub async fn capture_failed_command(\n command: &str,\n exit_code: i32,\n stderr: &str,\n context: &CommandContext,\n) -> Result<FailedCommand, CaptureError> {\n // Only capture non-zero exit codes (actual failures)\n if exit_code == 0 {\n return Err(CaptureError::CommandSucceeded);\n }\n\n // Filter out test commands: we do not learn from intentional failures\n if is_test_command(command) {\n return Err(CaptureError::TestCommand);\n }\n\n // Extract mistake patterns from the command\n let tags = extract_mistake_tags(command, stderr);\n\n let failed = FailedCommand {\n command: redact_secrets(command),\n exit_code,\n stderr: stderr.to_string(),\n working_directory: context.cwd.clone(),\n timestamp: Utc::now(),\n tags,\n };\n\n // Store as markdown for human readability\n store_learning(&failed).await?;\n\n Ok(failed)\n}\n\nThe hook is fail-open by design: it never blocks your workflow if capture fails.
\nOnce captured, mistakes become nodes in a knowledge graph that maps wrong to correct:
\n// crates/terraphim_rolegraph/examples/learning_via_negativa.rs\n\nuse terraphim_rolegraph::RoleGraph;\nuse terraphim_types::{NormalizedTerm, NormalizedTermValue, Thesaurus};\n\n/// Build knowledge graph for command corrections\nfn build_correction_thesaurus() -> Thesaurus {\n let mut thesaurus = Thesaurus::new("Command Corrections".to_string());\n\n // Docker corrections\n thesaurus.insert(\n NormalizedTermValue::new("docker-compose up".to_string()),\n NormalizedTerm::new(1, NormalizedTermValue::new(\n "docker compose up".to_string()\n )),\n );\n\n // Git corrections\n thesaurus.insert(\n NormalizedTermValue::new("git push -f".to_string()),\n NormalizedTerm::new(2, NormalizedTermValue::new(\n "git push --force-with-lease".to_string()\n )),\n );\n\n // Cargo corrections\n thesaurus.insert(\n NormalizedTermValue::new("cargo buid".to_string()),\n NormalizedTerm::new(3, NormalizedTermValue::new(\n "cargo build".to_string()\n )),\n );\n\n thesaurus\n}\n\nThe correction happens automatically via Terraphim's replace tool:
\n# Without Learning via Negativa (old workflow)\n$ docker-compose up\ndocker-compose: command not found\n# You: sigh, retype, move on\n\n# With Learning via Negativa\n$ docker-compose up\n# Terraphim intercepts, corrects, and shows:\nSuggestion: Did you mean 'docker compose up'? (y/n)\n# You: y, command executes correctly\n\nWe tested Learning via Negativa with common developer mistakes over a 30-day period:
\n| Wrong Command | Error | Correction Learned |
|---|---|---|
| docker-compose up | command not found | docker compose up |
| git push -f | remote: denied by protection policy | git push --force-with-lease |
| cargo buid | error: no such subcommand | cargo build |
| npm isntall | command not found | npm install |
| apt update | Permission denied | sudo apt update |
| git psuh | git: 'psuh' is not a git command | git push |
Week 1: 12 corrections captured\nWeek 2: 34 corrections captured (cumulative)\nWeek 3: 58 corrections captured (cumulative)\nWeek 4: 89 corrections captured (cumulative)\n\nTop mistake categories:\n - Docker commands: 28%\n - Git commands: 24%\n - Cargo/Rust: 18%\n - npm/yarn: 15%\n - System commands: 15%\n\nMost AI tools have no memory. Claude Code is brilliant but stateless. Cursor remembers your files, not your mistakes. GitHub Copilot suggests code but forgets that docker-compose has been deprecated for two years.
Learning via Negativa gives your AI agent a memory for failure.
\nIt transforms every error from a one-time annoyance into a permanent lesson. The more you use it, the smarter it gets. And because it is built on the knowledge graph architecture, it does not just match strings: it understands context.
\nYou typed git push -f in a repo with protected branches? It learns that -f is wrong in that context. You use docker-compose in a project with a compose.yaml file? It learns the new syntax applies here.
# Install terraphim-agent\ncargo install terraphim-agent\n\n# Install the learning hook for Claude Code\nterraphim-agent learn install-hook claude\n\n# Verify it is working\nterraphim-agent learn list\n\n# Query your mistakes anytime\nterraphim-agent learn query "your mistake"\n\n# Or use the replace tool for real-time corrections\necho "docker-compose up" | terraphim-agent replace\n\nLearning via Negativa is more than a feature: it is a philosophy. Every failure contains information. Every error message is feedback. The trick is capturing that signal instead of just ignoring the noise.
\nWe have spent decades building systems that celebrate successes. It is time we built systems that learn from failures too.
\nYour terminal should remember what you keep getting wrong. That is not just smart: that is how humans actually learn.
\ncrates/terraphim_agent/src/learnings/\ncrates/terraphim_rolegraph/examples/learning_via_negativa.rs\n\nTerraphim: Your AI agent's memory for mistakes.
\n", "https://terraphim.ai/properties/date" : "2026-02-20", "https://atomicdata.dev/properties/tags" : {"tags":["Terraphim","rust","cli","ai-agents","developer-tools","learning","knowledge-graph"],"categories":["Technical"]}, "https://atomicdata.dev/properties/isA": [ "https://atomicdata.dev/classes/Article" ] },{"localId": "https://terraphim.ai/posts/multi-haystack-roles-grepapp/", "https://atomicdata.dev/properties/name" : "Introducing Multi-Haystack Roles: Local + Global Code Search", "https://terraphim.ai/properties/url" : "https://terraphim.ai/posts/multi-haystack-roles-grepapp/", "https://atomicdata.dev/properties/description" : "Today we are announcing a significant enhancement to Terraphim's engineer roles: multi-haystack support. FrontEnd Engineers and Python Engineers now have access to both local code search (via Ripgrep) and global GitHub search (via GrepApp) in a single query.
\nTraditionally, developers face a frustrating choice when searching for code:
\nWhen you are stuck on \"how do I use this library?\" or \"what's the idiomatic way to solve this?\", you need both: your local context for project-specific code, and global examples from the broader community.
\nStarting with Terraphim v1.8.1, select engineer roles now come with dual haystacks:
\nFrontEnd Engineer: local Ripgrep haystack (~/projects or custom path) plus global GrepApp search\nPython Engineer: local Ripgrep haystack (~/projects or custom path) plus global GrepApp search\n\nWhen you search as a FrontEnd or Python Engineer, Terraphim queries both sources simultaneously:
\nYour Query: "how to use useEffect"\n |\n+-------------------+ +--------------------+\n| Local Ripgrep | | GrepApp |\n| (your code) | | (GitHub repos) |\n+---------+---------+ +---------+----------+\n | |\n +-----------+-------------+\n |\n Combined Results\n (ranked by relevance)\n\nThe results are merged and ranked according to your role's relevance function:
\nThis means you get the best of both worlds: your project's specific implementations AND real-world examples from popular repositories.
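The merge-and-rank step above can be sketched with plain collections. This is an illustrative sketch, not the actual merge code in Terraphim; the `Hit` struct and scores are invented for demonstration.

```rust
/// A single search hit from either haystack. Fields are illustrative.
#[derive(Debug)]
struct Hit {
    source: &'static str, // "ripgrep" (local) or "grepapp" (global)
    title: String,
    score: f64,
}

/// Combine results from both haystacks and rank by relevance score.
fn merge_ranked(mut local: Vec<Hit>, mut remote: Vec<Hit>) -> Vec<Hit> {
    local.append(&mut remote);
    // Higher score first; stable sort keeps insertion order on ties.
    local.sort_by(|a, b| b.score.partial_cmp(&a.score).unwrap());
    local
}

fn main() {
    let local = vec![Hit { source: "ripgrep", title: "src/App.tsx".into(), score: 0.9 }];
    let remote = vec![
        Hit { source: "grepapp", title: "facebook/react example".into(), score: 0.95 },
        Hit { source: "grepapp", title: "some/repo".into(), score: 0.4 },
    ];
    let merged = merge_ranked(local, remote);
    assert_eq!(merged[0].source, "grepapp"); // best global hit first
    assert_eq!(merged[1].source, "ripgrep"); // then your local code
    println!("{merged:?}");
}
```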
\nYou are trying to use react-query for the first time. A single search shows:
You are debugging a Python asyncio issue. Your search returns:
\nYou want to know the \"Pythonic\" way to do something. Instead of reading docs, you can see:
\nUnder the hood, this uses our existing GrepApp integration:
\nHaystack {\n location: "https://grep.app".to_string(),\n service: ServiceType::GrepApp,\n read_only: true,\n fetch_content: false,\n extra_parameters: {\n let mut params = HashMap::new();\n params.insert("language".to_string(), "python".to_string());\n params\n },\n}\n\nThe extra_parameters field allows language-specific filtering:
language=javascriptlanguage=pythonGrepApp returns structured results from millions of GitHub repositories, complete with:
\nWe know network availability is not guaranteed. If GrepApp is unreachable:
\nThis follows our philosophy: local-first, enhance with global.
\nThis is just the beginning. We are planning:
\nMore roles with dual haystacks:
\nConfigurable filters:
\nEnhanced ranking:
\nIf you are already using Terraphim v1.8.1+, you can try the new roles immediately:
\n# Switch to FrontEnd Engineer (with dual haystack)\nterraphim-agent onboard --role frontend-engineer\n\n# Switch to Python Engineer (with dual haystack)\nterraphim-agent onboard --role python-engineer\n\n# Search as usual: both local and global results appear\nterraphim-agent search "async def"\n\nMulti-haystack roles represent a fundamental shift in how we think about code search:
\nOld model: One haystack per role, choose your scope\nNew model: Multiple haystacks per role, get comprehensive results
\nThis aligns with how developers actually work: they do not want to choose between \"my code\" and \"the world's code\" — they want both, intelligently combined.
\nAs we expand this pattern to more roles and add more haystack types (MCP servers, Atomic Data, AI assistants), Terraphim becomes not just a search tool, but a knowledge synthesis engine for developers.
\nHave ideas for other haystack combinations? Want to see this pattern applied to other roles? Let us know:
\nRelated Reading:
\n\nTerraphim: Search your world.
\n", "https://terraphim.ai/properties/date" : "2026-02-16", "https://atomicdata.dev/properties/tags" : {"categories":["Technical"],"tags":["Terraphim","release","features","grepapp","haystack","code-search"]}, "https://atomicdata.dev/properties/isA": [ "https://atomicdata.dev/classes/Article" ] },{"localId": "https://terraphim.ai/posts/native-hook-support/", "https://atomicdata.dev/properties/name" : "Native Hook Support: terraphim-agent Now Learns from Your Mistakes", "https://terraphim.ai/properties/url" : "https://terraphim.ai/posts/native-hook-support/", "https://atomicdata.dev/properties/description" : "We are announcing native hook support for terraphim-agent v1.8.1. This feature captures failed commands from AI agents (Claude Code, Codex, OpenCode) and learns from them, creating a personal knowledge base of mistakes and corrections. No more jq dependency, no more bash wrappers: just terraphim-agent learn hook.
If you are using AI agents like Claude Code, you have probably experienced this:
\nThe agent runs cargo buid (typo), the command fails, you correct it to cargo build. Next session, the same typo comes back.\n\nEvery developer has their own \"greatest hits\" of mistakes:
\nnpm isntall instead of npm install\ngit psuh instead of git push\npip intall instead of pip install\n\nThese mistakes are personal, contextual, and valuable: if only we could remember them.
\nWith terraphim-agent v1.8.1, we have introduced a complete learning system:
\n# One-command setup\nterraphim-agent learn install-hook claude\n\n# That's it. Every failed command is now captured automatically.\n\nClaude Code executes Bash command\n |\nCommand fails (exit code != 0)\n |\nHook captures: command + error + context\n |\nStored as Markdown: ~/.local/share/terraphim/learnings/\n |\nQuery anytime: terraphim-agent learn query "cargo buid"\n\n1. Native Implementation
\n2. Universal Support\nWorks with Claude Code, Codex, and OpenCode:
\nterraphim-agent learn install-hook claude\nterraphim-agent learn install-hook codex\nterraphim-agent learn install-hook opencode\n\n3. Fail-Open Design\nNever blocks your workflow. If capture fails, the command still executes.
\n4. Smart Filtering
\nIgnores intentional test commands (cargo test, npm test), so only real failures are captured.\n\n5. Rich Context\nEach learning includes the command, exit code, error output, working directory, and timestamp.
\nLet us prove it works with a realistic scenario:
\n$ terraphim-agent setup --template rust-engineer-v2\nConfiguration set to role 'Rust Engineer v2'\n\n# Simulate Claude Code making a typo\necho '{"tool_name":"Bash","tool_input":{"command":"cargo buid"},...}' \\\n | ~/.config/claude/terraphim-hook.sh\n\n$ terraphim-agent learn list\nRecent learnings:\n 1. [G] cargo buid (exit: 101)\n\n$ terraphim-agent learn query "cargo buid"\nLearnings matching 'cargo buid':\n [G] cargo buid (exit: 101)\n\nWe have also added 4 new engineer role templates, each with different ranking methods:
\n| Role | Ranking | Use Case |
|---|---|---|
| FrontEnd Engineer | BM25Plus | JavaScript/TypeScript development |
| Python Engineer | BM25F | Python with field-weighted ranking |
| Rust Engineer v2 | TitleScorer | Dual haystack (docs.rs + local) |
| Terraphim Engineer v2 | TerraphimGraph | Graph embeddings + hybrid KG |
Each role learns differently and optimises search for its domain.
\n+-------------------------------------------------------------+\n| 1. LEARN |\n| Command fails -> Hook captures -> Markdown stored |\n| Works with Claude, Codex, OpenCode |\n+-------------------------------------------------------------+\n| 2. QUERY |\n| Search patterns -> Find similar mistakes |\n| Pattern matching on command + error |\n+-------------------------------------------------------------+\n| 3. CORRECT |\n| Add corrections: learn correct <id> --correction |\n| Future: Auto-suggest from knowledge graph |\n+-------------------------------------------------------------+\n| 4. REPLACE |\n| Real-time suggestions via replace --role <role> |\n| Uses thesaurus for context-aware corrections |\n+-------------------------------------------------------------+\n\n# Install latest terraphim-agent\ncargo install terraphim-agent\n\n# Install hook for your AI agent\nterraphim-agent learn install-hook claude\n\n# Verify installation\nterraphim-agent learn --help\n\nThis release passed rigorous quality gates:
\n# Install\ncargo install terraphim-agent\n\n# Set up your role\nterraphim-agent setup --template rust-engineer-v2\n\n# Install hook\nterraphim-agent learn install-hook claude\n\n# Start learning from your mistakes\n\nTerraphim: Your AI agent's memory for mistakes.
\n", "https://terraphim.ai/properties/date" : "2026-02-16", "https://atomicdata.dev/properties/tags" : {"categories":["Technical"],"tags":["Terraphim","rust","cli","ai-agents","developer-tools","learning"]}, "https://atomicdata.dev/properties/isA": [ "https://atomicdata.dev/classes/Article" ] },{"localId": "https://terraphim.ai/posts/zvec-vs-terraphim-comparison/", "https://atomicdata.dev/properties/name" : "zvec vs Terraphim: Two Paths to Semantic Search", "https://terraphim.ai/properties/url" : "https://terraphim.ai/posts/zvec-vs-terraphim-comparison/", "https://atomicdata.dev/properties/description" : "When it comes to semantic search, there are fundamentally different architectural approaches. Alibaba's zvec and Terraphim represent two distinct philosophies: neural embeddings vs. knowledge graphs, scale vs. interpretability, dense vectors vs. co-occurrence relationships.
\nzvec is a lightweight, in-process vector database built on Alibaba's battle-tested Proxima engine. It transforms documents into high-dimensional vectors using neural embedding models (BERT, OpenAI, etc.), then uses Approximate Nearest Neighbour (ANN) algorithms like HNSW to find similar documents.
\nKey Characteristics:
\nTerraphim takes a radically different approach. Instead of converting documents to opaque vectors, it builds a knowledge graph from term co-occurrences. Each concept becomes a node, relationships become edges, and relevance is calculated by traversing this graph structure.
\nKey Characteristics:
\n+-------------------------------------------------------------+\n| zvec |\n+-------------------------------------------------------------+\n| Document -> Neural Encoder -> Dense Vector -> HNSW Index |\n| | |\n| Query -> Neural Encoder -> Query Vector -> ANN Search -> K |\n+-------------------------------------------------------------+\n vs\n+-------------------------------------------------------------+\n| Terraphim |\n+-------------------------------------------------------------+\n| Document -> Term Extraction -> Co-occurrence -> Graph |\n| | |\n| Query -> Aho-Corasick Match -> Graph Traversal -> Ranked |\n+-------------------------------------------------------------+\n\n| Component | zvec | Terraphim |
|---|---|---|
| Storage Unit | Collection (table-like) | RoleGraph (knowledge graph) |
| Document ID | String | String |
| Representations | Dense/Sparse vectors (768-dim+) | Nodes, Edges, Thesaurus |
| Index Types | HNSW, IVF, Flat, Inverted | Hash maps + Aho-Corasick |
| Persistence | Disk-based collections | JSON serialisation |
zvec Query:
\nimport zvec\n\n# Semantic similarity via vector comparison\nresults = collection.query(\n zvec.VectorQuery("embedding", vector=[0.1, -0.3, ...]),\n topk=10,\n filter="category == 'tech'"\n)\n# Returns: documents with similar vectors (cosine similarity)\n\nTerraphim Query:
\n// Graph traversal with term expansion\nlet results = role_graph.query_graph(\n "async programming",\n Some(0), // offset\n Some(10) // limit\n);\n// Returns: documents ranked by graph connectivity\n// Matched nodes: "async", "programming", "concurrency", "tokio"\n\n| Feature | zvec | Terraphim |
|---|---|---|
| Dense Embeddings | Native | Not used |
| Sparse Vectors | BM25 supported | BM25/BM25F/BM25Plus |
| Knowledge Graph | No | Core architecture |
| ANN Search | HNSW/IVF/Flat | Not applicable |
| SQL-like Filters | SQL engine | Graph-based filtering |
| Explainability | Low (black box) | High (show path) |
| Synonym Expansion | Via embedding model | Via thesaurus |
| Role/Persona Support | No | RoleGraphs |
| Multi-Haystack | Single collection | Multiple sources |
| Built-in Rerankers | RRF, Weighted | Graph ranks directly |
| Quantisation | INT8/FP16 | Not needed |
| Hybrid Search | Vectors + Filters | Graph + Haystacks |
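The "Synonym Expansion: Via thesaurus" row in the table deserves a concrete sketch. The following is a toy illustration, not the terraphim_types API: every synonym maps to a canonical concept, so a query term expands deterministically, with no embedding model involved.

```rust
use std::collections::HashMap;

/// Toy thesaurus: synonym -> canonical concept. Entries are invented
/// for illustration, not taken from a real Terraphim role.
fn build_thesaurus() -> HashMap<&'static str, &'static str> {
    HashMap::from([
        ("async", "async"),
        ("asynchronous", "async"),
        ("tokio", "async"),
        ("concurrency", "async"),
    ])
}

/// Expand a query into every synonym that shares a canonical concept
/// with one of the query terms.
fn expand(query: &str, thesaurus: &HashMap<&str, &str>) -> Vec<String> {
    let canon: Vec<&str> = query
        .split_whitespace()
        .filter_map(|t| thesaurus.get(t).copied())
        .collect();
    thesaurus
        .iter()
        .filter(|(_, c)| canon.contains(c))
        .map(|(s, _)| s.to_string())
        .collect()
}

fn main() {
    let t = build_thesaurus();
    let mut terms = expand("tokio runtime", &t);
    terms.sort();
    // "tokio" pulls in every synonym of the "async" concept
    assert_eq!(terms, ["async", "asynchronous", "concurrency", "tokio"]);
    println!("{terms:?}");
}
```

Because expansion is a hash lookup rather than a vector comparison, the same query always expands to the same terms, which is what makes the "Explainability: High" row possible.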
You need to search billions of documents
\nYou are building RAG systems with LLMs
\nYou need image/audio similarity search
\nExact semantic similarity matters
\nYou need explainable results
\nYou have domain-specific knowledge
\nYou are building personal knowledge management
\nYou need role-based search
\nzvec (Python):
\nimport zvec\n\nschema = zvec.CollectionSchema(\n name="docs",\n vectors=zvec.VectorSchema("emb", zvec.DataType.VECTOR_FP32, 768),\n)\n\ncollection = zvec.create_and_open(path="./data", schema=schema)\n\n# Documents must have pre-computed embeddings\ncollection.insert([\n zvec.Doc(\n id="doc1",\n vectors={"emb": embedding_model.encode("Rust async programming")},\n fields={"title": "Async in Rust"}\n ),\n])\n\nTerraphim (Rust):
\nuse terraphim_rolegraph::RoleGraph;\nuse terraphim_types::{Document, RoleName};\n\nlet mut graph = RoleGraph::new(\n RoleName::new("engineer"),\n thesaurus\n).await?;\n\n// Documents are indexed into the graph\ngraph.index_documents(vec![\n Document {\n id: "doc1".into(),\n title: "Async in Rust".into(),\n body: "Rust's async/await syntax...".into(),\n // Graph extracts terms automatically\n ..Default::default()\n },\n]).await?;\n\nzvec:
\n# Vector similarity search\nquery_vec = embedding_model.encode("how to write async code")\nresults = collection.query(\n zvec.VectorQuery("emb", vector=query_vec),\n topk=5\n)\n# Results ranked by cosine similarity\n\nTerraphim:
\n// Graph-based search\nlet results = graph.query_graph("async code", None, Some(5))?;\n// Results ranked by:\n// 1. Node rank (concept frequency)\n// 2. Edge rank (relationship strength)\n// 3. Document rank (occurrence count)\n\nAbsolutely. Here are some integration patterns:
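The three-part ranking sketched in the comments above can be made concrete with a toy scorer. This is not the terraphim_rolegraph implementation; it is a minimal stdlib sketch showing how a score decomposes into per-concept contributions, which is what makes graph ranking explainable.

```rust
use std::collections::HashMap;

/// Score a document as the weighted sum of matched concept occurrences.
/// Returns the total plus the per-concept breakdown used for explanations.
/// Node ranks here are invented for illustration.
fn score_document(body: &str, node_ranks: &HashMap<&str, u32>) -> (u32, Vec<(String, u32)>) {
    let mut matched = Vec::new();
    let mut total = 0;
    for (concept, rank) in node_ranks {
        let occurrences = body.matches(concept).count() as u32;
        if occurrences > 0 {
            total += rank * occurrences;
            matched.push((concept.to_string(), rank * occurrences));
        }
    }
    matched.sort(); // deterministic output order
    (total, matched)
}

fn main() {
    let node_ranks = HashMap::from([("async", 3), ("tokio", 2), ("graph", 1)]);
    let (score, matched) = score_document("async code with tokio and async fn", &node_ranks);
    assert_eq!(score, 3 * 2 + 2); // two "async" hits plus one "tokio" hit
    println!("score={score}, contributions={matched:?}");
}
```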
\nUse zvec for initial broad retrieval, Terraphim for reranking:
\n# Step 1: zvec ANN for candidate retrieval\ncandidates = zvec_collection.query(query_vector, topk=100)\n\n# Step 2: Terraphim graph reranking\n# Load candidates into temporary graph\n# Re-rank based on knowledge graph connectivity\n\nUse Terraphim's graph to explain zvec results:
\nUser: "Why did this document match?"\nSystem:\n - zvec: "Vector similarity: 0.92"\n - Terraphim: "Matched via concepts: async -> tokio -> concurrency"\n\nzvec and Terraphim solve semantic search with fundamentally different approaches:
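The "Matched via concepts" explanation above is just a path through the graph, which a breadth-first search can recover. The adjacency list below is made up for illustration; it is not a real Terraphim role graph.

```rust
use std::collections::{HashMap, HashSet, VecDeque};

/// Breadth-first search that returns the shortest concept path
/// connecting a query term to a matched document term.
fn explain_path(graph: &HashMap<&str, Vec<&str>>, from: &str, to: &str) -> Option<Vec<String>> {
    let mut queue = VecDeque::from([vec![from.to_string()]]);
    let mut seen = HashSet::from([from.to_string()]);
    while let Some(path) = queue.pop_front() {
        let last = path.last().unwrap().clone();
        if last == to {
            return Some(path);
        }
        for next in graph.get(last.as_str()).into_iter().flatten() {
            if seen.insert(next.to_string()) {
                let mut p = path.clone();
                p.push(next.to_string());
                queue.push_back(p);
            }
        }
    }
    None
}

fn main() {
    let graph = HashMap::from([
        ("async", vec!["tokio"]),
        ("tokio", vec!["concurrency"]),
        ("concurrency", vec![]),
    ]);
    let path = explain_path(&graph, "async", "concurrency").unwrap();
    assert_eq!(path.join(" -> "), "async -> tokio -> concurrency");
    println!("Matched via concepts: {}", path.join(" -> "));
}
```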
\nzvec scales neural embeddings to billions of documents using ANN algorithms. It is the right choice for large-scale RAG systems, e-commerce search, and any application requiring dense vector similarity.
\nTerraphim builds interpretable knowledge graphs from term relationships. It excels at personal knowledge management, domain-specific expert systems, and any application where understanding why a document matched is as important as finding it.
\nThe exciting possibility is combining both: zvec's scale with Terraphim's explainability. The future of semantic search might just be hybrid.
\nHave you used zvec or Terraphim? We would love to hear about your experiences on GitHub Issues.
\n", "https://terraphim.ai/properties/date" : "2026-02-16", "https://atomicdata.dev/properties/tags" : {"categories":["Technical"],"tags":["Terraphim","vector-search","knowledge-graph","semantic-search","comparison"]}, "https://atomicdata.dev/properties/isA": [ "https://atomicdata.dev/classes/Article" ] },{"localId": "https://terraphim.ai/posts/teaching-ai-agents-with-knowledge-graphs/", "https://atomicdata.dev/properties/name" : "Teaching AI Coding Agents with Knowledge Graph Hooks", "https://terraphim.ai/properties/url" : "https://terraphim.ai/posts/teaching-ai-agents-with-knowledge-graphs/", "https://atomicdata.dev/properties/description" : "How we use Aho-Corasick automata and knowledge graphs to automatically enforce coding standards across AI coding agents like Claude Code, Cursor, and Aider.
\n\n\nNew: see Why Graph Embeddings Matter for the underlying engine that makes these hooks possible — sub-millisecond, deterministic, fully explainable.
\n
npm install.\n\nOn December 3, 2025, Anthropic announced its first-ever acquisition: Bun, the blazing-fast JavaScript runtime. This came alongside Claude Code reaching $1 billion in run-rate revenue just six months after public launch.
\nAs Mike Krieger, Anthropic's Chief Product Officer, put it:
\n\n\n\"Bun represents exactly the kind of technical excellence we want to bring into Anthropic... bringing the Bun team into Anthropic means we can build the infrastructure to compound that momentum.\"
\n
Claude Code ships as a Bun executable to millions of developers. Anthropic now owns the runtime their flagship coding tool depends on.
\nAnd yet...
\nAsk Claude to set up a Node.js project, and what do you get?
\nnpm install express\nyarn add lodash\npnpm install --save-dev jest\n\nYet Anthropic's own models still default to npm, yarn, and pnpm in their outputs. The training data predates the acquisition, and old habits die hard.
\nSo how do you teach your AI coding tools to consistently use Bun, regardless of what the underlying LLM insists on?
\nAI coding agents are powerful, but they're trained on the internet's collective habits—which means npm everywhere. Your team might have standardized on Bun for its speed (25% monthly growth, 7.2 million downloads in October 2025), but every AI agent keeps suggesting the old ways.
\nManually fixing these inconsistencies is tedious. What if your knowledge graph could automatically intercept and transform AI outputs?
\nTerraphim provides a hook system that intercepts AI agent actions and applies knowledge graph-based transformations. The system uses:
\nInput Text → Aho-Corasick Automata → Pattern Match → Knowledge Graph Lookup → Transformed Output\n\nThe knowledge graph is built from simple markdown files:
\n# bun install\n\nFast package installation with Bun.\n\nsynonyms:: pnpm install, npm install, yarn install\n\nWhen the automata encounter any synonym, they replace it with the canonical term (the heading).
\nLet's prove it works. Here's a live test:
\n$ echo "npm install" | terraphim-agent replace\nbun install\n\n$ echo "yarn install lodash" | terraphim-agent replace\nbun install lodash\n\n$ echo "pnpm install --save-dev jest" | terraphim-agent replace\nbun install --save-dev jest\n\nThe LeftmostLongest matching ensures npm install matches the more specific pattern before standalone npm could match.
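The leftmost-longest semantics can be illustrated without the automaton. The real system uses an Aho-Corasick automaton from terraphim_automata; the stdlib sketch below only mimics the matching behaviour (in O(n × patterns) time rather than O(n)), to show why `npm install` wins over standalone `npm`.

```rust
/// Simplified leftmost-longest replacement: at each position, the
/// longest matching pattern is applied first. Illustrative only.
fn replace_leftmost_longest(text: &str, rules: &[(&str, &str)]) -> String {
    let mut out = String::new();
    let mut i = 0;
    while i < text.len() {
        // Find the longest pattern that matches at the current position.
        let hit = rules
            .iter()
            .filter(|(from, _)| text[i..].starts_with(from))
            .max_by_key(|(from, _)| from.len());
        match hit {
            Some((from, to)) => {
                out.push_str(to);
                i += from.len();
            }
            None => {
                let ch = text[i..].chars().next().unwrap();
                out.push(ch);
                i += ch.len_utf8();
            }
        }
    }
    out
}

fn main() {
    let rules = [("npm install", "bun install"), ("npm", "bun")];
    // The longer "npm install" pattern wins over the shorter "npm".
    assert_eq!(replace_leftmost_longest("npm install express", &rules), "bun install express");
    // Standalone "npm" still gets rewritten.
    assert_eq!(replace_leftmost_longest("npm run dev", &rules), "bun run dev");
    println!("ok");
}
```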
Terraphim hooks integrate at multiple points in the development workflow:
\nIntercept Bash commands before execution:
\n{\n "hooks": {\n "PreToolUse": [{\n "matcher": "Bash",\n "hooks": [{\n "type": "command",\n "command": "terraphim-agent replace"\n }]\n }]\n }\n}\n\nWhen Claude Code tries to run npm install express, the hook transforms it to bun install express before execution.
Enforce attribution standards in commits:
\n#!/bin/bash\nCOMMIT_MSG_FILE=$1\nORIGINAL=$(cat "$COMMIT_MSG_FILE")\nTRANSFORMED=$(echo "$ORIGINAL" | terraphim-agent replace)\necho "$TRANSFORMED" > "$COMMIT_MSG_FILE"\n\nWith a knowledge graph entry:
\n# Terraphim AI\n\nAttribution for AI-assisted development.\n\nsynonyms:: Claude Code, Claude, Anthropic Claude\n\nEvery commit message mentioning \"Claude Code\" becomes \"Terraphim AI\".
\nThe replace_matches MCP tool exposes the same functionality to any MCP-compatible client:
{\n "tool": "replace_matches",\n "arguments": {\n "text": "Run npm install to setup"\n }\n}\n\nThe hook system is built on three crates:
\n| Crate | Purpose |
|---|---|
terraphim_automata | Aho-Corasick pattern matching, thesaurus building |
terraphim_hooks | ReplacementService, HookResult, binary discovery |
terraphim_agent | CLI with replace subcommand |
Adding new patterns is simple. Create a markdown file in the mdBook source tree under docs/src/kg/ (published at https://docs.terraphim.ai/src/kg/).
# pytest\n\nPython testing framework.\n\nsynonyms:: python -m unittest, unittest, nose\n\nThe system automatically rebuilds the automata on startup.
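Parsing one of these kg/ markdown files into a (canonical, synonyms) pair is straightforward. This is a hypothetical parser for the format shown above, not the terraphim_automata thesaurus builder:

```rust
/// Parse a kg/ markdown entry: the first `# ` heading is the canonical
/// term, and a `synonyms::` line lists patterns rewritten to it.
fn parse_kg_entry(markdown: &str) -> Option<(String, Vec<String>)> {
    let canonical = markdown
        .lines()
        .find_map(|l| l.strip_prefix("# ").map(|s| s.trim().to_string()))?;
    let synonyms = markdown
        .lines()
        .find_map(|l| l.strip_prefix("synonyms::"))
        .map(|rest| rest.split(',').map(|s| s.trim().to_string()).collect())
        .unwrap_or_default();
    Some((canonical, synonyms))
}

fn main() {
    let entry = "# pytest\n\nPython testing framework.\n\nsynonyms:: python -m unittest, unittest, nose\n";
    let (canonical, synonyms) = parse_kg_entry(entry).unwrap();
    assert_eq!(canonical, "pytest");
    assert_eq!(synonyms, ["python -m unittest", "unittest", "nose"]);
    println!("{canonical}: {synonyms:?}");
}
```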
\nThe LeftmostLongest strategy means:
\nnpm install matches before npmpython -m pytest matches before python# Install all hooks\n./scripts/install-terraphim-hooks.sh --easy-mode\n\n# Test the replacement\necho "npm install" | ./target/release/terraphim-agent replace\n\ncargo build -p terraphim_agent --features repl-full --release\n\nConfigure Claude Code hooks in .claude/settings.local.json
Install Git hooks:
\ncp scripts/hooks/prepare-commit-msg .git/hooks/\nchmod +x .git/hooks/prepare-commit-msg\n\n| Use Case | Pattern | Replacement |
|---|---|---|
| Package manager standardization | npm, yarn, pnpm | bun |
| AI attribution | Claude Code, Claude | Terraphim AI |
| Framework migration | React.Component | React functional components |
| API versioning | /api/v1 | /api/v2 |
| Deprecated function replacement | moment() | dayjs() |
For AI agents that support skills, we provide a dedicated plugin:
\nclaude plugin install terraphim-engineering-skills@terraphim-ai\n\nThe terraphim-hooks skill teaches agents how to:
Knowledge graph hooks provide a powerful, declarative way to enforce coding standards across AI agents. By defining patterns in simple markdown files, you can:
\nThe Aho-Corasick automata ensure efficient matching regardless of pattern count, making this approach scale to large knowledge graphs.
\nTo wire knowledge-graph hooks into your own project, the Command Rewriting How-to walks through the configuration end to end. To understand why the matching is sub-millisecond and deterministic — and what that lets you promise to your users — read Why Graph Embeddings Matter.
\nWe have a few end-to-end demos and user journeys to discuss with early adopters of Terraphim, the privacy-preserving AI assistant.
\n", "https://terraphim.ai/properties/date" : "2023-08-12", "https://atomicdata.dev/properties/tags" : {"tags":["Terraphim","ai","announcement"],"categories":["Announcements"]}, "https://atomicdata.dev/properties/isA": [ "https://atomicdata.dev/classes/Article" ] },{"localId": "https://terraphim.ai/capabilities/local-first/", "https://atomicdata.dev/properties/name" : "Local-First Architecture", "https://terraphim.ai/properties/url" : "https://terraphim.ai/capabilities/local-first/", "https://atomicdata.dev/properties/description" : "Terraphim AI is not \"privacy-respecting\" because of a terms-of-service promise. It is private because the architecture makes data exfiltration structurally impossible. All indexing, searching, and knowledge graph traversal happens entirely on your device. There is no cloud dependency, no telemetry, no analytics phone-home.
\nThe search architecture is deliberately non-neural and offline-first:
\n| Principle | What It Means |
|---|---|
| Offline-first | No network calls or LLM inference required at search time |
| Deterministic | Same query plus same corpus equals same results, always |
| Explainable | Every score decomposes into frequency counts, field weights, or set overlaps |
| Low footprint | Approximately 15-20 MB RAM for a typical knowledge graph; no GPU, no float vectors |
| Graph-native | Explicit edges and nodes encode domain relationships, not latent geometry |
There is no sign-up, no API key, no subscription. You install Terraphim and it works. Your search history, your knowledge graphs, your indexed documents — all of it stays on your machine under your control.
\nTraditional enterprise search tools upload your data to cloud servers, process it with proprietary models, and return results through APIs that require authentication and ongoing payment. Terraphim inverts this model entirely:
\nTerraphim runs wherever you need it: as a native desktop application, a WebAssembly module in your browser, a CLI tool, or a TUI (terminal user interface). For teams that need shared access, Terraphim Private Cloud uses AWS Firecracker microVMs to give each user a dedicated virtual machine with their own TLS certificate — keeping data isolated even in multi-tenant deployments.
\nAt the heart of Terraphim is a knowledge graph engine that uses Aho-Corasick finite state automata for multi-pattern matching. Unlike traditional keyword search, Aho-Corasick matches thousands of patterns simultaneously in O(n) time relative to the text length — regardless of how many patterns are in the graph.
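The matching semantics are easy to make concrete with a deliberately naive, std-only sketch. This version rescans the text once per pattern, so it is O(n·k); the real engine compiles all patterns into a single Aho-Corasick automaton and walks the text once, keeping the scan O(n) however many patterns the graph holds. The example text and patterns are invented for illustration:

```rust
// Naive multi-pattern scan (ASCII-oriented, illustration only).
// A real Aho-Corasick automaton walks the text once regardless of
// how many patterns it was built from.
fn find_all(text: &str, patterns: &[&str]) -> Vec<(usize, String)> {
    let lower = text.to_lowercase();
    let mut hits = Vec::new();
    for p in patterns {
        let needle = p.to_lowercase();
        let mut start = 0;
        while let Some(pos) = lower[start..].find(needle.as_str()) {
            hits.push((start + pos, p.to_string()));
            start += pos + 1; // continue past this hit
        }
    }
    hits.sort(); // order hits by position in the text
    hits
}

fn main() {
    let patterns = ["knowledge graph", "graph", "automaton"];
    let hits = find_all("A knowledge graph drives the automaton.", &patterns);
    for (pos, pat) in &hits {
        println!("{pos:>3}  {pat}");
    }
}
```

Note that "graph" matches inside "knowledge graph" as well: naive scanning reports every occurrence of every pattern, which is also what the automaton does unless a match-kind policy such as leftmost-longest is configured.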
\nThe search pipeline operates in stages:
\nEach Terraphim role has its own separate knowledge graph containing concepts relevant to that domain, along with all their synonyms. A systems engineer, a project manager, and a quality analyst each see the same documents ranked differently based on their role's knowledge graph.
\nKnowledge graphs are built from industry standards, reference process models, handbooks, and curated taxonomies. Terraphim imports these sources and produces a graph following the SIPOC pattern — concepts at the input and output of processes, with activity names linking them.
\nTerraphim's Dynamic Ontology enables schema-first knowledge graph construction:
\nUsers specify synonyms manually and rebuild graph embeddings within 20 milliseconds. This allows matching terms in different languages to the same concept without running language detection. There is no need for a stop-word dictionary — \"The Pattern\" matches exactly as a project name, even though \"The\" would normally be filtered as a stop word.
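The synonym-to-concept mapping is easiest to picture as a flat lookup table: every synonym, in any language, normalises to one concept, and phrases are matched whole so nothing is lost to stop-word filtering. The concept key and the non-English synonyms below are made up for illustration:

```rust
use std::collections::HashMap;

// Sketch of a thesaurus: each synonym maps to one normalized concept key.
// Because phrases match whole, "The Pattern" keeps its "The" instead of
// losing it to a stop-word filter.
fn build_thesaurus() -> HashMap<String, &'static str> {
    let mut t = HashMap::new();
    // Hypothetical synonyms for one concept, including non-English ones.
    for syn in ["the pattern", "паттерн", "le motif"] {
        t.insert(syn.to_string(), "the-pattern"); // hypothetical concept key
    }
    t
}

fn lookup(t: &HashMap<String, &'static str>, term: &str) -> Option<&'static str> {
    // Case-insensitive lookup; no language detection required.
    t.get(&term.to_lowercase()).copied()
}

fn main() {
    let t = build_thesaurus();
    assert_eq!(lookup(&t, "The Pattern"), Some("the-pattern"));
    assert_eq!(lookup(&t, "паттерн"), Some("the-pattern"));
}
```

Rebuilding after an edit is just regenerating this table from the markdown sources, which is why the 20-millisecond rebuild budget is plausible at this scale.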
\nTerraphim includes a native Model Context Protocol (MCP) server that exposes your local knowledge graphs to AI coding assistants like Claude Code, Claude Desktop, and other MCP-compatible tools. Your AI assistant gains domain-specific context without your data leaving your machine.
\nExtract paragraphs from text starting at matched terms, with precise line numbers. Useful for referencing function definitions, getting context around specific patterns, or building documentation with accurate line references.
\nSemantic search that returns full document context with file paths. Unlike plain text search, this leverages your role-based knowledge graph to find documents by their conceptual relationship to your query, not just keyword overlap.
\nMonitor token usage across your AI assistant sessions. Track how much context is being consumed and optimise your prompts based on actual usage data.
\nDifferent roles need different context. A systems engineer asking about \"requirements\" needs different search results from a quality analyst using the same word. Terraphim's MCP server routes queries through the active role's knowledge graph, ensuring your AI assistant receives context that matches your current domain.
\nTypical AI assistant integrations send your files and context to cloud APIs. Terraphim's MCP server runs locally:
\nTerraphim works as a Claude Code MCP server, providing knowledge-graph-boosted file search, concept extraction, and role-based context enrichment directly within your development workflow.
\nTerraphim AI is written in Rust and compiles to WebAssembly. This is not a marketing choice — it is a performance requirement. Knowledge graph inference runs in 5 to 10 nanoseconds. Pipeline processing completes in hundreds of milliseconds. Search queries resolve in sub-millisecond time.
\nRust provides memory safety without garbage collection, zero-cost abstractions, and thread safety guaranteed at compile time. For a system that builds and traverses knowledge graphs with thousands of nodes and edges, these guarantees translate directly into reliable performance without unexpected pauses or memory leaks.
\nWebAssembly allows the same Rust codebase to run:
\nOne codebase, compiled to multiple targets, with no performance compromise.
\nThe journey from The Pattern to Terraphim AI tells the performance story:
\n| Metric | Traditional ML Pipeline | The Pattern | Terraphim AI |
|---|---|---|---|
| Data processing for training | 6 days | 6 hours | Hundreds of milliseconds |
| Inference latency | Seconds | Under 2 ms | 5-10 nanoseconds |
| RAM footprint | Gigabytes | Hundreds of MB | 15-20 MB |
| GPU required | Yes | No | No |
Terraphim includes a comprehensive benchmarking framework covering:
\nAll benchmarks run on standard hardware — no GPU, no specialised accelerator.
\nA typical Terraphim knowledge graph occupies approximately 15-20 MB of RAM. There are no float vectors, no dense embeddings, no GPU memory allocation. This means Terraphim runs comfortably on a laptop, a Raspberry Pi, or a modest cloud instance.
\nTerraphim implements a two-stage runtime validation system for AI-assisted development workflows. Every tool execution and LLM generation passes through hooks that provide both safety and intelligence enhancement.
\nThe guard stage prevents dangerous operations before any processing occurs:
\nOne example: blocking --no-verify in git operations.\nThe replacement stage enhances text using knowledge graph patterns:
\nFailed commands are automatically captured by post-tool-use hooks. When a bash command fails, Terraphim records the failure for later review. Over time, you build a searchable history of what went wrong and how it was corrected.
\n# Review captured learnings\nterraphim-agent learn list\n\n# Query by pattern\nterraphim-agent learn query "npm"\n\n# Add a correction\nterraphim-agent learn correct <id> "use bun instead"\n\nThis turns individual debugging sessions into institutional knowledge that persists across conversations and team members.
\nTerraphim supports hooks at four points in the AI workflow:
\n| Hook | Purpose |
|---|---|
| Pre-LLM | Validate prompts before generation; block, modify, or require human confirmation |
| Post-LLM | Validate responses; catch harmful content, enforce formatting |
| Pre-Tool | Validate commands before execution; security checks, injection prevention |
| Post-Tool | Monitor results; track performance, capture failures for learning |
Hooks are configured via TOML and environment variables, with sensible defaults:
\nTraditional file search finds files by name or by text content. Terraphim's KG-boosted file search finds files by their semantic relationship to your domain concepts and role-specific vocabulary.
\nWhen you search for \"requirements validation\", Terraphim does not just grep for those words. It traverses your knowledge graph to find documents connected to those concepts through synonyms, co-occurrence edges, and domain relationships. A document titled \"acceptance criteria review\" surfaces because the knowledge graph knows these concepts are related in a systems engineering context.
\nTerraphim searches across multiple sources simultaneously:
\nResults from all sources are unified, deduplicated, and ranked using a single knowledge graph.
\nWhen searching across multiple haystacks, the same document often appears in different sources. Terraphim handles duplicates intelligently, merging results and preserving the highest-ranked version rather than showing the same content multiple times.
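The merge rule, keep the highest-ranked copy of each document, can be sketched in a few lines. The Hit struct and the source names are illustrative, not Terraphim's actual types:

```rust
use std::collections::HashMap;

// Illustrative search hit: same document id may arrive from several haystacks.
#[derive(Debug)]
struct Hit {
    id: String,
    source: &'static str,
    rank: u64,
}

// Keep one entry per document id, preserving whichever copy scored highest,
// then order the merged list by rank.
fn dedupe(hits: Vec<Hit>) -> Vec<Hit> {
    let mut best: HashMap<String, Hit> = HashMap::new();
    for h in hits {
        let keep = best.get(&h.id).map_or(true, |b| h.rank > b.rank);
        if keep {
            best.insert(h.id.clone(), h);
        }
    }
    let mut out: Vec<Hit> = best.into_values().collect();
    out.sort_by(|a, b| b.rank.cmp(&a.rank)); // highest rank first
    out
}

fn main() {
    let merged = dedupe(vec![
        Hit { id: "readme.md".into(), source: "local", rank: 10 },
        Hit { id: "readme.md".into(), source: "atomic-server", rank: 42 },
        Hit { id: "spec.md".into(), source: "local", rank: 7 },
    ]);
    assert_eq!(merged.len(), 2);
    assert_eq!(merged[0].source, "atomic-server"); // higher-ranked copy wins
}
```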
\nThe same search query produces different rankings for different roles:
\nEach role's knowledge graph contains domain-specific concepts and synonyms that reshape how results are scored and ordered.
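The effect of role vocabularies on ranking can be approximated with a bag-of-terms count. This is illustrative only, the real engine ranks through the role's knowledge graph rather than a flat term count, and the vocabularies below are invented:

```rust
use std::collections::HashSet;

// Toy scorer: a document's score under a role is the number of that
// role's vocabulary terms it contains. Different vocabularies produce
// different scores for the same document.
fn score(doc: &str, role_vocab: &HashSet<&str>) -> usize {
    let text = doc.to_lowercase();
    role_vocab
        .iter()
        .filter(|term| text.contains(term.to_lowercase().as_str()))
        .count()
}

fn main() {
    let doc = "Acceptance criteria review for the requirements baseline";

    // Hypothetical role vocabularies.
    let engineer = HashSet::from(["verification", "validation", "baseline"]);
    let analyst = HashSet::from(["acceptance criteria", "review", "defect"]);

    // Same document, different roles, different scores.
    assert_eq!(score(doc, &engineer), 1); // only "baseline" appears
    assert_eq!(score(doc, &analyst), 2);  // "acceptance criteria" and "review"
}
```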
\nTerraphim organises searchable content into context collections — curated sets of documents, taxonomies, and vocabulary that define a domain. Collections can be shared across a team or customised per user, providing consistent search relevance without centralised configuration.
\nThrough the MCP server, KG-boosted file search is available directly in your AI coding assistant. When Claude Code searches for files, it uses your knowledge graph to find semantically relevant results, not just filename matches.
\nThe name Terraphim comes from the Relict series of science fiction novels by Vasiliy Golovachev. In Golovachev's universe a Terraphim is an artificial intelligence that lives inside a spacesuit — part of an exocortex — or inside your house or vehicle, designed to help you with your tasks. You carry it with you.
\nSimilar companions are now familiar across modern science fiction. Destiny 2 has Ghost, a small floating AI bound to its Guardian. Star Wars Jedi: Survivor has BD-1, a droid riding on Cal Kestis's back. Same pattern: a compact, mobile, personal intelligence that augments rather than replaces.
\nThat image — small, local, loyal, always with you — drives the engineering choices in the rest of this page. Terraphim runs on your hardware, codifies knowledge as compact graphs rather than heavyweight models, and never ships your data across a boundary. The sci-fi premise is the brief; what follows is how we built it.
\nTerraphim AI did not start as a product. It started as a frustration with how slowly machine learning pipelines process data.
\nThe project's predecessor, The Pattern, grew out of participation in two Kaggle data science competitions. The original ML pipeline took six days and still could not finish processing the data. The Pattern processed the same training data in six hours and achieved sub-two-millisecond inference. That improvement was not a hardware upgrade — it was a fundamental rethinking of how search and retrieval should work.
\nThe Pattern was awarded Platinum Winner at the Redis Hackathon, outperforming Nvidia's ML pipeline for BERT QA inference on CPU. The prize confirmed that external experts recognised the approach as genuinely innovative, not merely a clever optimisation.
\nThe results were presented and discussed at a public lecture at Oxford University, Green Templeton College, bringing academic scrutiny to the architecture that would become Terraphim AI.
\nTraditional search systems rely on dense vector embeddings and attention mechanisms that are expensive, opaque, and non-deterministic. Terraphim took a different path.
\nTerraphim Graph Embeddings maintain the position of terms in a sentence without requiring traditional training techniques like attention. Users can specify synonyms manually and rebuild graph embeddings for a role within 20 milliseconds. This allows matching terms in different languages to the same concept without running language detection, and eliminates the need for stop-word dictionaries entirely.
\nBy rethinking from the ground up, Terraphim AI achieves pipeline processing in hundreds of milliseconds and knowledge-graph-based inference in 5 to 10 nanoseconds.
\nThe methodology has been validated within the INCOSE (International Council on Systems Engineering) community for the Systems Engineering Handbook v.4 and the Systems Engineering Digital Process Model v.1. It was recognised as a valid low-effort substitution for formal model-based systems engineering — particularly valuable for brownfield systems engineering, reverse-engineering, and professional certification.
\nResearch consistently shows the problem Terraphim exists to solve:
\nTerraphim's answer: deterministic, privacy-first, knowledge-graph-powered search that runs entirely on your hardware.
\nTerraphim uses graph embeddings instead of neural vector embeddings. Terms and concepts\nare represented as nodes in a role-specific knowledge graph, with relationships encoded as\nedges. Matching a query against a graph is deterministic, auditable, and fast.
\nSee the full design note for how RoleGraph composes\nAho-Corasick matching with PageRank ordering to produce ranked results in\nsub-millisecond time, without any floating-point vector math.
\nThis is a fundamentally different model from dense vector retrieval. It is what makes\nTerraphim run on a laptop, a Raspberry Pi, or inside a browser extension.
\n", "https://terraphim.ai/properties/date" : "2026-04-15", "https://atomicdata.dev/properties/tags" : {"categories":["Capability"],"tags":["graph-embeddings","aho-corasick","knowledge-graph","search"]}, "https://atomicdata.dev/properties/isA": [ "https://atomicdata.dev/classes/Article" ] },{"localId": "https://terraphim.ai/capabilities/multi-haystack/", "https://atomicdata.dev/properties/name" : "Multi-Haystack Search", "https://terraphim.ai/properties/url" : "https://terraphim.ai/capabilities/multi-haystack/", "https://atomicdata.dev/properties/description" : "Terraphim roles can be configured with multiple haystacks — named sources of content.\nA single query fans out across every haystack attached to the role and the results come\nback ranked by the role's knowledge graph.
\nHaystack adapters available today:
\nSee the release post Introducing Multi-Haystack Roles: Local + Global Code Search\nfor a walkthrough of combining a local haystack with grepapp to search both your code and\nthe world's code in one query.
\n", "https://terraphim.ai/properties/date" : "2026-04-15", "https://atomicdata.dev/properties/tags" : {"categories":["Capability"],"tags":["haystack","search","grepapp","code-search"]}, "https://atomicdata.dev/properties/isA": [ "https://atomicdata.dev/classes/Article" ] },{"localId": "https://terraphim.ai/capabilities/terraphim-agent/", "https://atomicdata.dev/properties/name" : "terraphim-agent CLI", "https://terraphim.ai/properties/url" : "https://terraphim.ai/capabilities/terraphim-agent/", "https://atomicdata.dev/properties/description" : "terraphim-agent is a command-line AI agent that searches before it answers. It runs\nlocal knowledge graph queries, imports session history from Claude Code and Cursor,\ncaptures failed commands via post-tool-use hooks, and surfaces past corrections when\nyou encounter a recurring error.
Install it with one command:
\ncargo install terraphim-agent\n\nKey capabilities:
- The learn subcommand records failed commands and their corrections.
- The replace subcommand rewrites commands such as npm install to bun add or pip install to uv add via a thesaurus.
- learn compile converts captured ToolPreference corrections into a thesaurus JSON that the replace command loads directly, closing the feedback loop from failure to live rewrite.
- evaluate measures automata classification accuracy with precision, recall, and F1 against a ground-truth JSON file, and flags terms that consistently produce false positives.
- The listener dispatches terraphim-agent subcommands triggered by @adf:<agent-name> Gitea mentions, with three security layers (allowlist, metachar rejection, CommandGuard), and posts results back as markdown comments.
See the command rewriting how-to and the blog post Teaching AI Agents to Learn from Their Mistakes for worked examples.
\n", "https://terraphim.ai/properties/date" : "2026-04-15", "https://atomicdata.dev/properties/tags" : {"categories":["Capability"],"tags":["agent","cli","ai-agents","learning"]}, "https://atomicdata.dev/properties/isA": [ "https://atomicdata.dev/classes/Article" ] },{"localId": "https://terraphim.ai/capabilities/evaluation/", "https://atomicdata.dev/properties/name" : "Automata Evaluation", "https://terraphim.ai/properties/url" : "https://terraphim.ai/capabilities/evaluation/", "https://atomicdata.dev/properties/description" : "The evaluation framework measures how accurately the Aho-Corasick automata classify terms in\nreal documents. Give it a JSON file of hand-labeled documents and it returns micro-averaged\nprecision, recall, and F1, a per-term breakdown, and a list of terms that consistently\nproduce false positives.
\n[\n {\n "id": "doc1",\n "text": "tokio powers async rust applications",\n "expected_terms": [\n { "term": "tokio", "category": null },\n { "term": "rust", "category": null },\n { "term": "async", "category": null }\n ]\n }\n]\n\nEach term must match the normalized term value (nterm) stored in the thesaurus.
use terraphim_automata::evaluation::{evaluate, load_ground_truth};\n\nlet docs = load_ground_truth(Path::new("ground_truth.json"))?;\nlet result = evaluate(&docs, thesaurus);\n\nprintln!("F1: {:.2}", result.overall.f1);\nfor report in &result.per_term {\n println!(" {} precision={:.2} recall={:.2}",\n report.term, report.metrics.precision, report.metrics.recall);\n}\nfor err in &result.systematic_errors {\n println!(" SYSTEMATIC FP: {} in {} documents", err.term, err.false_positive_count);\n}\n\nMetrics are micro-averaged: true positives, false positives, and false negatives are\nsummed across all documents before dividing.
\nA term is flagged as a SystematicError when it appears as a false positive in 2 or more\ndocuments. Matching is case-insensitive; each term is counted at most once per document.
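The micro-averaging rule can be written down directly: sum the per-document counts first, divide once. The counts below are hypothetical, not output from the real evaluator:

```rust
// Micro-averaged precision/recall/F1 from per-document
// (true positive, false positive, false negative) counts.
fn micro_f1(counts: &[(u32, u32, u32)]) -> (f64, f64, f64) {
    // Sum TP, FP, FN across all documents before dividing.
    let (tp, fp, fneg) = counts
        .iter()
        .fold((0u32, 0u32, 0u32), |acc, c| (acc.0 + c.0, acc.1 + c.1, acc.2 + c.2));
    let precision = tp as f64 / (tp + fp) as f64;
    let recall = tp as f64 / (tp + fneg) as f64;
    let f1 = 2.0 * precision * recall / (precision + recall);
    (precision, recall, f1)
}

fn main() {
    // Hypothetical (tp, fp, fn) per document.
    let per_doc = [(8, 2, 0), (2, 0, 2)];
    let (p, r, f1) = micro_f1(&per_doc);
    // Totals: tp=10, fp=2, fn=2 -> all three metrics are 10/12.
    assert!((p - 10.0 / 12.0).abs() < 1e-9);
    assert!((r - 10.0 / 12.0).abs() < 1e-9);
    assert!((f1 - 10.0 / 12.0).abs() < 1e-9);
}
```

Micro-averaging weights every match equally, so a term that appears in many documents influences the overall score more than a rare one, which is the behaviour you want when hunting systematic false positives.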
Source: crates/terraphim_automata/src/evaluation.rs
The listener dispatch feature lets you run terraphim-agent subcommands by mentioning the\nagent in a Gitea issue or comment. The result is posted back as a formatted markdown comment.
Mention the agent in a Gitea comment with a subcommand and arguments:
\n@adf:worker search "knowledge graph" --role engineer\n@adf:worker evaluate --role engineer\n@adf:worker learn list\n\nThe listener parses the text after @adf:, validates it through three security layers, runs\nthe command, and posts the output back to the same issue.
Three checks must all pass before the process is spawned:
\n1. Shell metacharacter rejection — Input containing |, ;, &, `, $, (,\n), <, or > is rejected immediately. No shell is involved in execution, but this\nprevents confusion if the input is ever logged or replayed.
2. Subcommand allowlist / denylist — Only known safe subcommands are permitted.\nDenied subcommands (listen, repl, interactive, setup, update, sessions)\nare blocked even if added to extra_allowed_subcommands.
3. CommandGuard — A pattern-based guard runs on the full command string before\nprocess spawn. It blocks destructive patterns such as git reset --hard.
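The three layers above compose into a single validation pass. This is a condensed, illustrative sketch: the denylist matches the one listed, but the allowlist is invented for the example, and the real checks live in crates/terraphim_agent/src/shell_dispatch.rs:

```rust
// Denylisted subcommands (as documented); blocked even if allowlisted.
const DENIED: &[&str] = &["listen", "repl", "interactive", "setup", "update", "sessions"];
// Illustrative allowlist; the real one is configuration-driven.
const ALLOWED: &[&str] = &["search", "evaluate", "learn", "replace"];

fn validate(input: &str) -> Result<(), String> {
    // Layer 1: reject shell metacharacters outright.
    if input.chars().any(|c| "|;&`$()<>".contains(c)) {
        return Err("shell metacharacter rejected".into());
    }
    // Layer 2: allowlist / denylist on the subcommand (first token).
    let sub = input.split_whitespace().next().unwrap_or("");
    if DENIED.contains(&sub) || !ALLOWED.contains(&sub) {
        return Err(format!("subcommand not permitted: {sub}"));
    }
    // Layer 3 (CommandGuard, pattern-based, e.g. blocking `git reset --hard`)
    // would run here on the full command string.
    Ok(())
}

fn main() {
    assert!(validate("learn list").is_ok());
    assert!(validate("repl").is_err());                  // denylisted
    assert!(validate("search foo | rm -rf /").is_err()); // metacharacter
}
```

Only if all three checks pass is the process spawned, and even then no shell is involved, so the metacharacter check is defence in depth rather than the primary barrier.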
Enable dispatch in your listener config JSON:
\n{\n "identity": { "agent_name": "worker" },\n "gitea": {\n "base_url": "https://git.terraphim.cloud",\n "owner": "terraphim",\n "repo": "terraphim-ai"\n },\n "dispatch": {\n "timeout_secs": 300,\n "max_output_bytes": 48000,\n "specialist_routes": {\n "evaluate": "eval-bot"\n }\n }\n}\n\nspecialist_routes routes specific subcommands to a named agent rather than running locally.
The comment posted back to Gitea includes the exit code, elapsed time, stdout in a code\nblock, and stderr in a collapsible section. Output is capped at 48 KB. Commands that exceed\n5 minutes are killed and marked **TIMED OUT**.
The --robot flag is always appended, so output is machine-readable JSON regardless of the\nuser's default format.
Source: crates/terraphim_agent/src/shell_dispatch.rs, crates/terraphim_agent/src/listener.rs
This guide shows how to use terraphim-agent to rewrite shell commands before\nexecution — for example npm install -> bun add or pip install -> uv add — by plugging a knowledge-graph-backed thesaurus into your AI coding\nagent's tool-execution hook.
The mechanism composes three pieces that already exist in terraphim-agent:
\n1. Knowledge graph markdown files in ~/.config/terraphim/docs/src/kg/ (or any role-configured path).
2. terraphim-agent replace — Aho-Corasick replacement that rewrites text using a role's compiled thesaurus.
3. A hook that pipes each tool command through replace, and writes the result back into the tool's args.
Prerequisites:
- terraphim-agent on PATH (any recent release; 1.16.33 or later).
- A Terraphim Engineer role pointing at ~/.config/terraphim/docs/src/kg/.
- An agent with a tool.execute.before style plugin API (OpenCode has one; Claude Code exposes equivalent hooks via shell scripts).
Each concept is one markdown file. The filename stem becomes the concept key; the H1 heading provides the display name used as the replacement; the synonyms:: line lists terms that should be rewritten to it.
Example ~/.config/terraphim/docs/src/kg/bun install.md:
# bun add\n\nInstall dependencies using Bun package manager.\n\nsynonyms:: npm install, yarn install, pnpm install, npm i, yarn add, pnpm add\n\nConventions that matter in practice:
\n- The H1 heading is the replacement that gets emitted (bun add, not bun install).
- python -m pip install is a valid synonym and is matched as a whole phrase; the Aho-Corasick automaton uses LeftmostLongest, so the longer phrase wins when a shorter one would also match.
- If both uv.md and uv add.md claim pip install, the behaviour becomes non-deterministic at rebuild time. Keep single-token synonyms in the short file (pip -> uv) and multi-token phrases in the specific file (pip install -> uv add).
Terraphim Engineer role:
| File | Maps to | Covers |
|---|---|---|
bun.md | bun | npm, yarn, pnpm |
bun install.md | bun add | npm install, yarn install, pnpm install, npm i, yarn add, pnpm add |
bun run.md | bun run | npm run, yarn run, pnpm run |
bunx.md | bunx | npx, pnpx, yarn dlx |
uv.md | uv | pip, pip3, pipx |
uv add.md | uv add | pip install, pip3 install, pip add, pipx install, python -m pip install |
uv sync.md | uv sync | pip install -r requirements.txt |
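The LeftmostLongest convention can be illustrated with a small std-only sketch. The real engine compiles the thesaurus into an Aho-Corasick automaton configured for leftmost-longest matching; this ASCII-only version just tries longer patterns first at each position, which yields the same outcome for these rules:

```rust
// Leftmost-longest sketch: at every position, attempt the longest
// pattern first, so "pip install" beats its prefix "pip".
fn rewrite(text: &str, rules: &[(&str, &str)]) -> String {
    // Sort patterns longest-first so the longest always wins.
    let mut rules: Vec<_> = rules.to_vec();
    rules.sort_by_key(|(from, _)| std::cmp::Reverse(from.len()));

    let mut out = String::new();
    let bytes = text.as_bytes();
    let mut i = 0;
    while i < bytes.len() {
        let rest = &text[i..];
        if let Some((from, to)) = rules.iter().find(|(f, _)| rest.starts_with(f)) {
            out.push_str(to);
            i += from.len();
        } else {
            out.push(bytes[i] as char); // ASCII-only sketch
            i += 1;
        }
    }
    out
}

fn main() {
    let rules = [("pip install", "uv add"), ("pip", "uv")];
    assert_eq!(rewrite("pip install requests", &rules), "uv add requests");
    assert_eq!(rewrite("pip freeze", &rules), "uv freeze");
}
```

With both rules loaded, "pip install requests" becomes "uv add requests" rather than the broken "uv install requests" that shortest-first matching would produce.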
printf "npm install express" \\\n | terraphim-agent replace --role "Terraphim Engineer" --fail-open --json\n\nExpected output:
\n{"result":"bun add express","original":"npm install express","replacements":1,"changed":true}\n\nFlags worth knowing:
\n--fail-open — on any error, emits the input unchanged. Mandatory in\nhooks so a misconfigured terraphim-agent never wedges the agent.--json — structured output with result, changed, replacements.\nUse this if the hook needs to branch on whether anything changed.--format plain|markdown|wiki|html — how the replacement is wrapped.\nHooks want plain.Terraphim caches compiled thesauri in a SQLite database at\n/tmp/terraphim_sqlite/terraphim.db (path configured by\ncrates/terraphim_settings/default/settings.toml). Editing a KG markdown\nfile does not invalidate this cache; replace keeps returning the old\nmapping until you flush it.
sqlite3 /tmp/terraphim_sqlite/terraphim.db \\\n "DELETE FROM terraphim_kv WHERE key LIKE 'thesaurus_%' OR key LIKE 'document_ripgrep_%';"\n\nBecause /tmp/ is wiped on reboot, a fresh boot always gives the\nup-to-date thesaurus.
OpenCode plugins expose tool.execute.before(input, output) where\noutput.args.command is the mutable shell command about to run. The same\npattern works in Claude Code via the PreToolUse hook script, just with\nshell-stdin instead of a JS closure.
// ~/.config/opencode/plugin/terraphim-hooks.js\nconst REWRITE_MODE = process.env.TERRAPHIM_REWRITE_MODE || "suggest"\nconst REWRITE_ROLE = process.env.TERRAPHIM_REWRITE_ROLE || "Terraphim Engineer"\nconst AUDIT_LOG = `${process.env.HOME}/Library/Application Support/terraphim/rewrites.log`\n\n// Narrow whitelist of commands whose argument grammar survives a synonym swap.\nconst REWRITEABLE_HEADS =\n /^\\s*(npm|yarn|pnpm|npx|pnpx|pip|pip3|pipx|python\\s+-m\\s+pip|python3\\s+-m\\s+pip)\\b/i\n\nexport const TerraphimHooks = async ({ $ }) => ({\n "tool.execute.before": async (input, output) => {\n if (input.tool !== "Bash" || !output.args?.command) return\n const command = output.args.command\n\n const agent = `${process.env.HOME}/.cargo/bin/terraphim-agent`\n\n // Always run the destructive-command guard first.\n const g = await $`${agent} guard ${command} --json --fail-open 2>/dev/null || echo '{"decision":"allow"}'`\n const guard = JSON.parse(g.stdout)\n if (guard.decision === "block") {\n throw new Error(`BLOCKED: ${guard.reason}`)\n }\n\n const isGitCommit = /git\\s+(-C\\s+\\S+\\s+)?commit/i.test(command)\n const isRewriteable = REWRITEABLE_HEADS.test(command)\n if (!isGitCommit && !isRewriteable) return\n\n const res = await $`echo ${command} | ${agent} replace --role ${REWRITE_ROLE} --fail-open --json 2>/dev/null`\n const parsed = JSON.parse(res.stdout)\n const rewrite = (parsed.result || "").trim()\n if (!parsed.changed || !rewrite || rewrite === command) return\n\n const line = [\n new Date().toISOString(), REWRITE_MODE,\n isGitCommit ? "git-commit" : "pkg-mgr",\n command.replace(/[\\t\\n\\r]/g, " "),\n rewrite.replace(/[\\t\\n\\r]/g, " "),\n ].join("\\t") + "\\n"\n await $`mkdir -p "$(dirname ${AUDIT_LOG})" < /dev/null && printf %s ${line} >> ${AUDIT_LOG}`\n\n if (REWRITE_MODE === "apply" || isGitCommit) {\n output.args.command = rewrite\n }\n },\n})\n\nDesign notes:
\n- Only commands matching REWRITEABLE_HEADS are candidates.
- Set TERRAPHIM_REWRITE_MODE=apply once you trust the diffs. Git commit rewriting always applies because commit messages are prose, not syntax.
- Every rewrite is logged to ~/Library/Application Support/terraphim/rewrites.log so you can diff before flipping modes.
- Every agent call uses --fail-open and || fallbacks. If terraphim-agent is missing, commands pass through unchanged.
With the hook installed and the cache flushed, open your agent, ask it to run npm install express, and inspect the audit log:
tail -n 5 ~/Library/Application\\ Support/terraphim/rewrites.log\n\nYou should see a line like:
\n2026-04-15T11:32:51.129Z suggest pkg-mgr npm install express bun add express\n\nIn suggest mode the command still executes as npm install express; in\napply mode the agent actually runs bun add express.
terraphim-agent learn hook --format <claude|codex|opencode> has three\nmodes driven by --learn-hook-type:
- post-tool-use — the default, captures failed Bash commands as learnings. This is already wired into the OpenCode plugin's tool.execute.after callback.
- pre-tool-use — checks if the command matches a past failure pattern and stashes the hint to ~/.local/share/terraphim/session-hints.txt for LLM consumption. Does not block and does not print to the user terminal.
- user-prompt-submit — scans the user's prompt for patterns like \"use X instead of Y\" or \"prefer X over Y\" and records a ToolPreference correction under ~/Library/Application Support/terraphim/learnings/correction-*.md.
At present these corrections are stored but not yet fed back into the replacement thesaurus. Closing that loop is tracked as future work — see the accompanying GitHub issue \"Learning-driven command correction: Phase 2 & 3\".
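The user-prompt-submit scan can be approximated with plain string matching. This is a hypothetical reconstruction of the pattern detection, not the shipped parser:

```rust
// Extract a (preferred, avoided) tool preference from prompts shaped
// like "use X instead of Y" or "prefer X over Y". Illustration only.
fn extract_preference(prompt: &str) -> Option<(String, String)> {
    let p = prompt.to_lowercase();
    for (lead, sep) in [("use ", " instead of "), ("prefer ", " over ")] {
        if let Some(start) = p.find(lead) {
            let rest = &p[start + lead.len()..];
            if let Some(mid) = rest.find(sep) {
                let preferred = rest[..mid].trim().to_string();
                // Take the first word after the separator, dropping trailing punctuation.
                let avoided = rest[mid + sep.len()..]
                    .split_whitespace()
                    .next()
                    .unwrap_or("")
                    .trim_end_matches('.')
                    .to_string();
                return Some((preferred, avoided));
            }
        }
    }
    None
}

fn main() {
    assert_eq!(
        extract_preference("Please use bun instead of npm for installs."),
        Some(("bun".into(), "npm".into()))
    );
    assert_eq!(
        extract_preference("I prefer uv over pip."),
        Some(("uv".into(), "pip".into()))
    );
    assert_eq!(extract_preference("run the tests"), None);
}
```

A correction extracted this way maps naturally onto a thesaurus entry (avoided term as synonym, preferred term as replacement), which is exactly the loop the Phase 2 & 3 work aims to close.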
\nreplace returns the original unchanged.\nRun terraphim-agent search \"<synonym>\" --role \"<role>\" — if the concept\nappears, the KG is loaded but the synonym is not. Confirm the synonym is on\nthe synonyms:: line (case-insensitive; commas separate entries). Flush\nthe cache (section 3) and retry.
Failed to load thesaurus: NotFound(\"thesaurus_...\") in stderr.\nCosmetic. The agent looked for a pre-compiled JSON thesaurus first, didn't\nfind one, and fell back to building from markdown. Expected on first run.
Hook does nothing in OpenCode.\nCheck the plugin loaded: grep terraphim-hooks ~/.local/share/opencode/log/$(ls -t ~/.local/share/opencode/log/ | head -1).\nYou should see a line like service=plugin path=...terraphim-hooks.js loading plugin. If absent, the plugin file is in the wrong directory —\nOpenCode autoloads from ~/.config/opencode/plugin/ and\n~/.config/opencode/plugins/.
Commands get double-rewritten on retry.\nThe hook only touches tool.execute.before; the agent does not loop back\nthrough the hook on its own retries. If you see double rewrites, check\nwhether input.tool === \"Bash\" is spelt exactly — OpenCode passes\n\"Bash\", not \"bash\".