Graphify: Turn Any Folder Into a Knowledge Graph Your OpenClaw Agent Can Query
Graphify is a new
open-source skill that adds a /graphify command to OpenClaw (and a dozen other
AI agents). Point it at any folder โ a codebase, a notes dump, a research archive โ and it
builds a persistent, queryable knowledge graph. The headline number: 71.5x fewer
tokens per query compared to reading raw files.
The Problem It Solves
Andrej Karpathy has a /raw folder where he drops papers, tweets, screenshots,
and notes. The graphify README calls this out directly: that folder is a classic problem for
AI agents โ too much unstructured content to dump into context, no structure to query against.
The same problem shows up everywhere: a codebase you're trying to understand, a research dump, years of markdown notes, a client project folder. Your agent can read individual files, but it can't reason about the relationships between them without loading all of them โ which burns tokens and blows context limits.
Graphify builds a persistent graph of concepts and relationships across all those files once, then your agent queries the graph instead of re-reading the raw files every session.
What It Produces
Running /graphify . on any folder outputs:
graphify-out/
โโโ graph.html # interactive graph โ click nodes, search, filter
โโโ GRAPH_REPORT.md # god nodes, surprising connections, suggested questions
โโโ graph.json # persistent graph โ query weeks later without re-reading
โโโ cache/ # SHA256 cache โ re-runs only process changed files
The GRAPH_REPORT.md is particularly useful โ it surfaces "god nodes" (concepts
that appear everywhere), surprising connections you wouldn't have found manually, and
suggested questions worth exploring. The graph.html is an interactive visual
you can open in any browser.
Fully Multimodal
Graphify doesn't just handle code. It processes:
- Source code โ 25 languages via tree-sitter AST (Python, JS/TS, Go, Rust, Java, C/C++, Ruby, C#, Kotlin, Swift, and more)
- PDFs and markdown
- Screenshots and diagrams (including whiteboard photos and images in foreign languages)
- Video and audio โ transcribed with Whisper using a domain-aware prompt derived from your corpus
Everything goes into one unified graph, cross-referenced.
How to Use It With OpenClaw
Install via pip:
pip install graphifyy
Then in your OpenClaw agent, just type:
/graphify /path/to/your/folder
The agent runs the extraction, builds the graph, and saves the output. On subsequent
sessions, it queries graph.json directly โ no re-reading files, no re-running
extraction unless files changed (SHA256 cache handles that).
To exclude folders (same syntax as .gitignore):
# .graphifyignore
node_modules/
dist/
vendor/
*.generated.py
The Token Math
The 71.5x token reduction claim comes from querying the graph structure vs. loading raw files. A codebase with hundreds of files might take 500K+ tokens to load entirely; querying the graph for "what calls the payment module?" returns the relevant subgraph in a fraction of that.
For OpenClaw users running on API billing, this is directly relevant. Large workspace contexts are one of the biggest drivers of API cost. If you're working with a complex codebase or large research archive, graphify could cut your per-session token spend significantly.
OpenClaw-Specific Notes
The graphify README notes that OpenClaw uses sequential extraction (parallel agent support is still early). This means large folders will take longer to process on first run compared to Codex or Claude Code (which support parallel extraction). For most use cases this is a one-time cost โ the cache handles incremental updates efficiently.
Use Cases Worth Trying
- Understanding a new codebase โ run graphify before diving in, let the GRAPH_REPORT.md orient you
- Research archive โ papers, notes, bookmarks accumulated over years; ask "what themes show up?" without reading everything
- Client project context โ index all project files so your agent can answer questions about any part without context overflow
- Workspace memory extension โ complement OpenClaw's built-in memory.md with structured graph knowledge for large/complex domains
GitHub: safishamsi/graphify ยท PyPI: graphifyy
Want your OpenClaw set up to handle complex projects like this?
ClawReady configures your agent's memory architecture, workspace structure, and tooling so it can actually handle real workloads โ not just answer simple questions. Starting at $99.
Book a Free Call โ