Local-first LLM state branching & debugging.
Treat AI context windows like a Git repo. Capture, visualize, branch, and regression-test LLM calls as a DAG โ free & local via Ollama, any OpenAI-compatible API, and MCP for agents. Zero config, no cloud, no account.
The dashboard: a conversation tree with a branch off the root, the node inspector (request/response, tokens, provenance), and the Fork dialog.
Every LLM call is a node in a DAG. Fork an alternative prompt or model from any historical turn.
Defaults to open-source models via Ollama. No API key, no database, no telemetry.
Provider-agnostic proxy. OpenAI, Groq, OpenRouter, Together, LM Studio, vLLM, Anthropic.
Bytes pass through untouched; the proxy reconstructs the full message โ text and tool calls.
An agent can query its own history mid-task โ recall attempts, trace lineage, self-correct.
Pin good outputs as baselines, re-run after prompt tweaks, catch drift in CI.
# 1. a free local model (install Ollama from ollama.com)
ollama pull llama3
# 2. start ForkMind (proxy + dashboard on :4500)
npx github:medhovarsh/forkmind start
# 3. point your OpenAI client at the proxy
# baseURL: http://localhost:4500/v1
# open http://localhost:4500 for the dashboard
ForkMind is a local-first tool that treats AI context windows like a Git repository. It captures every LLM call into a local .forkmind directory, visualizes the conversation as a Directed Acyclic Graph (DAG), and lets you branch alternative prompts or models from any historical turn โ all on your machine, no cloud, no account.
Yes. ForkMind is MIT-licensed and runs entirely on your machine. It defaults to free, open-source models via Ollama โ no API key, no database, no telemetry.
ForkMind is a provider-agnostic proxy. It works with any OpenAI-compatible API, including Ollama, OpenAI, Anthropic, Groq, OpenRouter, Together, LM Studio, and vLLM.
ForkMind runs straight from its git link โ no npm registry needed. Run npx github:medhovarsh/forkmind start to launch the proxy and dashboard on port 4500, then point your OpenAI client at http://localhost:4500/v1.
ForkMind exposes an MCP (Model Context Protocol) server. An agent can query its own history mid-task to recall previous attempts, trace lineage, and self-correct.