mycelium

Local-only code intelligence. Structural graph + hybrid search + AI chat — all on your machine.

Quick Start · Documentation · MCP Setup · FAQ

🧬 What it does

Mycelium parses your local repos, builds a structural graph of every function, class, import, and call relationship, embeds the code for semantic search, and exposes everything through a chat UI and MCP server.

Index — crawls files with tree-sitter, detects workspaces, resolves imports, embeds code via OpenAI, stores everything in a Postgres graph (pgvector + full-text search).
Search — hybrid search fuses keyword matching and vector similarity via Reciprocal Rank Fusion. Structural queries traverse the code graph (callers, callees, dependencies, dependents, importers).
Chat — ask questions about your codebase and get answers grounded in the indexed graph, with source attribution and streamed responses.
MCP — expose search and graph queries as tools for AI coding agents (Claude Code, Cursor, etc.).

🏗️ Architecture

Key design decisions

Postgres does everything. No Redis, no Elasticsearch, no Milvus. pgvector handles embeddings, built-in full-text search handles keywords, recursive CTEs handle graph traversal. One database, one connection pool.
Hybrid search with RRF. Two parallel searches (keyword + semantic) merged via Reciprocal Rank Fusion. Exact name matches rank first; conceptual matches still surface.
Incremental indexing. git diff + body hash comparison. Only modified symbols hit the OpenAI API.
Generated tsvector column. Postgres auto-maintains the keyword index on every insert/update. Zero application code.
Tree-sitter for parsing. Language-agnostic AST extraction. Adding a new language = implementing one interface.

Full design rationale →

⚡ Quick Start

Prerequisites: Docker (full list)

# 1. Clone and configure
git clone https://github.com/maximilianfalco/mycelium.git
cd mycelium
cat > .env <<EOF
OPENAI_API_KEY=sk-...
REPOS_PATH=/path/to/your/code
EOF

# 2. Start everything (runs in background — no terminal needed)
make docker-up

REPOS_PATH is the root directory containing the repos you want to index (e.g., ~/Desktop/Code). It's bind-mounted into the API container so the indexer can read your source files.

This starts:

Service	URL
Next.js frontend	localhost:3773
Go API	localhost:8080
Postgres	localhost:5433
pgAdmin	localhost:5050

All services run in the background. Use make docker-logs to tail output, make docker-down to stop.

🔌 MCP server setup (for Claude Code, Cursor, etc.)

The .mcp.json in the project root auto-configures Claude Code. For other clients, add to your MCP config:

{
  "mcpServers": {
    "mycelium": {
      "command": "bash",
      "args": ["/path/to/mycelium/scripts/mcp.sh"],
      "env": {
        "DATABASE_URL": "postgresql://mycelium:mycelium@localhost:5433/mycelium",
        "OPENAI_API_KEY": "sk-..."
      }
    }
  }
}

Available tools: search, query_graph, list_projects

Full MCP setup guide →

📋 All make commands

# Docker (recommended — runs in background)
make docker-up      # build and start all services
make docker-down    # stop all services
make docker-logs    # tail logs from all services
make docker-build   # build images without starting
make docker-rebuild # full rebuild from scratch

# Local development (requires Go 1.22+, Node.js 22+)
make dev        # start full stack with hot reload (requires open terminal)
make build      # compile Go binary
make test       # run all tests (unit + integration)
make lint       # go vet
make clean      # remove binary + test cache
make db         # start Postgres only
make api        # start Go API only
make frontend   # start Next.js frontend only

🔍 Features

Supported languages

Language	Extensions	Parser	Workspace detection
TypeScript	`.ts`, `.tsx`	Tree-sitter	package.json, tsconfig.json, pnpm/yarn/npm workspaces
JavaScript	`.js`, `.jsx`	Tree-sitter	package.json, pnpm/yarn/npm workspaces
Go	`.go`	Tree-sitter	go.mod, go.work

7-stage indexing pipeline

Change detection — git diff against last indexed commit. Threshold guard prevents accidental full re-indexes.
Workspace detection — finds package.json / go.mod / go.work, resolves monorepo structure.
File crawling — walks the directory tree, respects .gitignore.
Parsing — tree-sitter extracts functions, classes, types, and all edges. 8 parallel workers.
Import resolution — resolves specifiers against alias maps, tsconfig paths, and filesystem.
Embedding — body hash comparison skips unchanged nodes. Batched OpenAI API calls.
Graph storage — upserts to Postgres, cleans up stale nodes.

Full pipeline documentation →

Hybrid search

Every query runs two searches in a single Postgres transaction:

Signal	How it works
Keyword	Postgres full-text search over GIN-indexed generated column. Weighted: names (A), signatures (B), docstrings (C).
Semantic	pgvector cosine similarity against 1536-dim embeddings. IVFFlat index.
Fusion	RRF: `score = 1/(60 + rank_vector) + 1/(60 + rank_keyword)`. 3x candidate oversampling.

How hybrid search works →

Structural graph queries

Query	Returns
`callers`	Functions that call the target symbol
`callees`	Functions called by the target symbol
`importers`	Files that import the target
`dependencies`	Transitive dependencies (up to 5 hops)
`dependents`	Transitive dependents (up to 5 hops)
`file`	All symbols in the same file

Graph query documentation →

Streamed AI chat

Context assembly pulls relevant code from the graph (hybrid search + graph expansion), packs it within a token budget, and streams responses via SSE with source attribution.

🛠️ Tech Stack

Component	Choice
Backend	Go (Chi router, pgx for Postgres)
Frontend	Next.js 16 (App Router, TypeScript, shadcn/ui)
Database	Postgres 16 + pgvector
Parsing	Tree-sitter (TypeScript, JavaScript, Go)
Embeddings	OpenAI `text-embedding-3-small`
Search	Hybrid: Postgres FTS + pgvector cosine, fused via RRF
Chat	OpenAI `gpt-4o`
MCP	`mcp-go` (stdio transport)

🍄 Frontend

The UI uses fungi terminology:

UI term	Backend term
Colony	Project
Substrate	Source (linked repo/directory)
Decompose	Index
Forage	Chat/Search
Spore lab	Debug mode

Four tabs per project:

Substrates — manage linked source directories, trigger indexing
Forage — chat with your codebase (streamed responses, source attribution)
Spore lab — run individual pipeline stages interactively for debugging
Mycelial map — graph visualization (coming soon)

📖 Documentation

Section	Description
Quick Start	Get up and running in 1 minute
Prerequisites	Required and optional dependencies
MCP Setup	Configure for Claude Code, Cursor, etc.
Environment Variables	All configuration options
Design Decisions	Why Postgres-only, why RRF, why tree-sitter
Pipeline Orchestrator	7-stage indexing pipeline
Change Detector	Git diff + mtime change detection
Workspace Detection	Monorepo and package discovery
Parsers & Crawling	Tree-sitter + file crawling
Chunker	Embedding input preparation + tokenization
Embedder	OpenAI API wrapper with batching + retry
Graph Builder	Postgres upsert, stale cleanup
Hybrid Search	Keyword + semantic fusion via RRF
Graph Queries	Structural traversal (callers, deps, etc.)
FAQ	Common questions

📄 License

Apache 2.0

Name		Name	Last commit message	Last commit date
Latest commit History 82 Commits
.claude		.claude
.github/workflows		.github/workflows
cmd/myc		cmd/myc
docs		docs
frontend		frontend
internal		internal
scripts		scripts
tests		tests
.air.toml		.air.toml
.dockerignore		.dockerignore
.gitignore		.gitignore
.mcp.json		.mcp.json
CLAUDE.md		CLAUDE.md
Dockerfile.api		Dockerfile.api
Dockerfile.frontend		Dockerfile.frontend
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
docker-compose.yml		docker-compose.yml
go.mod		go.mod
go.sum		go.sum

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

mycelium

🧬 What it does

🏗️ Architecture

⚡ Quick Start

🔍 Features

Supported languages

7-stage indexing pipeline

Hybrid search

Structural graph queries

Streamed AI chat

🛠️ Tech Stack

🍄 Frontend

📖 Documentation

📄 License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

mycelium

🧬 What it does

🏗️ Architecture

⚡ Quick Start

🔍 Features

Supported languages

7-stage indexing pipeline

Hybrid search

Structural graph queries

Streamed AI chat

🛠️ Tech Stack

🍄 Frontend

📖 Documentation

📄 License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages