What can code-review-graph do?

tree-sitter-based incremental codebase parsing with sha-256 change tracking, blast-radius impact analysis with dependency graph traversal, evaluation framework with benchmark metrics and token reduction reporting, graph storage and persistence with sqlite backend, mcp server integration with claude code and llm assistants, semantic search and embedding-based code retrieval, watch mode with auto-update hooks for continuous graph synchronization, code review context generation with token-optimized summaries, vs code extension with graph visualization and interactive exploration, multi-language support with language-agnostic graph schema, incremental graph update system with delta computation, cli command suite for graph management and analysis

code-review-graph

MCP ServerFree

Local knowledge graph for Claude Code. Builds a persistent map of your codebase so Claude reads only what matters — 6.8× fewer tokens on reviews and up to 49× on daily coding tasks.

Open Source

/ 100

12 capabilities

Capabilities12 decomposed

tree-sitter-based incremental codebase parsing with sha-256 change tracking

Medium confidence

Parses source code using Tree-sitter AST parsing across 40+ languages, extracting structural entities (functions, classes, types, imports) and storing them in a persistent knowledge graph. Tracks file changes via SHA-256 hashing to enable incremental updates—only re-parsing modified files rather than rescanning the entire codebase on each invocation. The parser system maintains a directed graph of code entities and their relationships (CALLS, IMPORTS_FROM, INHERITS, CONTAINS, TESTED_BY, DEPENDS_ON) without requiring full re-indexing.

Solves for

Build a persistent structural index of my codebase that survives across sessionsDetect which files have actually changed since the last analysis to avoid redundant parsingExtract function signatures, class hierarchies, and import dependencies from source codeSupport multiple programming languages in a single unified graph representation

Best for

teams maintaining large codebases (10k+ files) where full re-parsing is prohibitively expensive

developers using Claude Code or similar LLM assistants who need persistent context across sessions

monorepo maintainers needing language-agnostic structural analysis

Requires

Python 3.9+

Tree-sitter library and language-specific grammar bindings (auto-installed via pyproject.toml)

Writable filesystem for graph storage (SQLite or similar backend)

Limitations

Tree-sitter parsing accuracy varies by language maturity; less mature grammars may miss edge cases in dynamic or metaprogramming-heavy code

SHA-256 change detection is file-level only—moving or renaming files without content changes may trigger unnecessary re-parsing

Graph construction time scales linearly with codebase size; initial indexing of 100k+ file monorepos may take minutes

What makes it unique

Uses Tree-sitter AST parsing with SHA-256 incremental tracking instead of regex or line-based analysis, enabling structural awareness across 40+ languages while avoiding redundant re-parsing of unchanged files. The incremental update system (diagram 4) tracks file hashes to determine which entities need re-extraction, reducing indexing time from O(n) to O(delta) for large codebases.

vs alternatives

Faster and more accurate than LSP-based indexing for offline analysis because it maintains a persistent graph that survives session boundaries and doesn't require a running language server per language.

blast-radius impact analysis with dependency graph traversal

Medium confidence

When a file changes, the system traces the directed graph to identify all potentially affected code entities—callers, dependents, inheritors, and tests. This 'blast radius' computation uses graph traversal algorithms (BFS/DFS) to walk the CALLS, IMPORTS_FROM, INHERITS, DEPENDS_ON, and TESTED_BY edges, producing a minimal set of files and functions that Claude must review. The system excludes irrelevant files from context, reducing token consumption by 6.8x to 49x depending on repository structure and change scope.

Solves for

Determine which parts of my codebase could be affected by a specific code changeExclude irrelevant files from code review context to reduce LLM token consumptionIdentify all tests that exercise a modified function or classUnderstand the full dependency chain for a file change in a monorepo

Best for

code review workflows where context window is limited (e.g., Claude Code with 200k token limit)

large monorepos (Next.js, Kubernetes, etc.) where naive full-file inclusion would exceed token budgets

teams needing to understand change impact before merging pull requests

Requires

Pre-built knowledge graph from the incremental parsing capability

Graph database or in-memory representation with edge traversal support

File change information (git diff, file paths, or modification timestamps)

Limitations

Blast radius is conservative—it may include false positives if the graph contains indirect or transitive dependencies that don't actually affect behavior

Dynamic imports, reflection, or metaprogramming patterns may not be captured in the static graph, leading to incomplete blast radius computation

Graph traversal time scales with the number of edges; highly interconnected codebases may require seconds to compute blast radius for a single change

What makes it unique

Implements graph-based blast radius computation (diagram 3) that traces structural dependencies to identify affected code, rather than heuristic-based approaches like 'files in the same directory' or 'files modified in the same commit'. The system achieves 49x token reduction on monorepos by excluding 27,000+ irrelevant files from review context.

vs alternatives

More precise than git-based impact analysis (which only tracks file co-modification history) because it understands actual code dependencies and can exclude files that changed together but don't affect each other.

evaluation framework with benchmark metrics and token reduction reporting

Medium confidence

Includes an automated evaluation framework (`code-review-graph eval --all`) that benchmarks the tool against real open-source repositories, measuring token reduction, impact analysis accuracy, and query performance. The framework compares naive full-file context inclusion against graph-optimized context, reporting metrics like average token reduction (8.2x across tested repos, up to 49x on monorepos), precision/recall of blast radius analysis, and query latency. Results are aggregated and visualized in benchmark reports, enabling teams to understand the expected token savings for their codebase.

Solves for

Measure token reduction achieved by code-review-graph on my codebaseBenchmark graph query performance and impact analysis accuracyUnderstand expected cost savings from using graph-optimized contextValidate that the tool is working correctly on my code

Best for

teams evaluating code-review-graph before adoption

organizations tracking LLM API cost savings

developers validating tool performance on their specific codebase

Requires

Python 3.9+ with code-review-graph installed

Real codebase or test repositories for evaluation

Optional: API keys for embedding services if semantic search is enabled

Limitations

Evaluation framework requires running against real repositories; results may not generalize to other codebases with different structures

Token reduction metrics are specific to Claude and may vary for other LLM assistants

Evaluation requires API calls to embedding services (if semantic search is enabled), adding cost and latency

What makes it unique

Includes an automated evaluation framework that benchmarks token reduction against real open-source repositories, reporting metrics like 8.2x average reduction and up to 49x on monorepos. The framework enables teams to understand expected cost savings and validate tool performance on their specific codebase.

vs alternatives

More rigorous than anecdotal claims because it provides quantified metrics from real repositories and enables teams to measure performance on their own code, rather than relying on vendor claims.

graph storage and persistence with sqlite backend

Medium confidence

Persists the knowledge graph to a local SQLite database, enabling the graph to survive across sessions and be queried without re-parsing the entire codebase. The storage layer maintains tables for nodes (entities), edges (relationships), and metadata, with indexes optimized for common query patterns (entity lookup, relationship traversal, impact analysis). The SQLite backend is lightweight, requires no external services, and supports concurrent read access, making it suitable for local development workflows and CI/CD integration.

Solves for

Store the knowledge graph persistently so it survives across sessionsQuery the graph without re-parsing the entire codebaseEnable fast graph access for code review and analysis tasksSupport concurrent read access from multiple tools (CLI, MCP server, VS Code extension)

Best for

local development workflows where the graph must persist across sessions

teams without access to external database services

developers who want a lightweight, zero-configuration storage solution

Requires

Python 3.9+ with code-review-graph installed

Writable filesystem for SQLite database file

Sufficient disk space for graph storage (typically 10-100MB per 10k files)

Limitations

SQLite is single-writer, multiple-reader; concurrent write access may cause locking issues

SQLite performance degrades with very large graphs (100k+ nodes); query latency may increase

No built-in replication or backup; graph loss requires manual recovery

What makes it unique

Uses SQLite as a lightweight, zero-configuration graph storage backend with indexes optimized for common query patterns (entity lookup, relationship traversal, impact analysis). The storage layer supports concurrent read access and requires no external services.

vs alternatives

Simpler than cloud-based graph databases (Neo4j, ArangoDB) because it requires no external services or configuration, making it suitable for local development and CI/CD pipelines.

mcp server integration with claude code and llm assistants

Medium confidence

Exposes the knowledge graph as an MCP (Model Context Protocol) server that Claude Code and other LLM assistants can query via standardized tool calls. The MCP server implements a set of tools (graph management, query, impact analysis, review context, semantic search, utility, and advanced analysis tools) that allow Claude to request only the relevant code context for a task instead of re-reading entire files. Integration is bidirectional: Claude sends queries (e.g., 'what functions call this one?'), and the MCP server returns structured graph results that fit within token budgets.

Solves for

Enable Claude Code to query my codebase structure without re-reading entire filesProvide Claude with precise function signatures, class hierarchies, and dependency informationAllow Claude to understand code impact before suggesting changesIntegrate code-review-graph seamlessly into Claude's existing workflow without manual context pasting

Best for

Claude Code users who want to reduce token consumption on code reviews and refactoring tasks

teams using Claude as their primary LLM assistant for development

developers who want to maintain a persistent codebase index that Claude can query across sessions

Requires

Python 3.9+ with code-review-graph installed

Claude Code or another MCP-compatible LLM client

Pre-built knowledge graph from the incremental parsing capability

Limitations

MCP server must be running locally or accessible over the network; no cloud-hosted version reduces latency

Claude's tool-calling latency adds ~100-200ms per query; rapid iterative queries may feel slower than direct file reading

MCP tool results are limited by Claude's context window; very large blast radius results may still exceed token budgets

What makes it unique

Implements MCP server with a comprehensive tool suite (graph management, query, impact analysis, review context, semantic search, utility, and advanced analysis tools) that allows Claude to query the knowledge graph directly rather than relying on manual context injection. The MCP integration is bidirectional—Claude can request specific code context and receive only what's needed.

vs alternatives

More efficient than context injection (copy-pasting code into Claude) because the MCP server can return only the relevant subgraph, and Claude can make follow-up queries without re-reading the entire codebase.

semantic search and embedding-based code retrieval

Medium confidence

Generates embeddings for code entities (functions, classes, documentation) and stores them in a vector index, enabling semantic search queries like 'find functions that handle authentication' or 'locate all database connection logic'. The system uses embedding models (likely OpenAI or similar) to convert code and natural language queries into vector space, then performs similarity search to retrieve relevant code entities without requiring exact keyword matches. Results are ranked by semantic relevance and integrated into the MCP tool suite for Claude to query.

Solves for

Find related code by semantic meaning rather than exact keyword matchingLocate all authentication-related functions across a large codebaseDiscover similar patterns or implementations for referenceAugment code review context with semantically relevant code snippets

Best for

large codebases (10k+ files) where keyword search is insufficient

teams with diverse code patterns and naming conventions

developers exploring unfamiliar codebases and needing semantic guidance

Requires

Pre-built knowledge graph with code entities

Embedding model API key (OpenAI, Anthropic, or local embedding service)

Vector database or index (e.g., Pinecone, Weaviate, or local FAISS)

Limitations

Embedding generation requires API calls to external services (OpenAI, etc.), adding latency and cost per indexed entity

Semantic search quality depends on embedding model quality; poor embeddings lead to irrelevant results

Vector index must be updated when code changes; incremental embedding updates may lag behind code changes

What makes it unique

Integrates semantic search into the MCP tool suite, allowing Claude to discover code by meaning rather than keyword matching. The system generates embeddings for code entities and maintains a vector index that supports similarity queries, enabling Claude to find related code patterns without explicit keyword searches.

vs alternatives

More effective than regex or keyword-based search for discovering related code patterns because it understands semantic relationships (e.g., 'authentication' and 'login' are related even if they don't share keywords).

watch mode with auto-update hooks for continuous graph synchronization

Medium confidence

Monitors the filesystem for code changes (via file watchers or git hooks) and automatically triggers incremental graph updates without manual intervention. When files are modified, the system detects changes via SHA-256 hashing, re-parses only affected files, and updates the knowledge graph in real-time. Auto-update hooks integrate with git workflows (pre-commit, post-commit) to keep the graph synchronized with the working directory, ensuring Claude always has current structural information.

Solves for

Keep the knowledge graph synchronized with my codebase as I make changesAvoid manual graph rebuild commands during active developmentEnsure Claude has current code structure information for real-time code reviewIntegrate graph updates into my git workflow automatically

Best for

developers working in active development cycles who need real-time graph updates

teams using git-based workflows with pre-commit or post-commit hooks

continuous integration pipelines where graph updates should happen automatically

Requires

Python 3.9+ with code-review-graph installed

File watcher library (watchdog or similar, auto-installed)

Writable filesystem for graph storage

Limitations

File watchers add background process overhead; may impact system performance on very large codebases

Watch mode requires the process to remain running; stopping the process halts graph synchronization

Git hooks add latency to commit operations; very large commits may trigger slow graph updates

What makes it unique

Implements filesystem-level watch mode with git hook integration (diagram 4) that automatically triggers incremental graph updates without manual intervention. The system uses SHA-256 change detection to identify modified files and re-parses only those files, keeping the graph synchronized in real-time.

vs alternatives

More convenient than manual graph rebuild commands because it runs continuously in the background and integrates with git workflows, ensuring the graph is always current without developer action.

code review context generation with token-optimized summaries

Medium confidence

Generates concise, token-optimized summaries of code changes and their context by combining blast radius analysis with semantic search. Instead of sending entire files to Claude, the system produces structured summaries that include: changed code snippets, affected functions/classes, test coverage, and related code patterns. The summaries are designed to fit within Claude's context window while providing sufficient information for accurate code review, achieving 6.8x to 49x token reduction compared to naive full-file inclusion.

Solves for

Generate code review context that fits within Claude's token budgetSummarize code changes with their impact and related code patternsProvide Claude with just enough context to understand a change without overwhelming detailOptimize token consumption for cost-sensitive code review workflows

Best for

teams using Claude Code for code reviews with limited token budgets

large monorepos where full-file context would exceed token limits

cost-conscious teams optimizing LLM API spending

Requires

Pre-built knowledge graph with blast radius and semantic search capabilities

Code change information (git diff, file paths, or modification timestamps)

Claude Code or another MCP-compatible LLM client

Limitations

Summary generation may omit important context if the blast radius is conservative or semantic search misses related code

Token reduction is variable (6.8x to 49x) depending on repository structure and change scope; small changes in tightly-coupled code may not benefit as much

Summaries are optimized for Claude; other LLM assistants may require different formatting or context structure

What makes it unique

Combines blast radius analysis with semantic search to generate token-optimized code review context that includes changed code, affected entities, and related patterns. The system achieves 6.8x to 49x token reduction by excluding irrelevant files and providing structured summaries instead of full-file context.

vs alternatives

More efficient than sending entire changed files to Claude because it uses graph-based impact analysis to identify only the relevant code and semantic search to find related patterns, resulting in significantly lower token consumption.

vs code extension with graph visualization and interactive exploration

Medium confidence

Provides a VS Code extension that visualizes the knowledge graph as an interactive diagram, allowing developers to explore code structure, dependencies, and impact analysis results directly in the editor. The extension displays nodes (files, functions, classes) and edges (relationships) in a visual format, supports filtering and search, and integrates with VS Code's code navigation (go-to-definition, find-references). Developers can click on entities to view details, trace dependencies, and understand code structure without leaving the editor.

Solves for

Visualize my codebase structure as an interactive graph in VS CodeExplore code dependencies and relationships visuallyUnderstand blast radius impact by seeing affected code highlightedNavigate code structure more intuitively than text-based search

Best for

developers who prefer visual exploration of code structure

teams onboarding new developers who need to understand codebase architecture

developers debugging complex dependency chains

Requires

VS Code 1.80+

code-review-graph extension installed from VS Code marketplace

Pre-built knowledge graph from the incremental parsing capability

Limitations

Graph visualization performance degrades with very large graphs (10k+ nodes); rendering may be slow or unresponsive

VS Code extension API limitations may prevent full-featured graph visualization (e.g., real-time layout updates)

Graph filtering and search features may be limited compared to dedicated graph visualization tools

What makes it unique

Integrates graph visualization directly into VS Code as an extension, allowing developers to explore code structure interactively without leaving the editor. The extension supports filtering, search, and integration with VS Code's code navigation features.

vs alternatives

More integrated than standalone graph visualization tools because it runs within VS Code and connects to the editor's code navigation, allowing developers to jump directly to code definitions from the graph.

multi-language support with language-agnostic graph schema

Medium confidence

Supports parsing and indexing code in 40+ programming languages (Python, JavaScript, TypeScript, Go, Rust, Java, C++, etc.) using language-specific Tree-sitter grammars, while maintaining a unified, language-agnostic graph schema. All languages are represented with the same node types (File, Class, Function, Type, Test) and edge types (CALLS, IMPORTS_FROM, INHERITS, CONTAINS, TESTED_BY, DEPENDS_ON), enabling cross-language dependency analysis in monorepos. The parser system automatically detects language based on file extension and applies the appropriate grammar.

Solves for

Index and analyze code in multiple programming languages in a single graphUnderstand dependencies across language boundaries in polyglot monoreposPerform cross-language impact analysis for changes that affect multiple languagesMaintain a unified codebase index regardless of language diversity

Best for

polyglot monorepos with code in multiple languages (e.g., backend in Go, frontend in TypeScript, scripts in Python)

teams migrating between languages who need to understand cross-language dependencies

organizations with diverse technology stacks

Requires

Python 3.9+ with code-review-graph installed

Tree-sitter language grammars for each language (auto-installed via pyproject.toml)

Source code files in supported languages

Limitations

Language support quality varies; mature languages (Python, JavaScript) have better Tree-sitter grammars than newer languages

Cross-language dependency analysis is limited to explicit imports/calls; implicit dependencies (e.g., REST API contracts) are not captured

Language-specific features (e.g., Python decorators, TypeScript generics) may not be fully represented in the unified schema

What makes it unique

Maintains a unified, language-agnostic graph schema across 40+ languages using Tree-sitter grammars, enabling cross-language dependency analysis in polyglot monorepos. All languages are represented with the same node and edge types, allowing consistent impact analysis regardless of language mix.

vs alternatives

More comprehensive than language-specific tools because it supports multiple languages in a single graph and enables cross-language dependency analysis, whereas most tools focus on a single language.

incremental graph update system with delta computation

Medium confidence

Implements an incremental update mechanism that computes the delta between the current codebase state and the previous indexed state, then applies only the necessary graph updates. The system uses SHA-256 file hashing to detect changes, identifies which entities (functions, classes, imports) were added, modified, or deleted, and updates only the affected graph nodes and edges. This delta-based approach reduces update time from O(n) to O(delta), where delta is the number of changed entities, enabling fast graph synchronization even in large codebases.

Solves for

Update the knowledge graph quickly after code changes without full re-indexingMinimize graph update latency during active developmentReduce computational overhead of keeping the graph synchronizedSupport real-time graph updates in watch mode

Best for

developers in active development cycles who need fast graph updates

large codebases where full re-indexing would be prohibitively slow

continuous integration pipelines where graph updates must complete quickly

Requires

Pre-built knowledge graph with entity tracking

File change detection mechanism (SHA-256 hashing or file modification timestamps)

Graph database with support for selective node/edge updates

Limitations

Delta computation assumes file-level change detection; moving or renaming files without content changes may trigger unnecessary updates

Incremental updates may miss indirect effects of changes (e.g., if a function signature changes, all callers may need re-analysis but may not be detected as 'changed')

Graph consistency must be maintained during incremental updates; concurrent updates may cause inconsistencies

What makes it unique

Implements delta-based incremental updates (diagram 4) that compute the difference between current and previous codebase states, then apply only necessary graph changes. The system uses SHA-256 hashing to detect file changes and identifies which entities were added/modified/deleted, reducing update time from O(n) to O(delta).

vs alternatives

Faster than full re-indexing because it only re-parses changed files and updates affected graph nodes, whereas naive approaches would re-parse the entire codebase on every change.

cli command suite for graph management and analysis

Medium confidence

Provides a comprehensive command-line interface for building, querying, and analyzing the knowledge graph. Commands include: `build` (initial graph construction), `update` (incremental updates), `query` (graph traversal and entity lookup), `impact` (blast radius analysis), `review` (code review context generation), `watch` (continuous synchronization), `visualize` (graph export for visualization), and `eval` (evaluation and benchmarking). The CLI is designed for both interactive use and integration into CI/CD pipelines, with structured output formats (JSON, YAML) for programmatic consumption.

Solves for

Build and maintain the knowledge graph from the command lineQuery the graph for specific entities or relationshipsAnalyze code impact and dependencies programmaticallyIntegrate graph operations into CI/CD pipelines and automation scripts+1 more

Best for

developers who prefer command-line tools over GUI interfaces

CI/CD pipelines that need to build and update graphs automatically

teams integrating code-review-graph into custom workflows

Requires

Python 3.9+ with code-review-graph installed

Command-line shell (bash, zsh, PowerShell, etc.)

Optional: jq or other JSON processing tools for output parsing

Limitations

CLI output is text-based; complex graph visualization requires additional tools or export to external formats

Some advanced features may be easier to use through the VS Code extension or MCP interface than CLI

CLI requires Python environment setup; less accessible to non-technical team members

What makes it unique

Provides a comprehensive CLI suite with commands for graph building, querying, impact analysis, review context generation, and evaluation. The CLI supports structured output formats (JSON, YAML) for programmatic consumption and integration into CI/CD pipelines.

vs alternatives

More flexible than GUI-only tools because it supports scripting, automation, and integration into existing workflows, while also being accessible to developers who prefer command-line interfaces.

Capabilities are decomposed by AI analysis. Each maps to specific user intents and improves with match feedback.

Related Artifactssharing capabilities

Artifacts that share capabilities with code-review-graph, ranked by overlap. Discovered automatically through the match graph.

Repository26

Scaffold

** - Scaffold is a Retrieval-Augmented Generation (RAG) system designed to structural understanding of large codebases. It transforms your source code into a living knowledge graph, allowing for precise, context-aware interactions that go far beyond simple file retrieval.

incremental codebase indexing with change detectionmulti-language source code parsing with ast extractiondependency graph analysis and impact assessment

3 shared capabilities

MCP Server49

repomix

📦 Repomix is a powerful tool that packs your entire repository into a single, AI-friendly file. Perfect for when you need to feed your codebase to Large Language Models (LLMs) or other AI tools like Claude, ChatGPT, DeepSeek, Perplexity, Gemini, Gemma, Llama, Grok, and more.

tree-sitter-based code compression and comment strippingmulti-format codebase packaging with llm-optimized output

2 shared capabilities

MCP Server41

codebase-memory-mcp

High-performance code intelligence MCP server. Indexes codebases into a persistent knowledge graph — average repo in milliseconds. 66 languages, sub-ms queries, 99% fewer tokens. Single static binary, zero dependencies.

multi-language ast parsing and entity extraction with tree-sitterincremental reindexing with content-hash change detection

2 shared capabilities

Product17

Second

Automated migrations and upgrades for your code

automated dependency version migration with ast-aware refactoringbatch codebase analysis and impact assessment before migration

2 shared capabilities

MCP Server22

Sourcerer

** - MCP for semantic code search & navigation that reduces token waste

tree-sitter based code parsing and semantic chunking

1 shared capability

Extension36

Claude 4, DeepSeek R1, ChatGPT, Copilot, Cursor AI and Cline, AI Agents, AI Copilot, and Debugger, Code Assistants, Code Chat, Code Completion, Code Generator, Autocomplete, Codestral, Generative AI

Bugzi: Multi-Agent AI and Code Scanning. Your AI Partner for Development. Bugzi is a powerful AI assistant that seamlessly integrates into your VS Code workflow, designed to enhance productivity and streamline your entire development process. While Bugzi includes a realtime security scanner to prote

project-scope-code-analysis

1 shared capability

Best For

✓teams maintaining large codebases (10k+ files) where full re-parsing is prohibitively expensive
✓developers using Claude Code or similar LLM assistants who need persistent context across sessions
✓monorepo maintainers needing language-agnostic structural analysis
✓code review workflows where context window is limited (e.g., Claude Code with 200k token limit)
✓large monorepos (Next.js, Kubernetes, etc.) where naive full-file inclusion would exceed token budgets
✓teams needing to understand change impact before merging pull requests
✓developers optimizing LLM API costs by minimizing token consumption per review
✓teams evaluating code-review-graph before adoption

Known Limitations

⚠Tree-sitter parsing accuracy varies by language maturity; less mature grammars may miss edge cases in dynamic or metaprogramming-heavy code
⚠SHA-256 change detection is file-level only—moving or renaming files without content changes may trigger unnecessary re-parsing
⚠Graph construction time scales linearly with codebase size; initial indexing of 100k+ file monorepos may take minutes
⚠No built-in support for generated code or transpiled outputs; requires manual configuration to exclude build artifacts
⚠Blast radius is conservative—it may include false positives if the graph contains indirect or transitive dependencies that don't actually affect behavior
⚠Dynamic imports, reflection, or metaprogramming patterns may not be captured in the static graph, leading to incomplete blast radius computation

Requirements

Python 3.9+Tree-sitter library and language-specific grammar bindings (auto-installed via pyproject.toml)Writable filesystem for graph storage (SQLite or similar backend)Source code files accessible locally (no remote repository streaming)Pre-built knowledge graph from the incremental parsing capabilityGraph database or in-memory representation with edge traversal supportFile change information (git diff, file paths, or modification timestamps)Python 3.9+ with code-review-graph installed

Input / Output

Accepts: source code files (Python, JavaScript, TypeScript, Go, Rust, Java, C++, etc.), file paths and directory structures, git diff or file modification timestamps, changed file paths or git diff output, graph node identifiers (file, function, class names), codebase paths or repository URLs, evaluation configuration (which metrics to measure), graph nodes and edges from the parser system, query parameters (entity names, relationship types), MCP tool calls with structured parameters (file paths, function names, query types), natural language queries from Claude (translated to tool calls by Claude's reasoning), natural language queries (e.g., 'authentication logic'), code snippets for similarity search, entity identifiers (function names, class names), filesystem events (file creation, modification, deletion), git hook triggers (pre-commit, post-commit), git diff or file change information, file paths of modified code, optional: natural language context or review guidelines, graph node and edge data from the knowledge graph, user interactions (clicks, filters, search queries), source code files in any of 40+ supported languages, file paths with language-specific extensions, file change information (paths, modification timestamps, or git diff), previous graph state, command-line arguments and flags

Produces: directed graph nodes (File, Class, Function, Type, Test entities), graph edges (relationship tuples with metadata), structured entity metadata (signatures, line numbers, docstrings), set of affected file paths, set of affected function/class identifiers, graph subsets (nodes and edges relevant to the change), impact metrics (number of affected entities, estimated token reduction), benchmark reports with token reduction metrics, impact analysis accuracy metrics (precision, recall), query performance statistics, visualized benchmark results, persisted graph data in SQLite format, query results from graph traversal, structured JSON responses with graph entities and relationships, code snippets and summaries, impact analysis results, semantic search results, ranked list of semantically similar code entities, relevance scores (0-1), code snippets and metadata for retrieved entities, updated knowledge graph, log messages indicating which files were re-parsed, graph synchronization status, structured code review context (changed code, affected entities, related patterns), token count estimates, impact analysis summary, test coverage information, interactive graph visualization, node details and metadata, navigation links to code locations, filtered subgraphs based on search or filter criteria, unified graph with language-agnostic node and edge types, language-specific metadata (e.g., function signatures, type annotations), delta summary (added/modified/deleted entities), update timing information, structured output (JSON, YAML, or plain text), graph statistics and metrics, query results (entity lists, relationship data), impact analysis summaries

UnfragileRank

Adoption36%(30% weight)

Quality51%(25% weight)

Ecosystem80%(25% weight)

Match Graph10%(15% weight)

Freshness75%(5% weight)

UnfragileRank is computed from adoption signals, documentation quality, ecosystem connectivity, match graph feedback, and freshness. No artifact can pay for a higher rank.

Type: MCP Server

12 capabilities

Visit code-review-graph→

Repository Details

12,361

Stars

1,366

Forks

Python

Language

MIT

License

Topics

ai-codingclaudeclaude-codecode-reviewgraphragincrementalknowledge-graphllmmcppythonstatic-analysistree-sitter

Last commit: Apr 21, 2026

About

Local knowledge graph for Claude Code. Builds a persistent map of your codebase so Claude reads only what matters — 6.8× fewer tokens on reviews and up to 49× on daily coding tasks.

Alternatives to code-review-graph

IntelliCode50Extension

AI-assisted development

Compare →

GitHub Copilot Chat53Extension

AI chat features powered by Copilot

Compare →

GitHub Copilot52Extension

Your AI pair programmer

Compare →

Claude Code for VS Code52Extension

Claude Code for VS Code: Harness the power of Claude Code without leaving your IDE

Compare →

Are you the builder of code-review-graph?

Claim this artifact to get a verified badge, access match analytics, see which intents users search for, and manage your listing.

Claim this artifact →Verification via email

Get the weekly brief

New tools, rising stars, and what's actually worth your time. No spam.

Data Sources

github

Looking for something else?

Search →

Capabilities12 decomposed

tree-sitter-based incremental codebase parsing with sha-256 change tracking

Medium confidence

Solves for

Best for

teams maintaining large codebases (10k+ files) where full re-parsing is prohibitively expensive

developers using Claude Code or similar LLM assistants who need persistent context across sessions

monorepo maintainers needing language-agnostic structural analysis

Requires

Python 3.9+

Tree-sitter library and language-specific grammar bindings (auto-installed via pyproject.toml)

Writable filesystem for graph storage (SQLite or similar backend)

Limitations

Tree-sitter parsing accuracy varies by language maturity; less mature grammars may miss edge cases in dynamic or metaprogramming-heavy code

SHA-256 change detection is file-level only—moving or renaming files without content changes may trigger unnecessary re-parsing

Graph construction time scales linearly with codebase size; initial indexing of 100k+ file monorepos may take minutes

What makes it unique

vs alternatives

blast-radius impact analysis with dependency graph traversal

Medium confidence

Solves for

Best for

code review workflows where context window is limited (e.g., Claude Code with 200k token limit)

large monorepos (Next.js, Kubernetes, etc.) where naive full-file inclusion would exceed token budgets

teams needing to understand change impact before merging pull requests

Requires

Pre-built knowledge graph from the incremental parsing capability

Graph database or in-memory representation with edge traversal support

File change information (git diff, file paths, or modification timestamps)

Limitations

Blast radius is conservative—it may include false positives if the graph contains indirect or transitive dependencies that don't actually affect behavior

Dynamic imports, reflection, or metaprogramming patterns may not be captured in the static graph, leading to incomplete blast radius computation

Graph traversal time scales with the number of edges; highly interconnected codebases may require seconds to compute blast radius for a single change

What makes it unique

vs alternatives

evaluation framework with benchmark metrics and token reduction reporting

Medium confidence

Solves for

Best for

teams evaluating code-review-graph before adoption

organizations tracking LLM API cost savings

developers validating tool performance on their specific codebase

Requires

Python 3.9+ with code-review-graph installed

Real codebase or test repositories for evaluation

Optional: API keys for embedding services if semantic search is enabled

Limitations

Evaluation framework requires running against real repositories; results may not generalize to other codebases with different structures

Token reduction metrics are specific to Claude and may vary for other LLM assistants

Evaluation requires API calls to embedding services (if semantic search is enabled), adding cost and latency

What makes it unique

vs alternatives

More rigorous than anecdotal claims because it provides quantified metrics from real repositories and enables teams to measure performance on their own code, rather than relying on vendor claims.

graph storage and persistence with sqlite backend

Medium confidence

Solves for

Best for

local development workflows where the graph must persist across sessions

teams without access to external database services

developers who want a lightweight, zero-configuration storage solution

Requires

Python 3.9+ with code-review-graph installed

Writable filesystem for SQLite database file

Sufficient disk space for graph storage (typically 10-100MB per 10k files)

Limitations

SQLite is single-writer, multiple-reader; concurrent write access may cause locking issues

SQLite performance degrades with very large graphs (100k+ nodes); query latency may increase

No built-in replication or backup; graph loss requires manual recovery

What makes it unique

vs alternatives

Simpler than cloud-based graph databases (Neo4j, ArangoDB) because it requires no external services or configuration, making it suitable for local development and CI/CD pipelines.

mcp server integration with claude code and llm assistants

Medium confidence

Solves for

Best for

Claude Code users who want to reduce token consumption on code reviews and refactoring tasks

teams using Claude as their primary LLM assistant for development

developers who want to maintain a persistent codebase index that Claude can query across sessions

Requires

Python 3.9+ with code-review-graph installed

Claude Code or another MCP-compatible LLM client

Pre-built knowledge graph from the incremental parsing capability

Limitations

MCP server must be running locally or accessible over the network; no cloud-hosted version reduces latency

Claude's tool-calling latency adds ~100-200ms per query; rapid iterative queries may feel slower than direct file reading

MCP tool results are limited by Claude's context window; very large blast radius results may still exceed token budgets

What makes it unique

vs alternatives

semantic search and embedding-based code retrieval

Medium confidence

Solves for

Best for

large codebases (10k+ files) where keyword search is insufficient

teams with diverse code patterns and naming conventions

developers exploring unfamiliar codebases and needing semantic guidance

Requires

Pre-built knowledge graph with code entities

Embedding model API key (OpenAI, Anthropic, or local embedding service)

Vector database or index (e.g., Pinecone, Weaviate, or local FAISS)

Limitations

Embedding generation requires API calls to external services (OpenAI, etc.), adding latency and cost per indexed entity

Semantic search quality depends on embedding model quality; poor embeddings lead to irrelevant results

Vector index must be updated when code changes; incremental embedding updates may lag behind code changes

What makes it unique

vs alternatives

watch mode with auto-update hooks for continuous graph synchronization

Medium confidence

Solves for

Best for

developers working in active development cycles who need real-time graph updates

teams using git-based workflows with pre-commit or post-commit hooks

continuous integration pipelines where graph updates should happen automatically

Requires

Python 3.9+ with code-review-graph installed

File watcher library (watchdog or similar, auto-installed)

Writable filesystem for graph storage

Limitations

File watchers add background process overhead; may impact system performance on very large codebases

Watch mode requires the process to remain running; stopping the process halts graph synchronization

Git hooks add latency to commit operations; very large commits may trigger slow graph updates

What makes it unique

vs alternatives

More convenient than manual graph rebuild commands because it runs continuously in the background and integrates with git workflows, ensuring the graph is always current without developer action.

code review context generation with token-optimized summaries

Medium confidence

Solves for

Best for

teams using Claude Code for code reviews with limited token budgets

large monorepos where full-file context would exceed token limits

cost-conscious teams optimizing LLM API spending

Requires

Pre-built knowledge graph with blast radius and semantic search capabilities

Code change information (git diff, file paths, or modification timestamps)

Claude Code or another MCP-compatible LLM client

Limitations

Summary generation may omit important context if the blast radius is conservative or semantic search misses related code

Token reduction is variable (6.8x to 49x) depending on repository structure and change scope; small changes in tightly-coupled code may not benefit as much

Summaries are optimized for Claude; other LLM assistants may require different formatting or context structure

What makes it unique

vs alternatives

vs code extension with graph visualization and interactive exploration

Medium confidence

Solves for

Best for

developers who prefer visual exploration of code structure

teams onboarding new developers who need to understand codebase architecture

developers debugging complex dependency chains

Requires

VS Code 1.80+

code-review-graph extension installed from VS Code marketplace

Pre-built knowledge graph from the incremental parsing capability

Limitations

Graph visualization performance degrades with very large graphs (10k+ nodes); rendering may be slow or unresponsive

VS Code extension API limitations may prevent full-featured graph visualization (e.g., real-time layout updates)

Graph filtering and search features may be limited compared to dedicated graph visualization tools

What makes it unique

vs alternatives

multi-language support with language-agnostic graph schema

Medium confidence

Solves for

Best for

polyglot monorepos with code in multiple languages (e.g., backend in Go, frontend in TypeScript, scripts in Python)

teams migrating between languages who need to understand cross-language dependencies

organizations with diverse technology stacks

Requires

Python 3.9+ with code-review-graph installed

Tree-sitter language grammars for each language (auto-installed via pyproject.toml)

Source code files in supported languages

Limitations

Language support quality varies; mature languages (Python, JavaScript) have better Tree-sitter grammars than newer languages

Cross-language dependency analysis is limited to explicit imports/calls; implicit dependencies (e.g., REST API contracts) are not captured

Language-specific features (e.g., Python decorators, TypeScript generics) may not be fully represented in the unified schema

What makes it unique

vs alternatives

More comprehensive than language-specific tools because it supports multiple languages in a single graph and enables cross-language dependency analysis, whereas most tools focus on a single language.

incremental graph update system with delta computation

Medium confidence

Solves for

Best for

developers in active development cycles who need fast graph updates

large codebases where full re-indexing would be prohibitively slow

continuous integration pipelines where graph updates must complete quickly

Requires

Pre-built knowledge graph with entity tracking

File change detection mechanism (SHA-256 hashing or file modification timestamps)

Graph database with support for selective node/edge updates

Limitations

Delta computation assumes file-level change detection; moving or renaming files without content changes may trigger unnecessary updates

Incremental updates may miss indirect effects of changes (e.g., if a function signature changes, all callers may need re-analysis but may not be detected as 'changed')

Graph consistency must be maintained during incremental updates; concurrent updates may cause inconsistencies

What makes it unique

vs alternatives

Faster than full re-indexing because it only re-parses changed files and updates affected graph nodes, whereas naive approaches would re-parse the entire codebase on every change.

cli command suite for graph management and analysis

Medium confidence

Solves for

Best for

developers who prefer command-line tools over GUI interfaces

CI/CD pipelines that need to build and update graphs automatically

teams integrating code-review-graph into custom workflows

Requires

Python 3.9+ with code-review-graph installed

Command-line shell (bash, zsh, PowerShell, etc.)

Optional: jq or other JSON processing tools for output parsing

Limitations

CLI output is text-based; complex graph visualization requires additional tools or export to external formats

Some advanced features may be easier to use through the VS Code extension or MCP interface than CLI

CLI requires Python environment setup; less accessible to non-technical team members

What makes it unique

vs alternatives

More flexible than GUI-only tools because it supports scripting, automation, and integration into existing workflows, while also being accessible to developers who prefer command-line interfaces.

Capabilities are decomposed by AI analysis. Each maps to specific user intents and improves with match feedback.

Alternatives to code-review-graph

IntelliCode50Extension

AI-assisted development

Compare →

GitHub Copilot Chat53Extension

AI chat features powered by Copilot

Compare →

GitHub Copilot52Extension

Your AI pair programmer

Compare →

Claude Code for VS Code52Extension

Claude Code for VS Code: Harness the power of Claude Code without leaving your IDE

Compare →

code-review-graph

Capabilities12 decomposed

tree-sitter-based incremental codebase parsing with sha-256 change tracking

blast-radius impact analysis with dependency graph traversal

evaluation framework with benchmark metrics and token reduction reporting

graph storage and persistence with sqlite backend

mcp server integration with claude code and llm assistants

semantic search and embedding-based code retrieval

watch mode with auto-update hooks for continuous graph synchronization

code review context generation with token-optimized summaries

vs code extension with graph visualization and interactive exploration

multi-language support with language-agnostic graph schema

incremental graph update system with delta computation

cli command suite for graph management and analysis

Related Artifactssharing capabilities

Scaffold

repomix

codebase-memory-mcp

Second

Sourcerer

Claude 4, DeepSeek R1, ChatGPT, Copilot, Cursor AI and Cline, AI Agents, AI Copilot, and Debugger, Code Assistants, Code Chat, Code Completion, Code Generator, Autocomplete, Codestral, Generative AI

Best For

Known Limitations

Requirements

Input / Output

UnfragileRank

Repository Details

About

Categories

Alternatives to code-review-graph

Are you the builder of code-review-graph?

Get the weekly brief

Data Sources

code-review-graph

Capabilities12 decomposed

tree-sitter-based incremental codebase parsing with sha-256 change tracking

blast-radius impact analysis with dependency graph traversal

evaluation framework with benchmark metrics and token reduction reporting

graph storage and persistence with sqlite backend

mcp server integration with claude code and llm assistants

semantic search and embedding-based code retrieval

watch mode with auto-update hooks for continuous graph synchronization

code review context generation with token-optimized summaries

vs code extension with graph visualization and interactive exploration

multi-language support with language-agnostic graph schema

incremental graph update system with delta computation

cli command suite for graph management and analysis

Related Artifactssharing capabilities

Scaffold

repomix

codebase-memory-mcp

Second

Sourcerer

Claude 4, DeepSeek R1, ChatGPT, Copilot, Cursor AI and Cline, AI Agents, AI Copilot, and Debugger, Code Assistants, Code Chat, Code Completion, Code Generator, Autocomplete, Codestral, Generative AI

Best For

Known Limitations

Requirements

Input / Output

UnfragileRank

Repository Details

About

Categories

Alternatives to code-review-graph

Are you the builder of code-review-graph?

Get the weekly brief

Data Sources