Rebuff vs code-review-graph — Comparison | Unfragile

Rebuff vs code-review-graph

Side-by-side comparison to help you choose.

Rebuff

Framework

/ 100

Free

code-review-graph

MCP Server

/ 100

Free

Feature	Rebuff	code-review-graph
Type	Framework	MCP Server
UnfragileRank	43/100	49/100
Adoption	1	0
Quality	0	1
Ecosystem

Rebuff Capabilities

multi-layered heuristic prompt injection detection

Analyzes incoming prompts using fast, pattern-based rules to detect common prompt injection attack signatures (keywords, structural patterns, encoding tricks). Operates as the first defense layer before LLM-based detection, using configurable keyword lists and regex-based pattern matching to identify malicious intent without requiring model inference. Returns a heuristic score that can be compared against a configurable threshold to block suspicious inputs.

Unique: Implements defense-in-depth as first layer with configurable keyword and pattern registries, allowing teams to customize detection rules without retraining models. Uses strategy pattern to enable/disable heuristic tactics independently from other detection layers.

vs alternatives: Faster than LLM-only detection (no inference latency) and more transparent than black-box ML approaches, but less semantically sophisticated than LLM-based detection alone

llm-based semantic prompt injection detection

Delegates prompt injection detection to a dedicated language model that analyzes user input semantically to identify malicious intent, jailbreak attempts, and instruction-override attacks. The SDK abstracts the LLM backend (OpenAI, Anthropic, local models via Ollama) and returns a detection score based on the model's confidence in identifying an attack. This layer captures sophisticated, context-aware attacks that simple heuristics miss.

Unique: Abstracts LLM provider selection via strategy pattern, supporting OpenAI, Anthropic, and local Ollama models with unified interface. Configurable thresholds per provider allow tuning sensitivity based on model capabilities and false-positive tolerance.

vs alternatives: More semantically accurate than heuristics but slower; unlike static rule-based systems, adapts to new attack patterns without code changes, though still vulnerable to adversarial prompts targeting the detection model itself

incident logging and attack pattern learning loop

Provides APIs to log detected attacks (especially canary token leaks) to the vector database, enabling the system to learn from incidents and improve future detection. When isCanaryWordLeaked() detects a leak, the application can call logAttack() to store the attack input and metadata, which gets embedded and added to the vector database. This creates a feedback loop where each incident improves detection of similar future attacks.

Unique: Implements closed-loop learning: detected attacks (especially canary token leaks) are automatically logged to vector database, improving future detection without manual curation. Metadata logging enables forensic analysis and trend tracking.

vs alternatives: Enables continuous improvement of detection over time, unlike static rule-based or pre-trained model approaches; requires operational discipline to sanitize sensitive data before logging

per-tactic detection scoring and explainability

Returns detailed detection results that include individual scores from each enabled tactic (heuristic score, LLM confidence, vector similarity score) alongside the final detection decision. This enables developers to understand which tactic flagged an input and why, supporting debugging, threshold tuning, and explainability to stakeholders. Detection results include metadata like matched attack patterns from vector database or heuristic rules triggered.

Unique: Returns granular per-tactic scores and metadata (matched attack patterns, heuristic rules triggered) enabling developers to understand detection decisions at multiple levels of detail. Supports both high-level flagged boolean and detailed scoring for debugging.

vs alternatives: More transparent than black-box detection systems; enables threshold tuning and debugging unavailable in opaque approaches, though requires application-level handling of detailed results

vector similarity matching against known attack patterns

Stores embeddings of previously detected or known prompt injection attacks in a vector database (Pinecone, Supabase, or custom backends), then compares incoming prompts against this corpus using semantic similarity. When a user input's embedding exceeds a similarity threshold to known attacks, the system flags it as a potential injection. This layer learns from past incidents and enables zero-shot detection of attack variants.

Unique: Implements pluggable vector database backends (Pinecone, Supabase, custom) via abstraction layer, enabling teams to choose storage based on compliance, latency, and cost requirements. Stores attack metadata alongside embeddings for incident correlation and forensics.

vs alternatives: Learns from organizational incident history without retraining, unlike static heuristics; more scalable than maintaining curated rule lists, but requires active management of attack corpus and periodic re-embedding as threat landscape evolves

canary token injection and leak detection

Inserts randomly generated, unique canary tokens into system prompts before sending to the LLM, then monitors the model's response to detect if those tokens appear in the output. If a canary token leaks, it indicates the model has exposed its system instructions, revealing a successful prompt injection. The SDK provides addCanaryWord() to inject tokens and isCanaryWordLeaked() to check responses, enabling post-hoc detection of instruction leakage.

Unique: Generates cryptographically random, unique canary tokens per request and provides explicit APIs (addCanaryWord, isCanaryWordLeaked) for application-level integration. Enables closed-loop learning: detected leaks can be automatically logged to vector database to improve future detection.

vs alternatives: Detects successful attacks that bypass all preventive layers; unlike purely preventive approaches, provides forensic evidence of instruction exposure and enables continuous improvement through incident-driven learning

configurable multi-tactic detection strategy with threshold tuning

Implements strategy pattern to compose heuristic, LLM-based, and vector database detection tactics into a unified detection pipeline. Each tactic has an independent, configurable threshold that determines sensitivity. The SDK allows enabling/disabling tactics, adjusting thresholds per tactic, and combining scores across tactics to make a final detection decision. This architecture enables teams to tune detection sensitivity for their specific risk tolerance and false-positive budget.

Unique: Uses strategy pattern to decouple detection tactics from orchestration logic, enabling runtime composition and threshold tuning without code changes. Each tactic is independently testable and can be swapped for custom implementations.

vs alternatives: More flexible than single-method detection (heuristics-only or LLM-only); allows cost-latency-accuracy tradeoffs unavailable in monolithic approaches, though requires operational discipline to tune thresholds correctly

python sdk with synchronous and asynchronous detection apis

Provides Python bindings for Rebuff detection with both sync (detect_injection) and async (async detect_injection) methods, enabling integration into synchronous Flask/Django applications and async FastAPI/Starlette services. The SDK abstracts backend configuration (LLM provider, vector database, heuristic rules) via environment variables or constructor parameters, reducing boilerplate and enabling environment-specific configuration.

Unique: Provides both sync and async APIs with unified interface, enabling drop-in integration into existing Python frameworks. Configuration abstraction via environment variables and constructor parameters allows same code to run across dev/staging/prod with different backends.

vs alternatives: More Pythonic than REST API calls; async support enables non-blocking detection in high-throughput services, unlike synchronous-only SDKs

+4 more capabilities

code-review-graph Capabilities

tree-sitter-based incremental codebase parsing with sha-256 change tracking

Parses source code using Tree-sitter AST parsing across 40+ languages, extracting structural entities (functions, classes, types, imports) and storing them in a persistent knowledge graph. Tracks file changes via SHA-256 hashing to enable incremental updates—only re-parsing modified files rather than rescanning the entire codebase on each invocation. The parser system maintains a directed graph of code entities and their relationships (CALLS, IMPORTS_FROM, INHERITS, CONTAINS, TESTED_BY, DEPENDS_ON) without requiring full re-indexing.

Unique: Uses Tree-sitter AST parsing with SHA-256 incremental tracking instead of regex or line-based analysis, enabling structural awareness across 40+ languages while avoiding redundant re-parsing of unchanged files. The incremental update system (diagram 4) tracks file hashes to determine which entities need re-extraction, reducing indexing time from O(n) to O(delta) for large codebases.

vs alternatives: Faster and more accurate than LSP-based indexing for offline analysis because it maintains a persistent graph that survives session boundaries and doesn't require a running language server per language.

blast-radius impact analysis with dependency graph traversal

When a file changes, the system traces the directed graph to identify all potentially affected code entities—callers, dependents, inheritors, and tests. This 'blast radius' computation uses graph traversal algorithms (BFS/DFS) to walk the CALLS, IMPORTS_FROM, INHERITS, DEPENDS_ON, and TESTED_BY edges, producing a minimal set of files and functions that Claude must review. The system excludes irrelevant files from context, reducing token consumption by 6.8x to 49x depending on repository structure and change scope.

Unique: Implements graph-based blast radius computation (diagram 3) that traces structural dependencies to identify affected code, rather than heuristic-based approaches like 'files in the same directory' or 'files modified in the same commit'. The system achieves 49x token reduction on monorepos by excluding 27,000+ irrelevant files from review context.

Rebuff vs code-review-graph

Rebuff Capabilities

code-review-graph Capabilities

Verdict

Company