agentshield

MCP Server

Free

AI agent security scanner. Detect vulnerabilities in agent configurations, MCP servers, and tool permissions. Available as CLI, GitHub Action, ECC plugin, and GitHub App integration. 🛡️

Open Source


17 capabilities

Capabilities (17 decomposed)

static configuration vulnerability scanning with 102+ rule registry

Medium confidence

Discovers Claude-related configuration files (settings.json, mcp.json, CLAUDE.md) across the filesystem and runs them through a curated registry of 102+ static analysis rules organized by threat category (secrets, permissions, hooks, MCP, prompt injection). Each rule produces a Finding object with severity level, vulnerability description, and remediation steps, enabling systematic detection of misconfigurations before runtime.
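The rule-and-finding model described above can be sketched roughly as follows. The listing names a Finding object with a severity, description, and remediation, but it does not show the actual types, so every name here (`Rule`, `Finding`, `runRules`, the rule ID) is a hypothetical illustration rather than AgentShield's real API.

```typescript
// Hypothetical shapes: the listing names a "Finding object" but not its fields.
type Severity = "critical" | "high" | "medium" | "low";

interface Finding {
  ruleId: string;
  severity: Severity;
  description: string;
  remediation: string;
  file: string;
}

interface Rule {
  id: string;
  category: "secrets" | "permissions" | "hooks" | "mcp" | "prompt-injection";
  // Returns a finding when the file content violates the rule, otherwise null.
  check(file: string, content: string): Finding | null;
}

// One illustrative rule: flag a wildcard Bash permission.
const wildcardBashRule: Rule = {
  id: "permissions/wildcard-bash",
  category: "permissions",
  check(file, content) {
    if (!content.includes("Bash(*)")) return null;
    return {
      ruleId: "permissions/wildcard-bash",
      severity: "high",
      description: "Agent may run arbitrary shell commands via Bash(*).",
      remediation: "Replace Bash(*) with an explicit list of allowed commands.",
      file,
    };
  },
};

// Run every registered rule against every discovered file (path -> content).
function runRules(rules: Rule[], files: Record<string, string>): Finding[] {
  const findings: Finding[] = [];
  for (const file of Object.keys(files)) {
    for (const rule of rules) {
      const finding = rule.check(file, files[file]);
      if (finding !== null) findings.push(finding);
    }
  }
  return findings;
}
```

Keeping each rule as a small pure function over file content makes rules easy to test in isolation and to group by category.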

Solves for

scan my agent configuration files for hardcoded secrets and API keys

identify overly permissive tool permissions in my MCP setup

detect hook injection vulnerabilities in PreToolUse and SessionStart handlers

audit my entire agent codebase for security misconfigurations in one pass

Best for

teams building Claude Code agents who need pre-deployment security validation

developers integrating MCP servers who want to prevent supply chain attacks

organizations enforcing security baselines across multiple agent configurations

Requires

Node.js 18+

TypeScript runtime or compiled JavaScript

read access to agent configuration directories

Limitations

static analysis only — cannot detect runtime behavioral exploits or zero-day patterns not in rule registry

requires files to be discoverable on local filesystem — no remote scanning of cloud-hosted configs

rule false-positive rate documented in false-positive-audit.md; some rules may flag legitimate patterns

What makes it unique

Implements a domain-specific rule registry tailored to Claude Code + MCP threat model (102+ rules covering secrets, permissions, hooks, supply chain, prompt injection) rather than generic SAST tools; rules are organized by vulnerability category and include built-in remediation guidance specific to agent configurations

vs alternatives

More specialized for AI agent security than generic code scanners (Semgrep, Snyk) because it understands MCP server semantics, hook injection patterns, and prompt-based capability escalation unique to agent architectures

hardcoded secrets detection with multi-provider pattern matching

Medium confidence

Scans configuration files for exposed API keys, tokens, and private keys using pattern matching rules for Anthropic, OpenAI, AWS, and other providers. Detects both common formats (e.g., sk-* prefixes) and entropy-based anomalies in string values, flagging findings with severity levels and remediation steps recommending environment variable substitution or secret management tools.
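A minimal sketch of combining prefix patterns with an entropy check, assuming illustrative regexes and a threshold of 4.0 bits per character. The actual patterns and thresholds AgentShield uses are not given in the listing, and `classifySecret` is a hypothetical helper name.

```typescript
// Illustrative provider patterns; the real rule set is not shown in the listing.
const providerPatterns: { provider: string; pattern: RegExp }[] = [
  { provider: "sk-prefixed key", pattern: /\bsk-[A-Za-z0-9_-]{20,}/ },
  { provider: "AWS access key ID", pattern: /\bAKIA[0-9A-Z]{16}\b/ },
];

// Shannon entropy in bits per character; random tokens score higher than words.
function shannonEntropy(value: string): number {
  const counts: Record<string, number> = {};
  for (const ch of value.split("")) counts[ch] = (counts[ch] || 0) + 1;
  let entropy = 0;
  for (const ch of Object.keys(counts)) {
    const p = counts[ch] / value.length;
    entropy -= p * Math.log2(p);
  }
  return entropy;
}

// Returns a label for the suspected secret type, or null if nothing matched.
// The length floor and threshold are assumptions chosen for illustration.
function classifySecret(value: string, entropyThreshold = 4.0): string | null {
  for (const { provider, pattern } of providerPatterns) {
    if (pattern.test(value)) return provider;
  }
  if (value.length >= 20 && shannonEntropy(value) >= entropyThreshold) {
    return "high-entropy string";
  }
  return null;
}
```

The entropy fallback is what produces most false positives on long identifiers, which is why a length floor and a tunable threshold are useful.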

Solves for

find accidentally committed API keys in my agent configuration before pushing to GitHub

detect hardcoded tokens in MCP server definitions that could be exfiltrated

audit existing configurations for legacy secrets that should be rotated

enforce a policy that no secrets appear in version control

Best for

developers working with Claude Code who want to prevent credential leakage

DevOps teams implementing pre-commit hooks for agent configuration validation

security teams auditing third-party agent configurations for exposure risks

Requires

Node.js 18+

read access to configuration files

optional: integration with secret scanning tools (git-secrets, TruffleHog)

Limitations

pattern-based detection may miss obfuscated or custom secret formats not in the rule set

cannot tell whether a detected secret has already been rotated or invalidated; it only reports that the secret is present

high false-positive rate on legitimate long alphanumeric strings; requires manual review

What makes it unique

Combines provider-specific pattern matching (Anthropic sk-*, OpenAI sk-*, AWS AKIA*) with entropy-based anomaly detection to catch both well-known secret formats and custom tokens; integrates with AgentShield's Finding system to provide context-aware remediation (e.g., 'use ANTHROPIC_API_KEY environment variable instead')

vs alternatives

More targeted for agent configurations than generic secret scanners (git-secrets, Snyk) because it understands where secrets appear in MCP server definitions and hook configurations, not just source code

supply chain verification with source authenticity and maintenance status checks

Medium confidence

Validates the authenticity and trustworthiness of MCP server sources by cross-referencing against known-good registries, checking maintainer reputation, and verifying code signatures. Assesses maintenance status (last update, active development, community engagement) to identify abandoned or unmaintained servers that pose supply chain risks. Integrates with GitHub API to gather maintainer and repository metadata.

Solves for

verify that MCP servers in my configuration come from trusted sources

identify unmaintained or abandoned MCP servers that pose security risks

check the reputation and activity of MCP server maintainers

ensure my agent only uses actively maintained dependencies

Best for

teams using community MCP servers who need to validate trustworthiness

organizations with strict supply chain security policies

developers building agent ecosystems with multiple dependencies

Requires

Node.js 18+

MCP server definitions with source information

GitHub API token (optional, for higher rate limits)

Limitations

verification relies on external data sources (GitHub, registries) — may be incomplete or outdated

cannot detect compromised maintainer accounts or supply chain attacks after initial verification

reputation assessment is heuristic-based and may not reflect actual security posture

What makes it unique

Integrates with GitHub API to gather maintainer metadata, repository activity, and code signatures; assesses both source authenticity (is this really from the claimed maintainer?) and maintenance status (is this actively developed?) to identify supply chain risks beyond just CVE databases

vs alternatives

More thorough than generic dependency scanners because it validates source authenticity and maintenance status, not just known vulnerabilities; provides context about maintainer reputation and project health

vulnerability severity scoring and risk prioritization engine

Medium confidence

Aggregates findings from all scanning modules (static rules, deep scan, taint analysis, injection testing, sandbox monitoring) and computes a composite vulnerability severity score based on exploitability, impact, and blast radius. Prioritizes findings for remediation using a scoring engine that considers attack complexity, required privileges, and potential damage. Generates risk reports with remediation guidance ranked by severity.
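One way to picture a composite score is a weighted sum of the three factors named above. The weights, the 0 to 1 factor scale, and the names `ScoredFinding`, `riskScore`, and `prioritize` are all assumptions made for this sketch; the listing does not say how AgentShield actually combines them.

```typescript
// Hypothetical input: each factor is normalized to the range 0..1.
interface ScoredFinding {
  id: string;
  exploitability: number; // 0 = hard to exploit, 1 = trivial
  impact: number;         // 0 = negligible, 1 = severe
  blastRadius: number;    // 0 = isolated, 1 = whole system
}

// Assumed weights for illustration only; they sum to 1.
const WEIGHTS = { exploitability: 0.4, impact: 0.4, blastRadius: 0.2 };

// Composite risk on a 0..100 scale.
function riskScore(f: ScoredFinding): number {
  return 100 * (
    WEIGHTS.exploitability * f.exploitability +
    WEIGHTS.impact * f.impact +
    WEIGHTS.blastRadius * f.blastRadius
  );
}

// Highest-risk findings first; the input array is left untouched.
function prioritize(findings: ScoredFinding[]): ScoredFinding[] {
  return findings.slice().sort((a, b) => riskScore(b) - riskScore(a));
}
```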

Solves for

understand which vulnerabilities in my agent are most critical to fix

prioritize remediation efforts based on actual risk, not just rule severity

get a composite security score for my agent configuration

communicate security posture to stakeholders with quantified risk metrics

Best for

security teams managing vulnerability remediation across multiple agents

developers deciding which findings to fix first

organizations reporting security metrics to compliance and leadership

Requires

Node.js 18+

findings from scanning modules

optional: organizational risk context for custom scoring

Limitations

scoring is heuristic-based and may not reflect actual risk in specific contexts

does not account for compensating controls outside AgentShield's scope

risk prioritization assumes standard threat model — may not apply to specialized deployments

What makes it unique

Implements a composite scoring engine that combines findings from multiple analysis modules (static rules, deep scan, taint analysis, injection testing, sandbox) into a unified risk score; prioritizes remediation based on exploitability and impact rather than just rule severity

vs alternatives

More sophisticated than simple rule-based severity assignment because it considers attack complexity, required privileges, and blast radius; aggregates multiple analysis techniques into a unified risk metric

miniclaw secure agent runtime with tool whitelist and egress firewall

Medium confidence

Provides a hardened, minimal agent runtime (MiniClaw) that enforces security policies at execution time. Implements a tool whitelist that only allows explicitly approved tools, path sanitization for file access, and an egress firewall that prevents unauthorized network requests. Acts as a secure alternative to standard agent setups, with hooks into the agent lifecycle to validate tool calls against a RuntimePolicy before execution.
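The three checks described above (tool whitelist, path sanitization, egress firewall) can be sketched as a single pre-execution validation. The listing mentions a RuntimePolicy but not its fields, so the field names and the `checkToolCall` function below are hypothetical. Paths are treated as absolute POSIX-style paths and are normalized without touching the filesystem.

```typescript
// Hypothetical policy shape; MiniClaw's real RuntimePolicy fields are not documented here.
interface RuntimePolicy {
  allowedTools: string[];
  allowedRoot: string;    // file access is confined to this directory
  allowedHosts: string[]; // egress allowlist
}

interface ToolCall {
  tool: string;
  path?: string; // file the tool wants to touch, if any
  host?: string; // host the tool wants to contact, if any
}

// Collapse "." and ".." segments so "/a/../etc" cannot escape the root check.
function normalizePath(p: string): string {
  const out: string[] = [];
  for (const seg of p.split("/")) {
    if (seg === "" || seg === ".") continue;
    if (seg === "..") out.pop();
    else out.push(seg);
  }
  return "/" + out.join("/");
}

function isInside(root: string, target: string): boolean {
  const r = normalizePath(root);
  const t = normalizePath(target);
  return r === "/" || t === r || t.startsWith(r + "/");
}

// Returns a rejection reason, or null when the call is allowed.
function checkToolCall(policy: RuntimePolicy, call: ToolCall): string | null {
  if (!policy.allowedTools.includes(call.tool)) return `tool not allowed: ${call.tool}`;
  if (call.path !== undefined && !isInside(policy.allowedRoot, call.path)) {
    return `path outside allowed root: ${normalizePath(call.path)}`;
  }
  if (call.host !== undefined && !policy.allowedHosts.includes(call.host)) {
    return `egress blocked: ${call.host}`;
  }
  return null;
}
```

Comparing against `root + "/"` rather than a bare prefix keeps a sibling directory such as `/workspace2` from passing a check for `/workspace`.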

Solves for

run my agent in a hardened runtime that enforces security policies

ensure my agent can only execute whitelisted tools

prevent my agent from making unauthorized network requests

enforce path restrictions for file access at runtime

Best for

teams deploying agents in production who need runtime security enforcement

organizations with strict security requirements (financial, healthcare, government)

developers building multi-tenant agent platforms

Requires

Node.js 18+

MiniClaw runtime installation

RuntimePolicy definition with tool whitelist and network rules

Limitations

MiniClaw adds runtime overhead — may impact agent performance

tool whitelist must be manually maintained — requires operational effort

egress firewall may block legitimate agent use cases

What makes it unique

Implements a minimal, hardened agent runtime (MiniClaw) that enforces security policies at execution time through tool whitelisting, path sanitization, and egress firewall; integrates with AgentShield's policy definitions to enforce detected security requirements

vs alternatives

More practical than relying solely on static analysis because it enforces security policies at runtime; more lightweight than full sandboxing because it only restricts specific dangerous operations rather than isolating the entire runtime

ci/cd integration with github actions and baseline quality gates

Medium confidence

Provides GitHub Action integration that runs AgentShield scans automatically on pull requests and commits. Supports baseline comparison to detect regressions (new vulnerabilities introduced), quality gates that fail builds if severity thresholds are exceeded, and watch mode that alerts on configuration changes. Integrates with GitHub's status checks and pull request reviews to block merges with critical vulnerabilities.
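Baseline comparison and a severity gate reduce to a set difference plus a threshold check. The sketch below assumes findings can be identified by rule ID and file; the fingerprint scheme, type names, and function names are hypothetical and are not taken from the GitHub Action itself.

```typescript
type GateSeverity = "low" | "medium" | "high" | "critical";

const SEVERITY_RANK: Record<GateSeverity, number> = { low: 0, medium: 1, high: 2, critical: 3 };

interface ScanFinding {
  ruleId: string;
  file: string;
  severity: GateSeverity;
}

// Assumed identity of a finding: the same rule firing on the same file.
const fingerprint = (f: ScanFinding): string => `${f.ruleId}::${f.file}`;

// Findings present in the current scan but absent from the baseline (regressions).
function newFindings(baseline: ScanFinding[], current: ScanFinding[]): ScanFinding[] {
  const known = new Set(baseline.map(fingerprint));
  return current.filter((f) => !known.has(fingerprint(f)));
}

// The gate fails when any regression is at or above the configured severity.
function gatePasses(regressions: ScanFinding[], failOn: GateSeverity): boolean {
  return !regressions.some((f) => SEVERITY_RANK[f.severity] >= SEVERITY_RANK[failOn]);
}
```

On a first run with no stored baseline, passing an empty baseline makes every finding count as new.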

Solves for

automatically scan my agent configuration on every commit or pull request

prevent merging of code that introduces new security vulnerabilities

enforce security baselines and prevent regression

get alerts when my agent configuration changes in risky ways

Best for

teams using GitHub for version control who want automated security scanning

organizations enforcing security policies across multiple agent projects

developers wanting shift-left security (catch issues early in development)

Requires

GitHub repository with Actions enabled

GitHub Action workflow file configuration

optional: GitHub App installation for enhanced permissions

Limitations

GitHub Action integration requires GitHub repository — not suitable for other VCS platforms

baseline comparison requires historical scan data — not available on first run

quality gates are binary (pass/fail) — cannot enforce graduated policies

What makes it unique

Integrates with GitHub Actions to run AgentShield scans automatically on commits/PRs; supports baseline comparison to detect regressions and quality gates that fail builds if severity thresholds are exceeded; provides GitHub App integration for enhanced permissions and pull request review comments

vs alternatives

More integrated than running AgentShield manually because it automates scanning and blocks risky merges; more practical than generic security scanning tools because it understands agent-specific vulnerabilities

auto-fix engine with configuration remediation and policy initialization

Medium confidence

Automatically generates and applies fixes for detected vulnerabilities, including moving hardcoded secrets to environment variables, removing wildcard tool permissions, sanitizing hook code, and pinning MCP server versions. Provides an initialization mode that creates secure baseline configurations from scratch. Uses code transformation patterns to modify configuration files safely while preserving structure and comments.
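As a rough illustration of the secrets-to-environment fix, the sketch below rewrites an MCP server's `env` map so that secret-looking values become `${VAR}` placeholders. It assumes the consuming tool expands `${VAR}` references; `moveSecretsToEnv` and `FixResult` are hypothetical names, and this version works on a parsed object rather than preserving comments in the original file.

```typescript
interface FixResult {
  config: Record<string, string>; // rewritten env map, safe to commit
  moved: Record<string, string>;  // variable name -> original secret value
}

// Replace secret-looking values with "${NAME}" placeholders without mutating the input.
// `looksSecret` is supplied by the caller, e.g. a pattern or entropy check.
function moveSecretsToEnv(
  env: Record<string, string>,
  looksSecret: (value: string) => boolean
): FixResult {
  const config: Record<string, string> = {};
  const moved: Record<string, string> = {};
  for (const key of Object.keys(env)) {
    const value = env[key];
    if (looksSecret(value)) {
      config[key] = "${" + key + "}";
      moved[key] = value;
    } else {
      config[key] = value;
    }
  }
  return { config, moved };
}
```

Returning the moved values separately lets the caller write them to a local, untracked location and prompt for rotation, since a secret that was ever committed should be treated as exposed.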

Solves for

automatically fix detected vulnerabilities in my agent configuration

move hardcoded secrets to environment variables

remove overly permissive tool permissions

initialize a new agent configuration with security best practices

Best for

developers wanting quick remediation of detected vulnerabilities

teams initializing new agent projects with secure defaults

organizations automating security compliance across multiple agents

Requires

Node.js 18+

write access to configuration files

optional: --fix flag to apply transformations

Limitations

auto-fix may not be appropriate for all vulnerabilities — requires manual review

code transformations may not preserve all formatting or comments

cannot fix vulnerabilities that require architectural changes

What makes it unique

Implements code transformation patterns that safely modify configuration files to fix detected vulnerabilities (moving secrets to env vars, removing wildcard permissions, pinning versions) while preserving file structure and comments; provides initialization mode for creating secure baseline configurations

vs alternatives

More practical than manual remediation because it automates fix application; more careful than generic code transformers because it understands agent configuration semantics and preserves structure

organizational policy enforcement with custom rules and compliance reporting

Medium confidence

Enables organizations to define custom security policies that extend AgentShield's built-in rules, enforcing organization-specific requirements (e.g., 'all MCP servers must be from approved registry', 'no external network access'). Generates compliance reports showing which agents meet organizational policies and which require remediation. Integrates with policy management systems to enforce policies across multiple agent projects.
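A custom policy such as "all MCP servers must be from approved registry" can be modeled as a function from a configuration to a list of violations. The policy format below is a hypothetical sketch (the listing does not describe AgentShield's real policy schema), and it approximates "approved registry" with an approved npm scope.

```typescript
// Hypothetical shapes; the real policy format is not described in the listing.
interface AgentConfig {
  agent: string;
  mcpPackages: string[]; // npm package names of configured MCP servers
}

interface OrgPolicy {
  id: string;
  // Returns one message per violation; an empty array means compliant.
  violations(config: AgentConfig): string[];
}

// Example policy: every MCP server package must come from an approved npm scope.
function approvedScopePolicy(scopes: string[]): OrgPolicy {
  return {
    id: "org/approved-mcp-scope",
    violations(config) {
      return config.mcpPackages
        .filter((pkg) => !scopes.some((scope) => pkg.startsWith(scope + "/")))
        .map((pkg) => `${pkg} is not from an approved scope`);
    },
  };
}

interface ComplianceRow {
  agent: string;
  compliant: boolean;
  violations: string[];
}

// One row per agent, listing every policy violation found.
function complianceReport(policies: OrgPolicy[], configs: AgentConfig[]): ComplianceRow[] {
  return configs.map((config) => {
    const violations: string[] = [];
    for (const policy of policies) {
      for (const message of policy.violations(config)) {
        violations.push(`${policy.id}: ${message}`);
      }
    }
    return { agent: config.agent, compliant: violations.length === 0, violations };
  });
}
```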

Solves for

enforce my organization's security policies across all agent projects

define custom rules specific to my organization's threat model

generate compliance reports for auditors and leadership

ensure all agents meet organizational security baselines

Best for

large organizations with multiple agent projects and security requirements

teams with compliance obligations (SOC2, ISO27001, HIPAA)

enterprises wanting to enforce consistent security policies

Requires

Node.js 18+

organizational policy definitions

optional: policy management system integration

Limitations

custom policy definition requires security expertise

policy enforcement is only as good as the rules defined

compliance reporting requires manual interpretation of results

What makes it unique

Extends AgentShield's built-in rules with organization-specific policies that can enforce custom security requirements; generates compliance reports showing which agents meet organizational policies and provides remediation guidance for non-compliant configurations

vs alternatives

More flexible than fixed rule sets because it allows organizations to define custom policies; more practical than manual compliance audits because it automates policy checking and reporting

skills health system with dependency tracking and update notifications

Medium confidence

Monitors the health of MCP servers and agent skills by tracking dependency versions, maintenance status, and security updates. Provides notifications when new versions are available, when dependencies become unmaintained, or when security patches are released. Maintains a skills registry that tracks which agents use which skills and enables impact analysis for updates.
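Impact analysis over a skills registry amounts to a reverse lookup from skill to agents. The registry shape (`SkillUsage`) and the function name are assumptions for this sketch; the listing does not specify how the registry is stored.

```typescript
// Hypothetical registry: agent name -> skills (or MCP servers) it declares.
type SkillUsage = Record<string, string[]>;

// Agents that would be affected by updating or deprecating `skill`, sorted by name.
// Only declared dependencies are considered, so indirect usage is not detected.
function agentsAffectedBy(usage: SkillUsage, skill: string): string[] {
  return Object.keys(usage)
    .filter((agent) => usage[agent].includes(skill))
    .sort();
}
```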

Solves for

stay informed about security updates for MCP servers my agents depend on

identify unmaintained or abandoned skills that should be replaced

understand the impact of updating a skill across multiple agents

plan skill upgrades and deprecations systematically

Best for

teams managing multiple agents with shared MCP server dependencies

organizations wanting to track skill health across their agent ecosystem

developers planning skill upgrades and deprecations

Requires

Node.js 18+

skills registry with dependency information

network access to GitHub and other registries

Limitations

health tracking requires continuous monitoring — adds operational overhead

update notifications may be noisy if not filtered appropriately

impact analysis is based on declared dependencies — may miss indirect impacts

What makes it unique

Implements continuous monitoring of MCP server health (maintenance status, security updates, version availability) and provides impact analysis showing which agents would be affected by skill updates; integrates with notification systems to alert teams about critical updates

vs alternatives

More proactive than manual dependency tracking because it continuously monitors health and provides notifications; more practical than generic dependency management tools because it understands agent-specific skill dependencies

permissive tool permission analysis with wildcard and deny-list detection

Medium confidence

Analyzes agent tool permission definitions to identify overly broad access patterns, including wildcard permissions (e.g., Bash(*)), missing deny lists for destructive operations, and privilege escalation vectors. Uses pattern matching on tool definitions to flag configurations where an agent could execute arbitrary shell commands or access sensitive files without restrictions.
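A minimal sketch of the wildcard and missing-deny-list checks, assuming permissions are expressed as `allow` and `deny` arrays of strings such as `Bash(*)`. The exact rule syntax AgentShield parses is not given in the listing, so `ToolPermissions` and `analyzePermissions` are illustrative names only.

```typescript
interface ToolPermissions {
  allow: string[];
  deny: string[];
}

function analyzePermissions(perms: ToolPermissions): string[] {
  const issues: string[] = [];
  for (const rule of perms.allow) {
    // "*" alone or "Tool(*)" grants the tool with no argument restriction.
    if (rule === "*" || /^[A-Za-z]+\(\*\)$/.test(rule)) {
      issues.push(`wildcard permission: ${rule}`);
    }
  }
  // Shell access without any deny rule leaves destructive commands unrestricted.
  const allowsBash = perms.allow.some((r) => r === "*" || r.startsWith("Bash"));
  const deniesBash = perms.deny.some((r) => r.startsWith("Bash("));
  if (allowsBash && !deniesBash) {
    issues.push("Bash is allowed but no Bash deny rules are defined");
  }
  return issues;
}
```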

Solves for

identify if my agent has wildcard bash access that could be exploited

ensure my agent cannot access sensitive file paths or execute destructive commands

audit tool permissions to enforce least-privilege principle

detect missing deny lists for operations like rm, dd, or network exfiltration

Best for

teams deploying agents in production who need to minimize blast radius of compromise

security architects designing agent permission models

organizations with compliance requirements (SOC2, ISO27001) for access control

Requires

Node.js 18+

agent configuration with explicit tool definitions

optional: MiniClaw runtime for enforcement of detected policies

Limitations

cannot detect runtime permission escalation through tool chaining or indirect access

does not validate whether deny lists are actually enforced by the runtime

requires understanding of tool semantics — may miss domain-specific dangerous operations

What makes it unique

Implements agent-specific permission semantics (understanding that Bash(*) is dangerous, that file access should be path-restricted, that network tools need egress controls) rather than generic RBAC analysis; integrates with MiniClaw runtime to enforce detected policies at execution time

vs alternatives

More specialized than generic IAM policy analyzers (AWS IAM Access Analyzer) because it understands agent tool semantics and the specific attack surface of autonomous code execution

hook injection vulnerability detection with command and exfiltration pattern analysis

Medium confidence

Analyzes PreToolUse and SessionStart hooks for command injection vulnerabilities and data exfiltration patterns. Scans hook code for dangerous patterns (shell metacharacters, subprocess calls, network requests) and detects capability escalation attempts where hooks could bypass tool restrictions or leak system prompts. Uses AST-level or regex-based pattern matching to identify risky hook implementations.
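The regex-based side of this analysis might look like the following sketch, which scans a hook's shell command for a few risky constructs. The pattern list is a small illustrative sample, not AgentShield's rule set, and `scanHookCommand` is a hypothetical name.

```typescript
// Illustrative patterns only; a real rule set would be far larger.
const HOOK_PATTERNS: { label: string; pattern: RegExp }[] = [
  { label: "network request", pattern: /\b(curl|wget|nc)\b/ },
  { label: "command substitution", pattern: /\$\(|`/ },
  { label: "pipe to shell", pattern: /\|\s*(sh|bash)\b/ },
];

// Returns the label of every risky pattern found in a hook command string.
function scanHookCommand(command: string): string[] {
  return HOOK_PATTERNS
    .filter(({ pattern }) => pattern.test(command))
    .map(({ label }) => label);
}
```

Matches like these are signals for review rather than proof of a vulnerability, since a hook can legitimately call `curl` or spawn a subprocess.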

Solves for

detect if my PreToolUse hooks could be exploited for command injection

identify hooks that might exfiltrate system prompts or sensitive data

audit hook code for dangerous subprocess calls or network requests

ensure hooks enforce security policies rather than bypass them

Best for

developers implementing custom hooks in Claude Code agents

security teams reviewing hook implementations for compliance

teams using hooks for tool filtering or policy enforcement

Requires

Node.js 18+

hook definitions in agent configuration files

optional: deep scan mode (--deep flag) for more thorough analysis

Limitations

pattern-based detection may miss sophisticated injection techniques or obfuscated code

cannot detect hooks that are dynamically generated or loaded at runtime

false positives on legitimate hook code that uses subprocess for valid purposes

What makes it unique

Specifically targets hook-based attack vectors in Claude Code (PreToolUse/SessionStart) rather than generic code injection detection; understands that hooks are a privileged execution context that can bypass tool restrictions, making them high-value targets for exploitation

vs alternatives

More targeted than generic code injection scanners because it understands the specific hook lifecycle in Claude Code agents and the privilege escalation risk they represent

mcp supply chain risk assessment with version pinning and source verification

Medium confidence

Analyzes MCP server configurations to identify supply chain vulnerabilities including unpinned versions, npx auto-installs, and risky server sources. Cross-references servers against a threat intelligence database (CVE database) to flag known vulnerable versions. Detects dynamic server loading patterns that could allow injection of malicious servers and validates server source authenticity.
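The unpinned-version and npx auto-install checks can be sketched against a server entry with a `command` and `args`. The entry shape and function names are assumptions for illustration, and "pinned" is taken here to mean an exact `name@x.y.z` version, which may be stricter or looser than the tool's real rule.

```typescript
interface ServerEntry {
  command: string;
  args: string[];
}

// True when the package spec ends in an exact version, e.g. "@scope/pkg@1.2.3".
// A leading "@" belongs to the scope, so the version separator must be past index 0.
function isPinned(pkg: string): boolean {
  const at = pkg.lastIndexOf("@");
  if (at <= 0) return false;
  return /^\d+\.\d+\.\d+(-[0-9A-Za-z.-]+)?$/.test(pkg.slice(at + 1));
}

function supplyChainIssues(server: ServerEntry): string[] {
  if (server.command !== "npx") return [];
  const issues: string[] = [];
  // -y / --yes makes npx install a missing package without prompting.
  if (server.args.some((a) => a === "-y" || a === "--yes")) {
    issues.push("npx auto-install enabled");
  }
  // The first non-flag argument is treated as the package spec.
  const pkg = server.args.find((a) => !a.startsWith("-"));
  if (pkg !== undefined && !isPinned(pkg)) {
    issues.push(`unpinned package: ${pkg}`);
  }
  return issues;
}
```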

Solves for

identify MCP servers in my configuration that have known CVEs or security issues

ensure all MCP server versions are pinned to prevent auto-update attacks

detect if my configuration uses npx to auto-install servers (supply chain risk)

audit third-party MCP servers for trustworthiness and maintenance status

Best for

teams using community MCP servers who need to validate their security posture

organizations with supply chain security requirements

developers building agent ecosystems with multiple MCP server dependencies

Requires

Node.js 18+

MCP server definitions in configuration files

network access to threat intelligence database (CVE, supply chain verification)

Limitations

CVE database may lag behind actual vulnerability disclosures

cannot detect zero-day vulnerabilities in MCP servers

version pinning check does not validate whether pinned versions are actually secure

What makes it unique

Integrates MCP-specific threat intelligence (understanding that npx auto-installs are risky, that unpinned versions enable supply chain attacks, that MCP servers run with elevated privileges) with CVE database lookups; provides supply chain verification that validates server sources against known-good registries

vs alternatives

More specialized than generic dependency scanners (npm audit, Snyk) because it understands MCP server semantics and the specific risk of dynamic server loading in agent configurations

prompt injection and capability escalation detection with multi-chain analysis

Medium confidence

Detects prompt injection vulnerabilities and capability escalation attacks in agent prompts, including 'Russian Doll' multi-chain injection vectors where an attacker chains multiple prompts to bypass restrictions. Analyzes prompt definitions for patterns that could allow an attacker to override system instructions, escalate tool access, or manipulate agent behavior. Uses pattern matching and semantic analysis to identify risky prompt structures.

Solves for

identify if my system prompt could be overridden through user input

detect capability escalation attempts in multi-turn conversations

audit prompts for 'Russian Doll' injection patterns that chain multiple exploits

ensure my agent prompts enforce security boundaries and cannot be manipulated

Best for

teams building Claude Code agents with user-facing interfaces

security researchers testing agent robustness against prompt injection

organizations with strict prompt security requirements

Requires

Node.js 18+

agent prompt definitions in CLAUDE.md or settings.json

optional: Anthropic API key with Claude Opus access for deep scan mode (--opus flag)

Limitations

prompt injection detection is heuristic-based and may miss sophisticated attacks

cannot detect runtime prompt injection from external sources (user input, API responses)

false positives on legitimate prompts that discuss security or tool usage

What makes it unique

Implements multi-chain injection analysis using Claude Opus (in deep scan mode) to simulate 'Russian Doll' attacks where an attacker chains multiple prompts to bypass restrictions; combines static pattern matching with adversarial LLM-based testing to detect both obvious and subtle injection vectors

vs alternatives

More sophisticated than generic prompt injection detectors because it understands agent-specific attack patterns (tool escalation, system prompt override, multi-turn manipulation) and uses adversarial LLM testing to find novel injection techniques

deep scan adversarial analysis with three-agent opus pipeline

Medium confidence

Activates an advanced security analysis mode using Claude Opus in a three-agent pipeline (Attacker/Defender/Auditor) to simulate real-world exploits against agent configurations. The Attacker agent generates adversarial prompts and attack scenarios, the Defender agent proposes mitigations, and the Auditor agent validates findings. This goes beyond static rules to discover novel vulnerabilities through adversarial reasoning.

Solves for

simulate real attacks against my agent configuration to find vulnerabilities static analysis misses

get expert security analysis of my agent setup from an LLM perspective

discover novel prompt injection techniques specific to my agent's design

validate that my security mitigations actually work against adversarial attacks

Best for

security teams doing comprehensive agent security audits

developers building high-security agents (financial, healthcare, critical infrastructure)

researchers studying agent security and adversarial robustness

Requires

Node.js 18+

Anthropic API key with Claude Opus access and sufficient quota

network access to Anthropic API

Limitations

requires Claude Opus API access, which adds significant cost and latency (minutes per scan)

Opus-based analysis is non-deterministic — results may vary between runs

cannot guarantee finding all vulnerabilities — adversarial analysis is heuristic-based

What makes it unique

Implements a three-agent Opus pipeline (Attacker/Defender/Auditor) that simulates adversarial reasoning rather than relying solely on static rules; the Attacker agent generates novel attack scenarios, Defender proposes mitigations, and Auditor validates findings, enabling discovery of vulnerabilities beyond the static rule registry

vs alternatives

More thorough than static analysis tools because it uses adversarial LLM reasoning to discover novel vulnerabilities; more practical than manual security audits because it automates the attack simulation and mitigation validation process

taint analysis for data flow tracking and exfiltration detection

Medium confidence

Performs data flow analysis to track how sensitive data (system prompts, API keys, user inputs) flows through agent configurations, hooks, and tool calls. Identifies potential exfiltration paths where sensitive data could leak to external systems (network requests, logs, tool outputs). Uses taint propagation to mark sensitive sources and detect when tainted data reaches dangerous sinks.
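Taint propagation can be illustrated as reachability over a data flow graph: mark the sensitive sources, follow edges, and report any dangerous sink that becomes tainted. The graph representation and names below are a hypothetical sketch; how AgentShield actually derives the graph from hooks and tool calls is not described in the listing.

```typescript
// Hypothetical graph: node -> nodes that receive data from it.
type FlowGraph = Record<string, string[]>;

// Breadth-first propagation from sources; returns the sinks that tainted data reaches.
function taintedSinks(graph: FlowGraph, sources: string[], sinks: string[]): string[] {
  const tainted = new Set<string>(sources);
  const queue = sources.slice();
  while (queue.length > 0) {
    const node = queue.shift() as string;
    for (const next of graph[node] || []) {
      if (!tainted.has(next)) {
        tainted.add(next);
        queue.push(next);
      }
    }
  }
  return sinks.filter((sink) => tainted.has(sink));
}
```

Because every reachable node is marked regardless of what the intermediate step does to the data, this style of analysis over-approximates, which matches the conservative behavior noted in the limitations.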

Solves for

ensure my agent cannot exfiltrate system prompts through tool outputs or network requests

track how API keys and secrets flow through my agent configuration

identify if hooks could leak sensitive data to external systems

validate that user inputs cannot be used to extract sensitive information

Best for

teams building agents that handle sensitive data (PII, financial info, secrets)

security teams validating data isolation in multi-tenant agent setups

developers implementing custom hooks who want to ensure data safety

Requires

Node.js 18+

complete agent configuration with data flow paths

--deep flag or --taint-analysis flag

Limitations

taint analysis is conservative — may flag legitimate data flows as risky

cannot track data flows through external systems or APIs

requires understanding of tool semantics to identify dangerous sinks

What makes it unique

Implements taint analysis specifically for agent data flows, tracking how sensitive data (system prompts, API keys) propagates through hooks, tools, and external calls; identifies exfiltration paths that static analysis alone would miss by modeling data dependencies

vs alternatives

More specialized than generic data flow analyzers because it understands agent-specific data sources (system prompts, tool outputs) and sinks (network requests, logs, tool parameters)

injection testing with adversarial prompt generation and execution simulation

Medium confidence

Generates adversarial prompts designed to exploit detected vulnerabilities and simulates their execution against the agent configuration without actually running them. Tests injection vectors including prompt override, tool escalation, and data exfiltration. Uses Claude Opus to generate realistic attack prompts and validates whether the agent's security controls would prevent exploitation.

Solves for

test if my agent is actually vulnerable to the injection patterns AgentShield detected

generate realistic adversarial prompts that could exploit my agent

validate that my security mitigations actually prevent detected attacks

understand the practical impact of detected vulnerabilities

Best for

security teams validating vulnerability severity before remediation

developers testing security fixes to ensure they work

researchers studying agent robustness and adversarial resilience

Requires

Node.js 18+

Anthropic API key with Claude Opus access

complete agent configuration

Limitations

execution simulation does not run actual agent code — may miss runtime-specific vulnerabilities

generated prompts are heuristic-based and may not represent real attacker techniques

requires Claude Opus API access, which adds cost and latency

What makes it unique

Uses Claude Opus to generate realistic adversarial prompts that target detected vulnerabilities, then simulates their execution against the agent configuration to validate whether security controls would prevent exploitation; bridges static analysis findings with practical impact assessment

vs alternatives

More practical than static vulnerability detection alone because it validates whether detected vulnerabilities are actually exploitable; more efficient than manual penetration testing because it automates prompt generation and execution simulation

sandbox behavioral analysis with runtime execution monitoring

Medium confidence

Executes agent configurations in an isolated sandbox environment and monitors their runtime behavior for security violations. Tracks system calls, network requests, file access, and tool invocations to detect whether the agent violates its declared permissions or exhibits suspicious behavior. Compares actual behavior against the declared security policy to identify policy violations.
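Comparing observed behavior against a declared policy can be sketched as a filter over recorded events. The event and policy shapes below are hypothetical; in practice the events would come from whatever tracing the sandbox provides, which the listing does not detail.

```typescript
// Hypothetical declared policy and observed event shapes.
interface DeclaredPolicy {
  tools: string[];        // tools the agent is allowed to invoke
  hosts: string[];        // hosts the agent is allowed to contact
  pathPrefixes: string[]; // file path prefixes the agent may access
}

interface ObservedEvent {
  kind: "tool" | "network" | "file";
  target: string; // tool name, hostname, or file path
}

// Returns every observed event that the declared policy does not permit.
function policyViolations(policy: DeclaredPolicy, events: ObservedEvent[]): ObservedEvent[] {
  return events.filter((event) => {
    if (event.kind === "tool") return !policy.tools.includes(event.target);
    if (event.kind === "network") return !policy.hosts.includes(event.target);
    return !policy.pathPrefixes.some((prefix) => event.target.startsWith(prefix));
  });
}
```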

Solves for

verify that my agent actually respects the tool permissions I declared

detect if my agent makes unexpected network requests or file accesses

monitor agent behavior for signs of compromise or malicious activity

validate that security controls are actually enforced at runtime

Best for

teams deploying agents in production who need runtime security validation

security teams monitoring agent behavior for anomalies

developers testing that security fixes actually work in practice

Requires

Node.js 18+

ability to execute agent code in isolated environment

system call tracing tools (strace, dtrace, or equivalent)

Limitations

sandbox execution adds significant latency and resource overhead

may not detect sophisticated attacks that avoid triggering monitored behaviors

requires ability to execute agent code — not suitable for untrusted configurations

What makes it unique

Executes agent configurations in an isolated sandbox and monitors runtime behavior (system calls, network requests, file access) against declared security policies; detects policy violations and behavioral anomalies that static analysis cannot find by observing actual execution

vs alternatives

More comprehensive than static analysis because it validates runtime behavior; more practical than manual testing because it automates behavior monitoring and policy violation detection

Capabilities are decomposed by AI analysis. Each maps to specific user intents and improves with match feedback.

Related Artifacts sharing capabilities

Artifacts that share capabilities with agentshield, ranked by overlap. Discovered automatically through the match graph.

CLI Tool (42)

Semgrep

Static analysis: custom rules for bugs and security, 30+ languages, AI-powered triage.

supply chain scanning with dependency vulnerability detection

secrets detection with semantic validation

2 shared capabilities

CLI Tool (42)

Semgrep CLI

AI-powered static analysis for security.

supply chain scanning with dependency vulnerability detection and reachability analysis

secrets detection with semantic validation and entropy analysis

2 shared capabilities

Platform (40)

Aikido Security

All-in-one appsec platform with AI-powered triage.

secrets detection and credential exposure prevention

software composition analysis with cve detection and sbom generation

2 shared capabilities

Product (27)

Coderbuds

Coderbuds is a code review tool that automates the code review process, providing feedback and recommendations to...

security-vulnerability-scanning

1 shared capability

Agent (39)

Sourcery

AI code review agent for pull requests.

security vulnerability scanning across repositories

1 shared capability

Repository (23)

PR-Agent

AI-powered tool for automated PR analysis, feedback, suggestions and more.

security vulnerability detection in code changes

1 shared capability

Best For

✓ teams building Claude Code agents who need pre-deployment security validation
✓ developers integrating MCP servers who want to prevent supply chain attacks
✓ organizations enforcing security baselines across multiple agent configurations
✓ developers working with Claude Code who want to prevent credential leakage
✓ DevOps teams implementing pre-commit hooks for agent configuration validation
✓ security teams auditing third-party agent configurations for exposure risks
✓ teams using community MCP servers who need to validate trustworthiness
✓ organizations with strict supply chain security policies

Known Limitations

⚠ static analysis only: cannot detect runtime behavioral exploits or zero-day patterns not in rule registry
⚠ requires files to be discoverable on local filesystem; no remote scanning of cloud-hosted configs
⚠ rule false-positive rate documented in false-positive-audit.md; some rules may flag legitimate patterns
⚠ pattern-based detection may miss obfuscated or custom secret formats not in the rule set
⚠ cannot detect secrets already rotated or invalidated; only identifies presence
⚠ high false-positive rate on legitimate long alphanumeric strings; requires manual review

Requirements

Node.js 18+

TypeScript runtime or compiled JavaScript

read access to agent configuration directories

read access to configuration files

optional: integration with secret scanning tools (git-secrets, TruffleHog)

MCP server definitions with source information

GitHub API token (optional, for higher rate limits)

network access to GitHub and other registries

Input / Output

Accepts: JSON configuration files (settings.json, mcp.json), Markdown files (CLAUDE.md with prompt definitions), YAML/TOML agent configs, JSON configuration files, environment variable definitions, markdown prompt definitions, MCP server configurations with source URLs, package.json or lock files with dependency information, Finding objects from all scanning modules, agent configuration context, RuntimePolicy definitions, tool whitelist configuration, network egress rules, agent configuration files in repository, pull request metadata, commit history, agent configuration files, Finding objects with remediation suggestions, custom policy definitions, agent configurations from multiple projects, compliance requirements, MCP server definitions, dependency version information, skills usage across agents, tool permission definitions in settings.json or mcp.json, tool whitelist/blacklist configurations, hook code embedded in settings.json or CLAUDE.md, JavaScript/TypeScript hook implementations, MCP server configurations with name, version, and source, package.json or lock files for dependency pinning, system prompts in CLAUDE.md, prompt templates in agent configuration, user-facing prompt instructions, complete agent configuration (settings.json, mcp.json, CLAUDE.md), tool definitions and permissions, hook implementations, agent configuration with tool definitions, data source definitions (user input, API responses), agent configuration with detected vulnerabilities, prompt definitions, executable agent configuration, tool implementations, hook code

Produces: structured Finding objects with severity, description, remediation, JSON/CSV/HTML reports with aggregated vulnerability counts, exit codes for CI/CD integration, Finding objects with secret type, location, and remediation, severity-tagged reports (CRITICAL for API keys, HIGH for tokens), supply chain risk scores, maintainer reputation assessment, maintenance status (last update, activity level), Finding objects for untrusted or unmaintained servers, composite vulnerability severity scores (0-100), prioritized finding lists ranked by risk, risk reports with remediation guidance, security scorecards for compliance reporting, enforced tool call validation, blocked tool execution with policy violation logs, network request filtering, GitHub status checks (pass/fail), pull request comments with findings, GitHub Action logs with detailed scan results, alerts on configuration changes, modified configuration files with fixes applied, transformation logs showing what was changed, initialized baseline configuration, compliance reports showing policy adherence, policy violation findings, remediation guidance for non-compliant agents, health status for each skill, update notifications, impact analysis for skill updates, deprecation warnings, Finding objects identifying wildcard patterns and missing deny lists, remediation suggestions with specific tool restrictions, Finding objects with injection pattern location and severity, remediation guidance for secure hook implementation, Finding objects with CVE references and severity, remediation steps (pin versions, update to patched releases), Finding objects with injection pattern and severity, remediation guidance for prompt hardening, adversarial prompt examples (in deep scan mode), adversarial attack scenarios and exploitation techniques, Finding objects with Opus-generated severity and remediation, detailed security report with attack chains and mitigations, data flow diagrams showing taint propagation, Finding objects identifying exfiltration paths, remediation guidance for data isolation, generated adversarial prompts, execution simulation results, vulnerability impact assessment, runtime behavior trace (system calls, network requests, file access), behavioral anomaly alerts
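As a rough illustration of how Finding objects and CI/CD exit codes might fit together, here is a small TypeScript sketch. The field names follow the "severity, description, remediation" wording above, but the exact `Finding` shape, the severity levels other than CRITICAL and HIGH, the `exitCodeFor` helper, and the 0/1 exit-code convention are all assumptions made for illustration, not agentshield's documented interface.

```typescript
// Assumed severity scale; the listing only names CRITICAL and HIGH explicitly.
type Severity = "low" | "medium" | "high" | "critical";

// Hypothetical Finding shape based on the fields named in the listing.
interface Finding {
  severity: Severity;
  description: string;
  remediation: string;
}

// Numeric ordering so severities can be compared against a threshold.
const severityRank: Record<Severity, number> = {
  low: 0,
  medium: 1,
  high: 2,
  critical: 3,
};

// Assumed convention: exit 1 if any finding meets the threshold, else 0.
function exitCodeFor(findings: Finding[], failOn: Severity): number {
  const failing = findings.some(
    (f) => severityRank[f.severity] >= severityRank[failOn],
  );
  return failing ? 1 : 0;
}

// Example values only.
const sampleFindings: Finding[] = [
  {
    severity: "critical",
    description: "API key hardcoded in settings.json",
    remediation: "Move the key to an environment variable and rotate it.",
  },
  {
    severity: "medium",
    description: "MCP server version is not pinned",
    remediation: "Pin the server to a specific version.",
  },
];

console.log(exitCodeFor(sampleFindings, "high"));
```

A CI job could pass such a value to `process.exit` so that a pull request fails only when findings reach the chosen threshold.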

UnfragileRank

Adoption: 20% (30% weight)

Quality: 53% (25% weight)

Ecosystem: 70% (25% weight)

Match Graph: 10% (15% weight)

Freshness: 75% (5% weight)

UnfragileRank is computed from adoption signals, documentation quality, ecosystem connectivity, match graph feedback, and freshness. No artifact can pay for a higher rank.
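The listing does not state the exact formula. If the rank is assumed to be a plain weighted sum of the five component scores, the figures above can be combined as in this short TypeScript sketch. The `Component` type and `weightedRank` function are illustrative names, and the weighted-sum formula itself is an assumption rather than the site's documented method.

```typescript
// One rank component: a score and a weight, both in whole percent.
interface Component {
  name: string;
  scorePct: number;
  weightPct: number;
}

// Component values as shown in the listing.
const components: Component[] = [
  { name: "Adoption", scorePct: 20, weightPct: 30 },
  { name: "Quality", scorePct: 53, weightPct: 25 },
  { name: "Ecosystem", scorePct: 70, weightPct: 25 },
  { name: "Match Graph", scorePct: 10, weightPct: 15 },
  { name: "Freshness", scorePct: 75, weightPct: 5 },
];

// Assumed formula: sum(score * weight) / sum(weights).
// Integer percent values keep the arithmetic exact.
function weightedRank(parts: Component[]): number {
  const totalWeight = parts.reduce((sum, p) => sum + p.weightPct, 0);
  const weighted = parts.reduce((sum, p) => sum + p.scorePct * p.weightPct, 0);
  return weighted / totalWeight;
}

const rank = weightedRank(components);
console.log(rank); // → 42
```

Here the weights sum to 100 and the weighted total is 600 + 1325 + 1750 + 150 + 375 = 4200, which gives 42 out of 100 under that assumption.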

Type: MCP Server

17 capabilities


Repository Details

Stars: 526

Forks: 109

Language: TypeScript

License: MIT

Topics: ai-agent, anthropic, claude-code, hackathon, mcp, opus, security

Last commit: Apr 17, 2026

Alternatives to agentshield

IntelliCode (Extension, 50)

AI-assisted development

GitHub Copilot Chat (Extension, 53)

AI chat features powered by Copilot

GitHub Copilot (Extension, 52)

Your AI pair programmer

Claude Code for VS Code (Extension, 52)

Claude Code for VS Code: Harness the power of Claude Code without leaving your IDE

Data Sources

mcp registry
