Headless Browser Automation With Puppeteer

1

Puppeteer MCP ServerMCP Server82/100

via “mcp server for browser automation with puppeteer”

Automate browser interactions and take screenshots via Puppeteer MCP.

Unique: This MCP server is specifically tailored for Puppeteer, providing a unique integration for browser automation tasks.

vs others: Unlike generic automation tools, this server is built specifically for Puppeteer, offering streamlined functionality for headless browsing.

2

ClineAgent61/100

via “headless browser automation with screenshot and dom inspection”

Autonomous AI coding assistant for VS Code — reads, edits, runs commands with human-in-the-loop approval.

Unique: Integrates headless browser automation with screenshot capture and DOM extraction, feeding both visual and structural information to the LLM for reasoning. Actions are gated by approval, and screenshots are captured after each action to provide visual feedback. This combines visual understanding with structured DOM access, which most agents lack.

vs others: More capable than Copilot for web testing because it can actually navigate and interact with web applications, capture screenshots, and reason about visual state, rather than just suggesting test code.

3

puppeteer-mcp-serverMCP Server59/100

via “headless-browser-automation-via-mcp”

Experimental MCP server for browser automation using Puppeteer (inspired by @modelcontextprotocol/server-puppeteer)

Unique: Wraps Puppeteer as an MCP server rather than a direct library, enabling LLM agents to invoke browser automation through standardized tool-calling protocols without managing browser lifecycle or connection pooling themselves

vs others: Provides MCP-native browser automation (compatible with Claude and other MCP clients) whereas direct Puppeteer requires custom API wrappers and manual integration into LLM tool schemas

4

BrowserbasePlatform57/100

via “managed-headless-browser-provisioning-with-stealth-mode”

Headless browser infrastructure for AI agents — stealth mode, CAPTCHA solving, session recording.

Unique: Combines managed Chromium provisioning with tiered stealth implementations (basic fingerprint spoofing vs advanced masking) and per-plan concurrency quotas, eliminating need for self-hosted Selenium/Playwright infrastructure while providing bot-detection evasion out-of-box

vs others: Simpler than self-hosted Playwright/Selenium (no infrastructure management) and more affordable than BrowserStack for AI agent workloads, but less transparent about stealth effectiveness and session persistence guarantees than open-source alternatives

5

Crawl4AIRepository57/100

via “javascript-rendered web content extraction with headless browser pooling”

AI-optimized web crawler — clean markdown extraction, JS rendering, structured output for RAG.

Unique: Implements browser pooling with adaptive memory management and per-URL session reuse via AsyncWebCrawler orchestrator, allowing efficient rendering of hundreds of pages without spawning new browser processes for each URL. Integrates Chrome DevTools Protocol for programmatic control over rendering behavior, network interception, and virtual scroll triggering.

vs others: Faster than Selenium-based crawlers due to Playwright's native async/await support and connection pooling; more memory-efficient than spawning new browser per page; supports modern CDP features that Puppeteer alone cannot leverage.

6

gemini-cliAgent55/100

via “browser agent with web navigation and content extraction”

An open-source AI agent that brings the power of Gemini directly into your terminal.

Unique: Implements a browser automation tool that can be invoked by the agent for web navigation and content extraction, enabling real-time web research and interaction with web-based services as part of the agent's reasoning loop.

vs others: More capable than simple web search because it enables full browser automation including JavaScript execution, form interaction, and dynamic content extraction, allowing the agent to work with modern web applications.

7

ScraplingRepository55/100

via “stealth browser automation with anti-detection evasion”

🕷️ An adaptive Web Scraping framework that handles everything from a single request to a full-scale crawl!

Unique: Combines Playwright stealth plugin with user-agent randomization, header spoofing, and behavioral mimicry (random delays, mouse movements) to mask automation indicators. Integrates proxy rotation at the fetcher level, enabling transparent IP distribution without application-level code changes.

vs others: Selenium and raw Playwright expose WebDriver properties by default; Scrapling's StealthyFetcher layer automatically injects stealth JavaScript and randomizes behavioral patterns, reducing detection likelihood by ~40-50% on sites using basic bot detection.

8

sandboxMCP Server52/100

via “browser-automation-with-chromium-integration”

All-in-One Sandbox for AI Agents that combines Browser, Shell, File, MCP and VSCode Server in a single Docker container.

Unique: Integrates Chromium directly into the sandbox container with shared file system access, allowing downloaded files and captured DOM state to be immediately available to other runtimes (shell, Jupyter, Node.js) without API calls or external storage. Supports both REST API and MCP protocol for agent integration.

vs others: Faster than cloud-based browser APIs (Browserless, Puppeteer Cloud) for multi-step workflows because file I/O and inter-component communication happen locally within the container; eliminates network round-trips for data sharing between browser and code execution.

9

UI-TARS-desktopAgent52/100

via “browser automation with intelligent element interaction and search integration”

The Open-Source Multimodal AI Agent Stack: Connecting Cutting-Edge AI Models and Agent Infra

Unique: Integrates browser automation with semantic search capabilities and VLM-based element identification, allowing agents to understand page content visually rather than relying solely on DOM selectors. The architecture supports both low-level Playwright APIs and high-level semantic interactions through the GUI agent.

vs others: More flexible than Selenium because it supports both headless and headed modes, modern async/await patterns, and integrates with VLM-based element understanding, versus Selenium which requires explicit waits and CSS/XPath selectors.

10

UI-TARS-desktopRepository51/100

via “browser-automation-with-headless-control-and-search-integration”

The Open-Source Multimodal AI Agent Stack: Connecting Cutting-Edge AI Models and Agent Infra

Unique: Integrates headless browser control (Puppeteer/Playwright) with a search system layer and agent-aware state feedback, providing agents with both visual and DOM-level understanding of web pages. Abstracts browser lifecycle management and search provider integration, allowing agents to reason about web content without explicit browser control code.

vs others: More capable than simple web search APIs because it combines search with interactive browser control and visual reasoning, enabling agents to navigate search results and interact with web pages, whereas standalone search tools only return snippets.

11

Azad Coder (GPT 5 & Claude)Extension50/100

via “browser automation with playwright integration”

Azad Coder: Your AI pair programmer in VSCode. Powered by Anthropic's Claude and GPT 5 !, it assists both beginners and pros in coding, debugging, and more. Create/edit files and execute commands with AI guidance. Perfect for no-coders to senior devs. Enjoy free credits to supercharge your coding ex

Unique: Integrates Playwright as a first-class tool in the agent's action space, allowing it to reason about browser state and adapt interactions based on observed DOM changes. Unlike static test scripts, the agent can handle dynamic content, retry failed interactions, and adjust selectors if page structure changes.

vs others: Provides autonomous browser automation with error recovery, whereas Selenium-based tools require explicit error handling and retry logic in test code.

12

Playwright MCP ServerMCP Server49/100

via “headless and headed browser mode switching”

** - An MCP server using Playwright for browser automation and webscrapping

Unique: Exposes Playwright's native headless/headed mode toggle through MCP tool parameters, allowing Claude to dynamically request visual debugging without server reconfiguration. Manages browser lifecycle for each mode transparently.

vs others: More flexible than static headless-only solutions; allows same MCP server to serve both automated and interactive debugging use cases by switching modes per request.

13

nanobrowserExtension47/100

via “dom-aware browser action execution with puppeteer anti-detection”

Open-Source Chrome extension for AI-powered web automation. Run multi-agent workflows using your own LLM API key. Alternative to OpenAI Operator.

Unique: Integrates Puppeteer directly into the Chrome extension background script (rather than spawning external processes) and applies anti-detection techniques at the action execution layer, making it harder to detect automation compared to naive Puppeteer scripts. The action system is extensible — new actions can be registered without modifying the Navigator agent.

vs others: More stealthy than raw Puppeteer scripts due to built-in anti-detection measures, and more flexible than Selenium by supporting modern browser APIs and JavaScript execution within the extension context.

14

mcp-smart-crawlerMCP Server40/100

via “playwright-based browser automation crawling”

A command-line tool acting as an MCP (ModelContextProtocol) server, using Playwright to crawl web content for AI models.

Unique: Leverages Playwright's multi-browser support (Chromium, Firefox, WebKit) with native MCP integration, providing browser-agnostic crawling without requiring separate Selenium or Puppeteer wrappers

vs others: More reliable for JavaScript-heavy sites than Cheerio/jsdom-based crawlers, and simpler to configure than raw Puppeteer with built-in MCP protocol handling

15

Comet MCP – Give Claude Code a browser that can clickMCP Server39/100

via “headless browser control with click-based interaction”

Hey HN,Claude Code is pretty agentic now. It writes scripts, calls APIs, uses CLIs. But when something requires actually clicking through a website, it stops and asks me to do it.Problem is, I'm often unfamiliar with these platforms myself. "Go to App Store Connect and generate a P8 key&qu

Unique: Exposes browser interactions as MCP tools rather than requiring Claude to write Puppeteer/Playwright code directly, abstracting away browser library complexity and allowing Claude to focus on task logic rather than API details.

vs others: Simpler for Claude to use than teaching it Playwright syntax because interactions are declarative tool calls rather than imperative code, reducing hallucination risk and improving reliability.

16

playwright-skillSkill38/100

via “visible browser execution with real-time observation”

Claude Code Skill for browser automation with Playwright. Model-invoked - Claude autonomously writes and executes custom automation for testing and validation.

Unique: Defaults to visible browser execution (headless: false) rather than headless mode, prioritizing developer observability and debugging over performance. This is a deliberate design choice documented in SKILL.md that differs from most automation frameworks which default to headless for efficiency.

vs others: Provides real-time visual debugging by default, whereas most Playwright setups default to headless mode and require explicit configuration to enable visibility, making debugging less accessible to developers unfamiliar with headless browser concepts.

17

@iflow-mcp/mbadkins-puppeteer-plus-martechMCP Server37/100

via “headless-browser-automation-with-puppeteer”

Puppeteer+ MarTech - Enhanced Puppeteer MCP server with specialized digital marketing analytics capabilities. This builds upon the official @modelcontextprotocol/server-puppeteer with tools for analyzing marketing technologies, analytics platforms, tag ma

Unique: Wraps Puppeteer's CDP bindings as an MCP server, allowing LLM agents to treat browser automation as a first-class tool with structured input/output schemas rather than requiring custom integration code

vs others: Tighter LLM integration than standalone Puppeteer scripts because MCP standardizes tool discovery and parameter validation, reducing boilerplate for multi-step browser workflows

18

@hisma/server-puppeteerMCP Server37/100

via “headless-browser-automation-via-mcp”

Fork and update (v0.6.5) of the original @modelcontextprotocol/server-puppeteer MCP server for browser automation using Puppeteer.

Unique: Wraps Puppeteer as an MCP server rather than a direct library, enabling LLM agents to invoke browser automation through standardized MCP tool/resource endpoints without language-specific SDK dependencies. Uses MCP's stdio transport for process-level isolation and multi-client support.

vs others: Provides standardized MCP interface for browser automation (vs. Puppeteer's direct Node.js API), making it compatible with any MCP client including Claude Desktop, while maintaining full Puppeteer capability surface.

19

firecrawl-mcpMCP Server37/100

via “javascript-rendered content scraping with headless browser support”

MCP server for Firecrawl — search, scrape, and interact with the web. Supports both cloud and self-hosted instances. Features include web search, scraping, page interaction, batch processing, and LLM-powered content analysis.

Unique: Abstracts headless browser complexity behind Firecrawl's backend, enabling MCP clients to scrape JavaScript-heavy sites without managing Puppeteer/Playwright locally. Supports wait conditions and session injection for handling dynamic and authenticated content.

vs others: Simpler than managing Puppeteer directly; more reliable than static HTML scraping for SPAs; avoids client-side browser overhead by delegating to cloud backend.

20

AnyCrawlMCP Server36/100

via “headless browser-based crawling with javascript execution”

** - [AnyCrawl](https://anycrawl.dev) MCP Server, Powerful web scraping and crawling for Cursor, Claude, and other LLM clients via the Model Context Protocol (MCP).

Unique: Integrates headless browser automation as an optional mode within the MCP scraping interface, allowing LLM clients to transparently upgrade from static parsing to dynamic rendering without changing the tool invocation pattern

vs others: More capable than static HTML parsing for modern web apps, but with explicit latency/resource tradeoffs exposed to the user; simpler than building custom Puppeteer scripts because browser lifecycle and wait conditions are abstracted

Top Matches

Also Known As

Company