What can google-search do?

playwright-based google search execution with anti-bot evasion, mcp server integration for ai assistant search access, command-line interface with configurable search parameters, browser state persistence for captcha mitigation, structured result extraction with title, link, snippet fields, raw html retrieval and screenshot capture for custom analysis, configurable timeout and headless mode control, pino-based structured logging for debugging and monitoring, typescript type system with searchresponse and htmlresponse interfaces, multi-layered anti-detection strategy with user-agent and viewport randomization

google-search

MCP ServerFree

A Playwright-based Node.js tool that bypasses search engine anti-scraping mechanisms to execute Google searches. Local alternative to SERP APIs with MCP server integration.

Open Source

/ 100

10 capabilities

Capabilities10 decomposed

playwright-based google search execution with anti-bot evasion

Medium confidence

Executes real Google searches using Playwright browser automation while implementing multiple anti-detection strategies (user-agent rotation, viewport randomization, request throttling, browser state persistence) to bypass Google's anti-scraping mechanisms. The core googleSearch() function in src/search.ts orchestrates browser navigation, DOM waiting, and result extraction without relying on external SERP APIs, enabling unlimited searches without rate limits or API quotas.

Solves for

Execute Google searches programmatically without API keys or paid SERP servicesRetrieve real-time search results directly from Google's live indexIntegrate live search capability into AI agents and LLM applicationsAvoid SERP API costs and rate limiting for high-volume search workloads

Best for

AI agents and LLM applications requiring real-time search without external API dependencies

Developers building local-first search tools with no cloud infrastructure

Teams migrating from paid SERP APIs (SerpAPI, DataForSEO) to self-hosted alternatives

Requires

Node.js 18+

Playwright browser binaries (auto-installed via npm)

Writable filesystem for browser state persistence

Limitations

Subject to Google's dynamic anti-bot detection; may encounter CAPTCHAs or IP blocks on high-frequency searches

Browser state persistence (./browser-state.json) mitigates but doesn't eliminate CAPTCHA challenges

Single-threaded Playwright execution limits concurrent search parallelism

What makes it unique

Combines Playwright's headless browser automation with stateful browser persistence (saving/restoring cookies and session state) to minimize CAPTCHA triggers, unlike stateless SERP API calls. Implements multi-layered anti-detection (user-agent rotation, viewport randomization, request throttling) at the browser level rather than HTTP header manipulation alone.

vs alternatives

Eliminates SERP API costs and rate limits (SerpAPI charges $0.005-0.02 per search) while providing real-time results; slower than cached APIs but faster than manual browser interaction and suitable for agents requiring fresh data.

mcp server integration for ai assistant search access

Medium confidence

Wraps the core googleSearch() function as a Model Context Protocol (MCP) server using the MCP SDK, enabling AI assistants like Claude to invoke Google searches via standardized tool-calling interface. The mcp-server.ts component manages McpServer instance, StdioServerTransport for stdio communication, and a global persistent Playwright browser to serve multiple search requests from a single AI session without browser restart overhead.

Solves for

Enable Claude and other MCP-compatible AI assistants to perform real-time web searchesIntegrate live search as a native tool in AI agent workflows without custom API wrappersProvide AI models with fresh information beyond training data cutoffBuild AI agents that can research topics, verify facts, and retrieve current information

Best for

AI agent developers using Claude or other MCP-compatible LLMs

Teams building AI research assistants requiring real-time web search

Developers extending Claude's capabilities with local search tools

Requires

Node.js 18+

MCP SDK (installed via npm)

Claude or MCP-compatible AI assistant with MCP server support

Limitations

MCP server runs as separate process; requires stdio communication overhead (~10-50ms per request)

Global browser instance is shared across all concurrent MCP requests; no request isolation

No built-in request queuing; concurrent searches may block each other if Playwright browser is busy

What makes it unique

Implements MCP server using stdio transport with persistent global Playwright browser, avoiding browser restart overhead per request. Registers search as a native MCP tool with schema-based parameter validation, enabling seamless integration into Claude's tool-calling pipeline without custom wrapper code.

vs alternatives

Provides native MCP integration (vs. requiring custom API wrappers or HTTP servers) and maintains persistent browser state across multiple AI assistant requests, reducing latency compared to stateless SERP API integrations.

command-line interface with configurable search parameters

Medium confidence

Exposes search functionality via CLI using the commander package (src/index.ts) with options for result limit, timeout, headless mode toggle, browser state file path, and HTML extraction modes. Parses command-line arguments and invokes the core googleSearch() function with validated parameters, supporting both structured JSON output and raw HTML retrieval for downstream processing.

Solves for

Execute one-off Google searches from terminal without writing codeIntegrate Google search into shell scripts and CI/CD pipelinesDebug search behavior with headless mode disabled (visible browser)Extract raw HTML for custom parsing or analysis workflows

Best for

DevOps engineers integrating search into automation scripts

Researchers performing batch searches from command line

Developers debugging search extraction logic with visible browser

Requires

Node.js 18+

google-search package installed globally or via npx

Bash/shell environment (Windows requires WSL or Git Bash for bin/google-search script)

Limitations

CLI blocks until search completes; no async/streaming output

No built-in result pagination; --limit capped at practical browser rendering limits (~100 results)

Browser state file (./browser-state.json) is global; concurrent CLI invocations may conflict

What makes it unique

Uses commander package for declarative CLI argument parsing with built-in help/version generation. Supports both structured JSON output (for programmatic consumption) and raw HTML extraction (--get-html, --save-html), enabling flexible integration into shell pipelines and scripts.

vs alternatives

Simpler than writing custom Node.js scripts while more flexible than web-based search tools; enables shell integration without HTTP server overhead.

browser state persistence for captcha mitigation

Medium confidence

Saves and restores Playwright browser state (cookies, localStorage, sessionStorage) to a JSON file (default ./browser-state.json) between search invocations. This stateful approach preserves Google's session context and reduces CAPTCHA triggers by maintaining browser identity across multiple searches, unlike stateless HTTP clients that appear as fresh visitors to Google on each request.

Solves for

Reduce CAPTCHA challenges when performing multiple searches in sequenceMaintain browser session identity across CLI invocations and MCP requestsEnable long-running search workflows without manual CAPTCHA solvingPreserve Google's trust signals (cookies, session tokens) across tool invocations

Best for

Batch search workflows requiring 10+ searches without interruption

Long-running AI agents performing repeated searches

Automated research tools requiring sustained search access

Requires

Writable filesystem with persistent storage

Playwright browser instance to load/save state

Initial successful search to bootstrap state file

Limitations

State file is global and not thread-safe; concurrent invocations may corrupt state

Stale state file (>24 hours old) may be rejected by Google; requires periodic refresh

CAPTCHA mitigation is probabilistic; high-frequency searches (>50/hour) still trigger blocks

What makes it unique

Implements stateful browser persistence at the Playwright level (saving/restoring browser context) rather than HTTP-level cookie management. Preserves full browser state including localStorage and sessionStorage, maintaining Google's session context more effectively than header-based cookie jars.

vs alternatives

More effective CAPTCHA mitigation than stateless SERP APIs or simple cookie rotation; trades state file management complexity for sustained search access without manual intervention.

structured result extraction with title, link, snippet fields

Medium confidence

Parses Google search result DOM using Playwright's page.locator() and evaluate() methods to extract structured data (title, link, snippet) from each result element. Returns SearchResponse JSON array with typed fields, enabling downstream processing without regex parsing or HTML string manipulation. Extraction logic handles Google's dynamic DOM structure and adapts to layout variations.

Solves for

Extract search results as structured JSON for programmatic processingFeed search results into LLM context without manual HTML parsingBuild search result pipelines with type-safe data structuresAvoid brittle regex-based HTML parsing by using DOM selectors

Best for

AI agents consuming search results as structured context

Data pipelines requiring typed search result objects

Developers building search result aggregators or analyzers

Requires

Playwright browser instance with loaded Google search results page

JavaScript execution enabled in browser context

Limitations

Extraction depends on Google's DOM structure; layout changes may break selectors

No built-in fallback selectors; single DOM change can cause extraction failure

Snippet text is truncated by Google (typically 150-160 characters); full content unavailable

What makes it unique

Uses Playwright's page.locator() and evaluate() for DOM-aware extraction rather than regex or HTML parsing libraries. Returns typed SearchResponse objects with validated fields, enabling type-safe downstream processing in TypeScript/Node.js applications.

vs alternatives

More robust than regex-based extraction (handles DOM variations) and more maintainable than brittle CSS selector chains; provides structured output suitable for LLM context vs. raw HTML strings.

raw html retrieval and screenshot capture for custom analysis

Medium confidence

Provides --get-html flag to return raw HTML string of search results page and --save-html flag to capture and save full page screenshot/HTML to disk. Enables custom parsing, archival, or visual debugging workflows where structured extraction is insufficient. Playwright's page.content() and page.screenshot() methods handle full-page capture including dynamic content.

Solves for

Retrieve raw HTML for custom parsing or analysis beyond standard extractionArchive search results as HTML snapshots for historical comparisonDebug search result layout and DOM structure visuallyExtract data not available in structured SearchResponse (ads, featured snippets, knowledge panels)

Best for

Researchers analyzing search result layout and presentation

Developers building custom search result parsers

Compliance/audit workflows requiring search result archival

Requires

Playwright browser instance

Writable filesystem (for --save-html)

Sufficient disk space for HTML/screenshot storage

Limitations

--get-html returns raw HTML string; requires external parsing (cheerio, jsdom) for processing

--save-html creates large files (2-10MB per page); disk space overhead for bulk searches

Screenshots capture viewport size only (typically 1920x1080); full-page height may be truncated

What makes it unique

Offers dual output modes: structured extraction (SearchResponse) for programmatic use and raw HTML/screenshots for custom analysis. Playwright's page.content() captures dynamic content after JavaScript execution, unlike static HTML fetching.

vs alternatives

More flexible than structured-only extraction; enables custom parsing for edge cases (knowledge panels, ads, featured snippets) while maintaining option for clean structured output.

configurable timeout and headless mode control

Medium confidence

Exposes --timeout <milliseconds> (default 60000) and --no-headless CLI options to control Playwright browser behavior. Timeout parameter sets page navigation and element waiting limits; --no-headless disables headless mode to show visible browser window for debugging. Enables developers to tune performance vs. reliability and visually inspect search execution.

Solves for

Adjust search timeout for slow networks or high-latency environmentsDebug search failures by observing browser behavior visuallyOptimize performance by reducing timeout for fast networksDiagnose anti-bot detection triggers by watching browser interaction

Best for

Developers debugging search extraction logic

DevOps engineers tuning timeouts for production deployments

Network engineers testing search performance in constrained environments

Requires

Node.js 18+

Display server for --no-headless (X11/Wayland on Linux, native on macOS/Windows)

Limitations

Timeout applies globally to all page operations; no per-operation granularity

--no-headless requires display server (X11/Wayland on Linux, not available in Docker/CI)

Visible browser window slows execution (~2-3x slower than headless); unsuitable for production

What makes it unique

Exposes Playwright's timeout and headless mode as CLI flags, enabling non-developers to adjust behavior without code changes. --no-headless provides visual debugging capability absent in most SERP APIs.

vs alternatives

More flexible than fixed-timeout SERP APIs; enables visual debugging vs. blind API calls and supports network-specific tuning.

pino-based structured logging for debugging and monitoring

Medium confidence

Implements logging via Pino logger (src/logger.ts) with structured JSON output, enabling developers to track search execution flow, anti-bot detection events, and errors. Logs include timestamps, log levels, and contextual data suitable for parsing by log aggregation systems (ELK, Datadog, CloudWatch). Supports configurable log levels for production vs. development environments.

Solves for

Debug search failures and anti-bot detection triggersMonitor search performance and latency in productionAggregate logs from multiple search invocations for analysisDiagnose CAPTCHA and IP block events

Best for

DevOps engineers monitoring production search deployments

Developers debugging search failures

Teams using centralized log aggregation (ELK, Datadog)

Requires

Pino logger (installed via npm)

Log aggregation system for production (optional but recommended)

Limitations

Pino outputs JSON by default; requires log parsing for human readability

No built-in log rotation; requires external log management (logrotate, systemd)

Structured logging adds ~5-10ms overhead per log entry

What makes it unique

Uses Pino for structured JSON logging with minimal overhead, enabling log aggregation and analysis. Logs include search-specific context (query, result count, anti-bot events) suitable for monitoring search health.

vs alternatives

Structured JSON logging (vs. unstructured console.log) enables automated parsing and alerting; Pino's performance is optimized for high-volume logging.

typescript type system with searchresponse and htmlresponse interfaces

Medium confidence

Defines typed interfaces (src/types.ts) for SearchResponse (array of {title, link, snippet} objects) and HtmlResponse (raw HTML string) using TypeScript. Enables type-safe consumption of search results in TypeScript applications and provides IDE autocomplete for result fields. Type definitions document expected output structure and catch type errors at compile time.

Solves for

Enable type-safe result processing in TypeScript applicationsProvide IDE autocomplete for search result fieldsDocument expected output structure for API consumersCatch type errors at compile time vs. runtime

Best for

TypeScript developers building search integrations

Teams using strict TypeScript configurations (noImplicitAny, strictNullChecks)

Projects requiring type safety for search result processing

Requires

TypeScript 4.0+ (for type definitions)

TypeScript compiler or ts-node for type checking

Limitations

Type definitions only available in TypeScript; JavaScript consumers get no type checking

No runtime validation; types are erased at compile time

Type definitions must be manually updated if Google's result structure changes

What makes it unique

Provides explicit TypeScript interfaces for search results, enabling IDE autocomplete and compile-time type checking. Interfaces document expected output structure without runtime validation overhead.

vs alternatives

More maintainable than untyped JavaScript; enables IDE support and catches type errors early vs. runtime failures.

multi-layered anti-detection strategy with user-agent and viewport randomization

Medium confidence

Implements anti-bot evasion through user-agent rotation (randomizing User-Agent header), viewport randomization (varying browser window size), and request throttling (adding delays between navigation and interactions). These strategies operate at the Playwright browser level, making searches appear as legitimate user traffic rather than automated bots. Combines multiple evasion techniques to increase success rate against Google's detection heuristics.

Solves for

Bypass Google's anti-bot detection to execute searches successfullyReduce CAPTCHA and IP block frequencyAppear as legitimate user traffic to Google's detection systemsEnable sustained search workflows without manual intervention

Best for

High-volume search workflows requiring sustained access

AI agents performing repeated searches without manual CAPTCHA solving

Developers building search tools for production use

Requires

Playwright browser instance

Network access to google.com

Patience for throttled requests (anti-detection adds latency)

Limitations

Anti-detection is probabilistic; no guarantee against detection on any single search

User-agent rotation alone insufficient; Google detects via behavioral patterns (timing, request sequences)

Viewport randomization has minimal impact; Google detects via request metadata, not browser dimensions

What makes it unique

Combines multiple evasion techniques (user-agent rotation, viewport randomization, request throttling, state persistence) at the Playwright browser level rather than HTTP header manipulation alone. Stateful approach (preserving browser session) is more effective than stateless techniques.

vs alternatives

More sophisticated than simple user-agent rotation; combines behavioral mimicry (throttling) with session persistence. Less effective than proxy rotation but requires no external infrastructure.

Capabilities are decomposed by AI analysis. Each maps to specific user intents and improves with match feedback.

Related Artifactssharing capabilities

Artifacts that share capabilities with google-search, ranked by overlap. Discovered automatically through the match graph.

MCP Server24

Google PSE/CSE

** - A Model Context Protocol (MCP) server providing access to Google Programmable Search Engine (PSE) and Custom Search Engine (CSE).

mcp-native web search via google custom search apigoogle custom search api request translation and forwardingon-demand server instantiation via npx with environment-based configuration

3 shared capabilities

MCP Server23

WebSearch-MCP

** - Self-hosted Websearch API

mcp-compliant web search tool exposurereal-time web search with anti-bot bypass

2 shared capabilities

MCP Server20

Exa

** - Exa AI Search API

agent-compatible-search-integrationsemantic-web-search-via-mcp

2 shared capabilities

MCP Server23

Search1API

** - One API for Search, Crawling, and Sitemaps

multi-engine web search with filtering and time-range constraints

1 shared capability

MCP Server26

Scrapeless

** - Integrate real-time [Scrapeless](https://www.scrapeless.com/en) Google SERP(Google Search, Google Flight, Google Map, Google Jobs....) results into your LLM applications. This server enables dynamic context retrieval for AI workflows, chatbots, and research tools.

real-time google serp result retrieval via mcp protocol

1 shared capability

MCP Server23

Brave Search

** - Web and local search using Brave's Search API. Has been replaced by the [official server](https://github.com/brave/brave-search-mcp-server).

web-search-via-brave-api

1 shared capability

Best For

✓AI agents and LLM applications requiring real-time search without external API dependencies
✓Developers building local-first search tools with no cloud infrastructure
✓Teams migrating from paid SERP APIs (SerpAPI, DataForSEO) to self-hosted alternatives
✓AI agent developers using Claude or other MCP-compatible LLMs
✓Teams building AI research assistants requiring real-time web search
✓Developers extending Claude's capabilities with local search tools
✓DevOps engineers integrating search into automation scripts
✓Researchers performing batch searches from command line

Known Limitations

⚠Subject to Google's dynamic anti-bot detection; may encounter CAPTCHAs or IP blocks on high-frequency searches
⚠Browser state persistence (./browser-state.json) mitigates but doesn't eliminate CAPTCHA challenges
⚠Single-threaded Playwright execution limits concurrent search parallelism
⚠No built-in proxy rotation; requires external proxy infrastructure for large-scale scraping
⚠Slower than cached SERP APIs (Playwright startup + navigation overhead ~3-5 seconds per search)
⚠MCP server runs as separate process; requires stdio communication overhead (~10-50ms per request)

Requirements

Node.js 18+Playwright browser binaries (auto-installed via npm)Writable filesystem for browser state persistenceNetwork access to google.com (not blocked by corporate firewall/ISP)MCP SDK (installed via npm)Claude or MCP-compatible AI assistant with MCP server supportProper MCP server configuration in assistant's config file (e.g., claude_desktop_config.json)google-search package installed globally or via npx

Input / Output

Accepts: search query (string), optional: limit (number, default 10), optional: timeout (milliseconds, default 60000), optional: language/locale parameter, MCP tool call with search query parameter, optional: limit parameter, optional: timeout parameter, search query (positional argument), optional: --limit <number> (default 10), optional: --timeout <milliseconds> (default 60000), optional: --no-headless flag, optional: --state-file <path> (default ./browser-state.json), optional: --get-html flag, optional: --save-html flag, state-file path (--state-file CLI option, default ./browser-state.json), Playwright page object with rendered Google search results, --get-html flag (returns HTML string), --save-html flag (saves to disk), --timeout <milliseconds> (default 60000), --no-headless flag (boolean), search execution events (navigation, DOM parsing, extraction), TypeScript type definitions (SearchResponse, HtmlResponse), search query and parameters

Produces: SearchResponse JSON with title, link, snippet fields, raw HTML of search results page (--get-html flag), screenshot/saved HTML (--save-html flag), MCP tool result with SearchResponse JSON, structured text representation of search results for LLM consumption, JSON (stdout) with SearchResponse structure, raw HTML string (with --get-html), HTML file saved to disk (with --save-html), JSON file containing browser state (cookies, storage, session data), SearchResponse[] JSON array with {title, link, snippet} objects, raw HTML string (--get-html), HTML file on disk (--save-html), PNG screenshot (--save-html with screenshot option), search results (on success) or timeout error (on failure), JSON log entries (stdout/stderr) with timestamp, level, message, context, typed SearchResponse[] or HtmlResponse objects, search results (on successful evasion) or CAPTCHA/block error (on detection)

UnfragileRank

Adoption21%(30% weight)

Quality29%(25% weight)

Ecosystem55%(25% weight)

Match Graph10%(15% weight)

Freshness75%(5% weight)

UnfragileRank is computed from adoption signals, documentation quality, ecosystem connectivity, match graph feedback, and freshness. No artifact can pay for a higher rank.

Type: MCP Server

10 capabilities

Visit google-search→

Repository Details

586

Stars

Forks

TypeScript

Language

Topics

aigoogle-searchllmmcp-serverweb-scraping

Last commit: Apr 6, 2025

About

A Playwright-based Node.js tool that bypasses search engine anti-scraping mechanisms to execute Google searches. Local alternative to SERP APIs with MCP server integration.

Alternatives to google-search

wink-embeddings-sg-100d24Repository

100-dimensional English word embeddings for wink-nlp

Compare →

voyage-ai-provider30API

Voyage AI Provider for running Voyage AI models with Vercel AI SDK

Compare →

@vibe-agent-toolkit/rag-lancedb27Agent

LanceDB implementation of RAG interfaces for vibe-agent-toolkit

Compare →

vectra41Repository

A lightweight, file-backed vector database for Node.js and browsers with Pinecone-compatible filtering and hybrid BM25 search.

Compare →

Are you the builder of google-search?

Claim this artifact to get a verified badge, access match analytics, see which intents users search for, and manage your listing.

Claim this artifact →Verification via email

Get the weekly brief

New tools, rising stars, and what's actually worth your time. No spam.

Data Sources

mcp registry

Looking for something else?

Search →

Capabilities10 decomposed

playwright-based google search execution with anti-bot evasion

Medium confidence

Solves for

Best for

AI agents and LLM applications requiring real-time search without external API dependencies

Developers building local-first search tools with no cloud infrastructure

Teams migrating from paid SERP APIs (SerpAPI, DataForSEO) to self-hosted alternatives

Requires

Node.js 18+

Playwright browser binaries (auto-installed via npm)

Writable filesystem for browser state persistence

Limitations

Subject to Google's dynamic anti-bot detection; may encounter CAPTCHAs or IP blocks on high-frequency searches

Browser state persistence (./browser-state.json) mitigates but doesn't eliminate CAPTCHA challenges

Single-threaded Playwright execution limits concurrent search parallelism

What makes it unique

vs alternatives

mcp server integration for ai assistant search access

Medium confidence

Solves for

Best for

AI agent developers using Claude or other MCP-compatible LLMs

Teams building AI research assistants requiring real-time web search

Developers extending Claude's capabilities with local search tools

Requires

Node.js 18+

MCP SDK (installed via npm)

Claude or MCP-compatible AI assistant with MCP server support

Limitations

MCP server runs as separate process; requires stdio communication overhead (~10-50ms per request)

Global browser instance is shared across all concurrent MCP requests; no request isolation

No built-in request queuing; concurrent searches may block each other if Playwright browser is busy

What makes it unique

vs alternatives

command-line interface with configurable search parameters

Medium confidence

Solves for

Best for

DevOps engineers integrating search into automation scripts

Researchers performing batch searches from command line

Developers debugging search extraction logic with visible browser

Requires

Node.js 18+

google-search package installed globally or via npx

Bash/shell environment (Windows requires WSL or Git Bash for bin/google-search script)

Limitations

CLI blocks until search completes; no async/streaming output

No built-in result pagination; --limit capped at practical browser rendering limits (~100 results)

Browser state file (./browser-state.json) is global; concurrent CLI invocations may conflict

What makes it unique

vs alternatives

Simpler than writing custom Node.js scripts while more flexible than web-based search tools; enables shell integration without HTTP server overhead.

browser state persistence for captcha mitigation

Medium confidence

Solves for

Best for

Batch search workflows requiring 10+ searches without interruption

Long-running AI agents performing repeated searches

Automated research tools requiring sustained search access

Requires

Writable filesystem with persistent storage

Playwright browser instance to load/save state

Initial successful search to bootstrap state file

Limitations

State file is global and not thread-safe; concurrent invocations may corrupt state

Stale state file (>24 hours old) may be rejected by Google; requires periodic refresh

CAPTCHA mitigation is probabilistic; high-frequency searches (>50/hour) still trigger blocks

What makes it unique

vs alternatives

More effective CAPTCHA mitigation than stateless SERP APIs or simple cookie rotation; trades state file management complexity for sustained search access without manual intervention.

structured result extraction with title, link, snippet fields

Medium confidence

Solves for

Best for

AI agents consuming search results as structured context

Data pipelines requiring typed search result objects

Developers building search result aggregators or analyzers

Requires

Playwright browser instance with loaded Google search results page

JavaScript execution enabled in browser context

Limitations

Extraction depends on Google's DOM structure; layout changes may break selectors

No built-in fallback selectors; single DOM change can cause extraction failure

Snippet text is truncated by Google (typically 150-160 characters); full content unavailable

What makes it unique

vs alternatives

More robust than regex-based extraction (handles DOM variations) and more maintainable than brittle CSS selector chains; provides structured output suitable for LLM context vs. raw HTML strings.

raw html retrieval and screenshot capture for custom analysis

Medium confidence

Solves for

Best for

Researchers analyzing search result layout and presentation

Developers building custom search result parsers

Compliance/audit workflows requiring search result archival

Requires

Playwright browser instance

Writable filesystem (for --save-html)

Sufficient disk space for HTML/screenshot storage

Limitations

--get-html returns raw HTML string; requires external parsing (cheerio, jsdom) for processing

--save-html creates large files (2-10MB per page); disk space overhead for bulk searches

Screenshots capture viewport size only (typically 1920x1080); full-page height may be truncated

What makes it unique

vs alternatives

More flexible than structured-only extraction; enables custom parsing for edge cases (knowledge panels, ads, featured snippets) while maintaining option for clean structured output.

configurable timeout and headless mode control

Medium confidence

Solves for

Best for

Developers debugging search extraction logic

DevOps engineers tuning timeouts for production deployments

Network engineers testing search performance in constrained environments

Requires

Node.js 18+

Display server for --no-headless (X11/Wayland on Linux, native on macOS/Windows)

Limitations

Timeout applies globally to all page operations; no per-operation granularity

--no-headless requires display server (X11/Wayland on Linux, not available in Docker/CI)

Visible browser window slows execution (~2-3x slower than headless); unsuitable for production

What makes it unique

vs alternatives

More flexible than fixed-timeout SERP APIs; enables visual debugging vs. blind API calls and supports network-specific tuning.

pino-based structured logging for debugging and monitoring

Medium confidence

Solves for

Best for

DevOps engineers monitoring production search deployments

Developers debugging search failures

Teams using centralized log aggregation (ELK, Datadog)

Requires

Pino logger (installed via npm)

Log aggregation system for production (optional but recommended)

Limitations

Pino outputs JSON by default; requires log parsing for human readability

No built-in log rotation; requires external log management (logrotate, systemd)

Structured logging adds ~5-10ms overhead per log entry

What makes it unique

vs alternatives

Structured JSON logging (vs. unstructured console.log) enables automated parsing and alerting; Pino's performance is optimized for high-volume logging.

typescript type system with searchresponse and htmlresponse interfaces

Medium confidence

Solves for

Best for

TypeScript developers building search integrations

Teams using strict TypeScript configurations (noImplicitAny, strictNullChecks)

Projects requiring type safety for search result processing

Requires

TypeScript 4.0+ (for type definitions)

TypeScript compiler or ts-node for type checking

Limitations

Type definitions only available in TypeScript; JavaScript consumers get no type checking

No runtime validation; types are erased at compile time

Type definitions must be manually updated if Google's result structure changes

What makes it unique

vs alternatives

More maintainable than untyped JavaScript; enables IDE support and catches type errors early vs. runtime failures.

multi-layered anti-detection strategy with user-agent and viewport randomization

Medium confidence

Solves for

Best for

High-volume search workflows requiring sustained access

AI agents performing repeated searches without manual CAPTCHA solving

Developers building search tools for production use

Requires

Playwright browser instance

Network access to google.com

Patience for throttled requests (anti-detection adds latency)

Limitations

Anti-detection is probabilistic; no guarantee against detection on any single search

User-agent rotation alone insufficient; Google detects via behavioral patterns (timing, request sequences)

Viewport randomization has minimal impact; Google detects via request metadata, not browser dimensions

What makes it unique

vs alternatives

More sophisticated than simple user-agent rotation; combines behavioral mimicry (throttling) with session persistence. Less effective than proxy rotation but requires no external infrastructure.

Capabilities are decomposed by AI analysis. Each maps to specific user intents and improves with match feedback.

Alternatives to google-search

wink-embeddings-sg-100d24Repository

100-dimensional English word embeddings for wink-nlp

Compare →

voyage-ai-provider30API

Voyage AI Provider for running Voyage AI models with Vercel AI SDK

Compare →

@vibe-agent-toolkit/rag-lancedb27Agent

LanceDB implementation of RAG interfaces for vibe-agent-toolkit

Compare →

vectra41Repository

A lightweight, file-backed vector database for Node.js and browsers with Pinecone-compatible filtering and hybrid BM25 search.

Compare →

google-search

Capabilities10 decomposed

playwright-based google search execution with anti-bot evasion

mcp server integration for ai assistant search access

command-line interface with configurable search parameters

browser state persistence for captcha mitigation

structured result extraction with title, link, snippet fields

raw html retrieval and screenshot capture for custom analysis

configurable timeout and headless mode control

pino-based structured logging for debugging and monitoring

typescript type system with searchresponse and htmlresponse interfaces

multi-layered anti-detection strategy with user-agent and viewport randomization

Related Artifactssharing capabilities

Google PSE/CSE

WebSearch-MCP

Exa

Search1API

Scrapeless

Brave Search

Best For

Known Limitations

Requirements

Input / Output

UnfragileRank

Repository Details

About

Categories

Alternatives to google-search

Are you the builder of google-search?

Get the weekly brief

Data Sources

google-search

Capabilities10 decomposed

playwright-based google search execution with anti-bot evasion

mcp server integration for ai assistant search access

command-line interface with configurable search parameters

browser state persistence for captcha mitigation

structured result extraction with title, link, snippet fields

raw html retrieval and screenshot capture for custom analysis

configurable timeout and headless mode control

pino-based structured logging for debugging and monitoring

typescript type system with searchresponse and htmlresponse interfaces

multi-layered anti-detection strategy with user-agent and viewport randomization

Related Artifactssharing capabilities

Google PSE/CSE

WebSearch-MCP

Exa

Search1API

Scrapeless

Brave Search

Best For

Known Limitations

Requirements

Input / Output

UnfragileRank

Repository Details

About

Categories

Alternatives to google-search

Are you the builder of google-search?

Get the weekly brief

Data Sources