time-series schema exploration via mcp protocol
Exposes Hydrolix time-series datalake schema metadata (tables, columns, data types, partitioning) through the Model Context Protocol (MCP), enabling LLM agents to discover and understand available datasets without direct database access. Implements MCP resource and tool handlers that translate Hydrolix catalog APIs into standardized schema introspection endpoints, allowing Claude and other MCP-compatible clients to query table structures, column definitions, and temporal indexing strategies.
Unique: Bridges Hydrolix time-series catalog directly into MCP protocol layer, allowing LLMs to introspect columnar time-series schemas without SQL knowledge; uses MCP resource handlers to expose catalog as queryable endpoints rather than requiring direct API calls
vs alternatives: Tighter integration with Hydrolix-specific temporal metadata (partition keys, retention policies) than generic database MCP servers, enabling smarter query planning for time-series workloads
natural language to hydrolix sql query translation
Translates natural language queries from LLM agents into Hydrolix-compatible SQL, leveraging schema context from the datalake to construct syntactically correct and optimized queries. The MCP server acts as a query builder interface that accepts natural language intent, validates it against discovered schema, and generates executable SQL targeting Hydrolix's columnar time-series engine, including proper time-range filtering and aggregation syntax.
Unique: Generates Hydrolix-specific SQL dialect (time-bucketing functions, columnar aggregations, partition pruning) rather than generic SQL; integrates schema context directly into code generation to ensure type-safe and partition-aware queries
vs alternatives: Produces Hydrolix-optimized queries with automatic partition key inference, whereas generic SQL generators produce dialect-agnostic SQL that may not leverage Hydrolix's time-series indexing
query execution and result streaming via mcp
Executes validated Hydrolix SQL queries through the MCP protocol and streams results back to LLM agents in structured format (JSON, CSV, or Arrow). The server manages query lifecycle (submission, polling, result pagination) and handles Hydrolix-specific execution semantics like time-range pruning and columnar result formatting, abstracting away connection pooling and error handling from the client.
Unique: Manages Hydrolix query lifecycle (async submission, polling, result pagination) within MCP protocol layer, hiding connection complexity and providing streaming results without requiring client-side Hydrolix SDK
vs alternatives: Abstracts Hydrolix async query semantics into synchronous MCP tool calls, whereas direct SDK usage requires explicit polling loops and connection management
time-series aggregation and bucketing helpers
Provides MCP tools for common time-series operations (time-bucketing, downsampling, rolling aggregations) that generate Hydrolix-compatible SQL fragments. These helpers encapsulate Hydrolix-specific temporal functions (e.g., DATE_TRUNC, INTERVAL arithmetic) and allow LLM agents to compose complex time-series queries without deep SQL knowledge, automatically handling timezone and precision considerations.
Unique: Encapsulates Hydrolix temporal function syntax (DATE_TRUNC, INTERVAL) into reusable MCP tools, allowing LLMs to compose time-series queries without learning Hydrolix SQL dialect
vs alternatives: Provides higher-level temporal abstractions than raw SQL generation, reducing LLM reasoning complexity for common time-series patterns
multi-table join and correlation analysis
Enables LLM agents to discover and construct joins across multiple Hydrolix tables based on schema relationships and common column patterns. The MCP server analyzes table metadata to identify potential join keys (matching column names, types, and temporal alignment) and generates join queries that respect Hydrolix's columnar architecture and time-series semantics, including automatic time-range alignment for correlated datasets.
Unique: Automatically discovers join relationships by analyzing schema metadata and temporal alignment, generating time-series-aware joins that respect Hydrolix columnar semantics rather than requiring explicit join specifications
vs alternatives: Infers join keys from schema patterns and temporal properties, whereas generic query builders require explicit join specifications
retention policy and data lifecycle awareness
Exposes Hydrolix data retention policies and lifecycle metadata through MCP, allowing LLM agents to understand data availability windows and make informed decisions about query time-ranges. The server queries Hydrolix catalog for retention settings, data age, and archival status, enabling agents to warn about stale data or suggest appropriate time-windows for analysis.
Unique: Integrates Hydrolix retention policies into LLM decision-making, allowing agents to validate query feasibility against data lifecycle constraints rather than discovering unavailable data at query time
vs alternatives: Proactively surfaces retention metadata to LLM agents, preventing failed queries and enabling intelligent time-range selection, whereas generic query tools fail silently on out-of-retention queries
performance metrics and query optimization hints
Collects and exposes Hydrolix query performance metrics (execution time, data scanned, partition pruning effectiveness) through MCP, enabling LLM agents to understand query cost and make optimization decisions. The server tracks query performance patterns and suggests optimizations (e.g., narrower time-ranges, pre-aggregation, partition key usage) based on historical execution data and Hydrolix-specific optimization opportunities.
Unique: Analyzes Hydrolix-specific performance patterns (partition pruning, columnar scan efficiency) and surfaces optimization opportunities to LLM agents, enabling cost-aware query generation rather than blind query execution
vs alternatives: Provides Hydrolix-specific optimization hints (partition key usage, time-range narrowing) based on columnar execution patterns, whereas generic query optimizers lack time-series-specific insights