Data Quality And Profiling

1

Evidently AIRepository59/100

via “batch data quality profiling with 100+ built-in metrics”

ML/LLM monitoring — data drift, model quality, 100+ metrics, dashboards, test suites.

Unique: Implements a preset system where related metrics are bundled with sensible defaults and visualization templates, enabling rapid profiling without metric selection overhead. Presets are composable — users can mix preset metrics with custom metrics in a single report, balancing convenience with flexibility.

vs others: Faster than manual metric composition because presets eliminate threshold tuning; more comprehensive than simple profiling tools (pandas-profiling) because it includes ML-specific metrics (drift, model quality) and integrates with CI/CD testing.

2

FeatureformPlatform59/100

via “feature analysis and statistical profiling with drift baselines”

Virtual feature store on existing data infrastructure.

Unique: Provides automatic feature profiling and baseline tracking as built-in platform capabilities, enabling data quality monitoring without external tools, whereas most feature stores require integration with separate data profiling platforms like Great Expectations

vs others: Simpler setup than external profiling tools, but less comprehensive than dedicated data quality platforms and lacks advanced statistical testing

3

Julius AIProduct55/100

via “data quality assessment and anomaly detection”

AI data analysis — upload data, ask questions, automated visualization and statistical analysis.

Unique: Automatically detects multiple data quality issues (missing values, duplicates, outliers, type inconsistencies) using statistical methods and generates actionable remediation recommendations

vs others: More comprehensive than manual data inspection because it checks multiple quality dimensions simultaneously, while more accessible than specialized data quality tools (Talend, Great Expectations) because it requires no configuration

4

OpenMetadataRepository52/100

via “data quality profiling and automated test execution”

OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team collaboration.

Unique: Integrated data profiling and quality testing with historical trend tracking and event-driven notifications, executed directly against source databases via Airflow connectors rather than requiring separate data quality tools

vs others: More integrated than Great Expectations because quality tests are defined and executed within the metadata platform itself; more automated than manual SQL-based checks because tests are parameterized and scheduled

5

OpenMetadataPlatform43/100

via “data profiler with statistical analysis and anomaly detection”

OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team collaboration.

Unique: Integrates statistical profiling directly into the metadata catalog with historical tracking and anomaly detection, enabling data quality baselines to be understood and monitored as part of metadata management

vs others: Simpler than dedicated profiling tools (Great Expectations) but integrated with lineage and ownership; sufficient for teams wanting profiling as a metadata feature rather than standalone platform

6

TeradataMCP Server36/100

via “data quality assessment and validation tools”

** - A collection of tools for managing the platform, addressing data quality and reading and writing to [Teradata](https://www.teradata.com/) Database.

Unique: Implements data quality checks as composable MCP tools that can be chained together in AI agent workflows, with configurable rules and thresholds stored in YAML configuration files. Tools return structured quality metrics and anomaly reports suitable for downstream processing or visualization.

vs others: Provides more granular quality checks than generic data profiling tools by offering specialized tools for specific quality dimensions (nullness, uniqueness, type validity) that can be selectively invoked based on business requirements, and integrates directly with AI agents for automated quality monitoring.

7

JuliusProduct24/100

via “data profiling and quality assessment automation”

AI data processing, analysis, and visualization

Unique: Combines statistical profiling with heuristic quality rules to identify issues and automatically suggest remediation steps, providing both a quality scorecard and actionable recommendations

vs others: More comprehensive than manual data exploration and faster than writing custom profiling scripts, but less customizable than domain-specific data quality frameworks

8

Context DataPlatform20/100

via “data quality monitoring and validation”

Data Processing & ETL infrastructure for Generative AI applications

Unique: Incorporates a customizable dashboard for real-time monitoring of data quality metrics, allowing users to visualize data integrity at a glance.

vs others: More user-friendly than traditional data quality tools like Talend Data Quality, thanks to its intuitive dashboard and alerting system.

9

KnimeProduct

via “data-profiling-and-quality-assessment”

10

DataikuProduct

via “data-quality-and-profiling”

11

IllumexProduct

via “data-quality-validation-and-profiling”

12

DelveProduct

via “data-quality-validation”

13

VizlyProduct

via “data-quality-assessment-and-validation”

Unique: Automatically profiles data quality without requiring users to define validation rules, providing a quick assessment of data reliability before analysis

vs others: Faster than manual data inspection or custom validation scripts, but less comprehensive than dedicated data quality tools (Great Expectations, Soda) that support complex business rules and continuous monitoring

14

ManifoldProduct

via “data quality assessment and validation reporting”

15

Indicium TechProduct

via “data quality monitoring with anomaly detection and data profiling”

Unique: Combines statistical anomaly detection with data profiling and quality scorecards; integrates with the data transformation pipeline to prevent bad data from flowing downstream, and provides both real-time alerts and historical quality trends

vs others: More integrated than point solutions (Great Expectations, Soda) because it's built into the data platform; more automated than manual data quality checks because anomalies are detected continuously and alerts are triggered automatically

16

Julius AIProduct

via “data summary and profiling”

17

PhoenixProduct

via “data quality issue detection and reporting”

18

Qlik AutoMLProduct

via “data-quality-assessment”

19

DataRobotProduct

via “data-preparation-and-quality-assessment”

20

Tableau AIProduct

via “data-quality-assessment”

Top Matches

Also Known As

Company