Overview
Data Validation Harness (DVH) is a comprehensive toolkit designed for a wide range of data testing and validation use cases. Its modular and loosely coupled design ensures adaptability to the data testing and reconciliation needs of organizations, regardless of scale, domain, or complexity. DVH also offers a fully configurable and customizable automation framework, identifying potential risks and challenges across data workflows.
**How Does it Work? **
For Glue-based pipelines, it offers interfaces to create data quality rules using DQDL (Data Quality Definition Language) within the Rule Builder section. Various rulesets can also be created in Glue Databrew.
It provides granular control over test coverage, allowing custom unit tests to be created using Deequ and Pytest libraries. These tests can be applied across S3, Athena, Redshift, and Glue pipelines.
Third-party frameworks such as Great Expectations, IceDQ, and Pytest can also be integrated with Amazon S3, Amazon Athena, Amazon Redshift, and AWS Glue for more customized testing. This may need customized deployment on ECR based containerized applications.
The following use cases are supported by DVH for data validation and testing:
1. Foundational Capabilities Data Validation Metadata Validation Data Integration Report Validation Test Suite Management Test Case Repository Code Coverage
2. Intermediate Capabilities Special Character Testing Data Reconciliation Data Transformation Testing Data Quality Assurance Dependency Impact Analysis Data Dictionary & Automated Documentation
3. Advanced Capabilities Data Encryption Testing Data Masking Testing Continuous Integration and Deployment (CI/CD) Test Data Generation AI/ML Testing
Highlights
- - Data Validation at Persistence Points: Validate data across multiple storage interfaces like AWS S3, AWS Redshift and AWS RDS. - Automated Metadata Validation: A metadata driven validation to ensure quality in the changing dimensions of data across pipelines and transformations leveraging AWS Glue catalog. - Comprehensive Test Framework: Includes test suites, repositories, and cases.
- - Granular Testing Features: Special character testing, data transformation, dependency impact analysis, automated documentation, and data dictionary using custom libraries and core AWS Services like Glue, Athena, Dynamodb and Quicksight. - Advanced Testing Capabilities: Supports data encryption, data masking, continuous integration and deployment, test data generation, and AI/ML testing.
Details
Unlock automation with AI agent solutions

Pricing
Custom pricing options
How can we make this page better?
Legal
Content disclaimer
Resources
Vendor resources
Support
Vendor support
Our committed 300+ certified AWS professionals are available in the U.S., Europe and India time zones to address any questions or provide the assistance you need. Thanks to the aggressive SLAs in place, we are here to help you achieve success in your quest to achieve operational efficiency and cost optimization on your AWS workloads.
Phone: 1 408 727 1100
Email:  AWS_Experts@apexon.comÂ
Contact Us URL: https://www.apexon.com/about/contact-us/Â
Software associated with this service
