DataKitchen/dataops-testgen

DataOps Data Quality TestGen

DataOps Data Quality TestGen, or "TestGen" for short, helps you find data issues so you can alert your users and notify your suppliers. It provides simple, fast data quality test generation and execution through data profiling, new dataset screening and hygiene review, algorithmic generation of data quality validation tests, ongoing production testing of new data refreshes, and continuous anomaly monitoring of datasets. TestGen is part of DataKitchen's Open Source Data Observability.

Documentation

DataOps TestGen Overview

DataOps TestGen Documentation

Features

Interactive Product Tour

What does DataKitchen's DataOps Data Quality TestGen do? It helps you understand and find data issues in new data.

DataKitchen Open Source Data Quality TestGen Features - New Data

It constantly watches your data for data quality anomalies and lets you drill into problems.

DataKitchen Open Source Data Quality TestGen Features - Data Ingestion and Quality Testing

A single place to manage Data Quality across data sets, locations, and teams.

DataKitchen Open Source Data Quality TestGen Features - Single Place

Installation

The dk-installer program installs TestGen in either Docker or pip mode. For complete instructions, see the documentation:

What Next?

Getting started guide

We recommend you start by going through the Data Observability Overview Demo.

Support

For support requests, join the Data Observability Slack 👋 and post in the #support channel.

Connect to your database

Follow these instructions to improve the quality of data in your database.

Community

Talk and learn with other data practitioners who are building with DataKitchen. Share knowledge, get help, and contribute to our open-source project.

Join our community here:

Contributing

For details on contributing or running the project for development, check out our contributing guide.

License

DataKitchen's DataOps Data Quality TestGen is Apache 2.0 licensed.
