Automating Data Quality Monitoring: Scaling Beyond Rules with Machine Learning - Jeremy Stanley

Automating Data Quality Monitoring

Scaling Beyond Rules with Machine Learning

By: Jeremy Stanley, Paige Schwartz

Paperback | 19 January 2024

RRP $125.50

$55.25

56% OFF

In Stock and Aims to ship in 1-2 business days

The world's businesses ingest a combined 2.5 quintillion bytes of data every day. But how much of this vast amount of data--used to build products, power AI systems, and drive business decisions--is poor quality or just plain bad? This practical book shows you how to ensure that the data your organization relies on contains only high-quality records.

Most data engineers, data analysts, and data scientists genuinely care about data quality, but they often don't have the time, resources, or understanding to create a data quality monitoring solution that succeeds at scale. In this book, Jeremy Stanley and Paige Schwartz from Anomalo explain how you can use automated data quality monitoring to cover all your tables efficiently, proactively alert on every category of issue, and resolve problems immediately.

This book will help you:
Learn why data quality is a business imperative
Understand and assess unsupervised learning models for detecting data issues
Implement notifications that reduce alert fatigue and let you triage and resolve issues quickly
Integrate automated data quality monitoring with data catalogs, orchestration layers, and BI and ML systems
Understand the limits of automated data quality monitoring and how to overcome them
Learn how to deploy and manage your monitoring solution at scale
Maintain automated data quality monitoring for the long term

More in Database Design & Theory

Python All-in-One For Dummies : 3rd Edition - John C. Shovic

RRP $74.95

$50.40

33% OFF
Information Modeling and Relational Databases : 2nd Edition - Terry Halpin
Data Visualisation : 2nd Edition - A Handbook for Data Driven Design - Andy Kirk
Time Series Databases - New Ways to Store and Access Data - Ellen, M.D. Friedman
Artificial Intelligence in Finance : A Python-Based Guide - Yves Hilpisch
Scaling Python with Dask : From Data Science to Machine Learning - Holden Karau