Data Cleaning
by Ihab F. Ilyas
English | 2019 | ISBN: 1450371523 | 285 Pages | PDF | 14 MB
by Ihab F. Ilyas
English | 2019 | ISBN: 1450371523 | 285 Pages | PDF | 14 MB
Data quality is one of the most important problems in data management, since dirty data often leads to inaccurate data analytics results and incorrect business decisions. Poor data across businesses and the U.S. government are reported to cost trillions of dollars a year. Multiple surveys show that dirty data is the most common barrier faced by data scientists. Not surprisingly, developing effective and efficient data cleaning solutions is challenging and is rife with deep theoretical and engineering problems.