Data Cleaning

Language
English
Format
Category

Facts

This book gives an overview of the end-to-end data cleaning process. Data quality is one of the most important problems in data management, since dirty data often leads to inaccurate analytics results and incorrect business decisions. Poor data across businesses and the U.S. government is reported to cost trillions of dollars a year, and multiple surveys show that dirty data is the most common barrier faced by data scientists. Not surprisingly, developing effective and efficient data cleaning solutions is challenging and is rife with deep theoretical and engineering problems.

Data cleaning is used here to refer to all kinds of tasks and activities that detect and repair errors in data. Rather than focus on a particular data cleaning task, the book describes various error detection and repair methods, and attempts to anchor these proposals with multiple taxonomies and views. Specifically, it covers four of the most common and important data cleaning tasks: outlier detection, data transformation, error repair (including imputing missing values), and data deduplication. Furthermore, due to the increasing popularity and applicability of machine learning techniques, it includes a chapter that specifically explores how machine learning techniques are used for data cleaning, and how data cleaning is used to improve machine learning models.

This book is intended to serve as a useful reference for researchers and practitioners who are interested in the area of data quality and data cleaning. It can also be used as a textbook for a graduate course. Although we aim at covering state-of-the-art algorithms and techniques, we recognize that data cleaning is still an active field of research and therefore provide future directions of research whenever appropriate.
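To make the four tasks named in the description concrete, here is a minimal Python sketch using only the standard library. It is not taken from the book: the function names, the median/MAD outlier rule with its 3.5 cutoff, the median imputation, and the exact-match deduplication on a normalized key are all illustrative assumptions, chosen as simple stand-ins for the far richer methods the book surveys.

```python
from statistics import median


def detect_outliers(values, threshold=3.5):
    """Outlier detection: return indices whose modified z-score exceeds threshold.

    Uses the median and the median absolute deviation (MAD), which are less
    distorted by extreme values than the mean and standard deviation.
    The threshold of 3.5 is an assumed, conventional default.
    """
    present = [v for v in values if v is not None]
    med = median(present)
    mad = median(abs(v - med) for v in present)
    if mad == 0:
        return []  # no spread to measure against; flag nothing
    return [
        i
        for i, v in enumerate(values)
        if v is not None and 0.6745 * abs(v - med) / mad > threshold
    ]


def normalize_name(name):
    """Data transformation: trim, collapse internal whitespace, lowercase."""
    return " ".join(name.split()).lower()


def impute_missing(values):
    """Error repair: replace None with the median of the observed values."""
    present = [v for v in values if v is not None]
    fill = median(present)
    return [fill if v is None else v for v in values]


def deduplicate(records, key):
    """Deduplication: keep the first record seen for each normalized key."""
    seen = set()
    unique = []
    for record in records:
        normalized = normalize_name(record[key])
        if normalized not in seen:
            seen.add(normalized)
            unique.append(record)
    return unique


if __name__ == "__main__":
    ages = [31, 29, None, 30, 32, 240]  # hypothetical sample data
    people = [
        {"name": "Ada Lovelace"},
        {"name": "  ada   lovelace "},
        {"name": "Alan Turing"},
    ]
    print(detect_outliers(ages))
    print(impute_missing(ages))
    print(deduplicate(people, "name"))
```

In practice the order of the steps matters: here the outlier at index 5 would still influence nothing, because the median is robust, but a mean-based imputation run before outlier removal would be skewed by it.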

© 2019 ACM Books (E-book): 9781450371544

Release date

E-book: June 18, 2019

Others also like...

  1. Data Visualization Guide: Clear Guide to Data Science and Visualization, by Alex Campbell
  2. Data Mesh: What Is Data Mesh? Principles of Data Mesh Architecture, by Brian Murray
  3. Data Science, by John D. Kelleher
  4. Data Management, by Introbooks Team
  5. Data Science for Business: What You Need to Know about Data Mining and Data-Analytic Thinking, by Foster Provost
  6. Data as a Product: How to Provide the Data That the Company Needs, by Brian Murray
  7. Data Visualization: Ultimate Guide to Data Mining and Visualization, by Alex Campbell
  8. Data Analysis, by Introbooks Team
  9. Data Feminism, by Lauren F. Klein
  10. Big Data Analytics: Turning Big Data into Big Money, by Frank J. Ohlhorst
  11. Data Mesh: Comprehensive Guide on How to Become Truly Data-Driven, by Alex Campbell
  12. Learning from Data, by Introbooks Team
  13. Big Data: How New Data and the Internet Help Us Analyze Everything, by David Feldspar
  14. Win with Advanced Business Analytics: Creating Business Value from Your Data, by Jean Paul Isson
  15. Big Data: Mining and Measuring Big Data for Information and Intelligence, by David Feldspar
  16. Fundamentals of Data Engineering: Plan and Build Robust Data Systems, by Matt Housley
  17. Dark Data: Why What You Don't Know Matters, by David J. Hand
  18. Data Analyses: Detailed, Scientific, and Business-Oriented Data Reading Skills, by Benjamin Farrar
  19. Data Mesh: Delivering Data-Driven Value at Scale, by Zhamak Dehghani
  20. Everybody Lies: Big Data, New Data, and What the Internet Can Tell Us About Who We Really Are, by Seth Stephens-Davidowitz
  21. Data Smart: Using Data Science to Transform Information into Insight, by John W. Foreman
  22. Big Data, Big Analytics: Emerging Business Intelligence and Analytic Trends for Today's Businesses, by Michele Chambers
  23. Computational Thinking, by Peter J. Denning
  24. Data Engineering with Python for Beginners: Comprehensive Clear Guide, by Simon Winston
  25. Data Quality, by Prashanth Southekal
  26. Scary Smart: The Future of Artificial Intelligence and How You Can Save Our World, by Mo Gawdat
  27. Exploratory Data Analysis: Uncovering Insights from Your Data, by Daniel Garfield
  28. Data Lake: Unleashing the Power of Data. Exploring the Depths of the Data Lake, by Daniel Garfield
  29. Competing in the Age of AI: Strategy and Leadership When Algorithms and Networks Run the World, by Karim R. Lakhani
  30. Atlas of AI: Power, Politics, and the Planetary Costs of Artificial Intelligence, by Kate Crawford
  31. Data Preprocessing: Optimizing Data Quality and Structure for Effective Analysis and Machine Learning, by Brian Murray
  32. Solutions Architect's Handbook: Kick-start your career as a solutions architect by learning architecture design principles and strategies, by Saurabh Shrivastava
  33. Database Internals: A Deep Dive into How Distributed Data Systems Work, 1st Edition, by Alex Petrov
  34. How Smart Machines Think, by Sean Gerrish
  35. Probably the Best Book on Statistics Ever Written: How to Beat the Odds and Make Better Decisions, by Haim Shapira
  36. LEAN: Ultimate Collection: Lean Startup, Lean Analytics, Lean Enterprise, Kaizen, Six Sigma, Agile Project Management, Kanban, Scrum, by Jason Bennett, Jennifer Bowen
  37. Standard Deviations: Flawed Assumptions, Tortured Data, and Other Ways to Lie with Statistics, by Gary Smith
  38. Working with AI, by Thomas H. Davenport
  39. The Future Is Faster Than You Think: How Converging Technologies Are Transforming Business, Industries, and Our Lives, by Steven Kotler
  40. Distrust: Big Data, Data-Torturing, and the Assault on Science, by Gary Smith
  41. Good Company, by Arthur M. Blank
  42. Lean Analytics: Focus On Data That Really Matter For Your Business, by Harry Altman
  43. Spatiotemporal Data Analysis, by Gidon Eshel
  44. How Everything Became War and the Military Became Everything: Tales from the Pentagon, by Rosa Brooks
  45. Mixed Problem Solving Methodology: The skill that changes your life, by Rocco Mela
