Lyt når som helst, hvor som helst

Dyk ned i over 1 million e- og lydbøger samt podcasts.

  • Over 1 million titler
  • Eksklusive titler + Mofibo Originals
  • Download og nyd titler offline
  • Opsig når som helst
Prøv nu
DK - Details page - Device banner - 894x1036
Cover for Deequ Data Quality: Constraint‑Based Validation for Big Data Pipelines

Deequ Data Quality: Constraint‑Based Validation for Big Data Pipelines

Sprog
Engelsk
Format
Kategori

Fakta

"Deequ Data Quality: Constraint‑Based Validation for Big Data Pipelines"

Data quality failures in big data systems rarely look like broken code—they look like “successful” jobs shipping quietly corrupted tables. This book is for experienced data engineers, platform engineers, and analytics/ML practitioners who need enforceable guarantees, not ad‑hoc SQL spot checks. It treats data quality as an engineering discipline: explicit contracts, measurable signals, and operational response patterns that keep pipelines trustworthy without freezing delivery.

You’ll learn Deequ’s core model—metrics plus assertions—and how it maps onto Spark execution, cost, and reproducibility. The book goes deep on authoring production-grade constraints (completeness, uniqueness, validity, ranges, patterns, proportions), composing checks with stable thresholds, and turning failures into actionable diagnostics. It then operationalizes validation via VerificationSuite, showing how to plan analyzer execution, interpret VerificationResult edge cases, and implement gating strategies such as fail-fast, quarantine, and partial publishes. Profiling and constraint suggestion are covered as accelerators—followed by governance and rollout workflows that keep rules maintainable as data and business semantics evolve.

A strong working knowledge of Spark and DataFrames is assumed. Coverage includes longitudinal quality via metrics repositories, regression detection, and alerting, plus advanced patterns for partitioned/incremental data, late arrivals, custom analyzers, and real-world version compatibility across

© 2026 NobleTrex Press (E-bog): 6610001179250

Udgivelsesdato

E-bog: 9. marts 2026

Tags

    Vælg dit abonnement

    • Over 1 million titler

    • Download og nyd titler offline

    • Eksklusive titler + Mofibo Originals

    • Børnevenligt miljø (Kids Mode)

    • Det er nemt at opsige når som helst

    Den mest populære

    Premium

    For dig som lytter og læser ofte.

    129 kr. /måned

    • Eksklusivt indhold hver uge

    • Fri lytning til podcasts

    • Ingen binding

    Start tilbuddet

    Unlimited

    For dig som lytter og læser ubegrænset.

    159 kr. /måned

    • Eksklusivt indhold hver uge

    • Fri lytning til podcasts

    • Ingen binding

    Prøv gratis

    Family

    For dig som ønsker at dele historier med familien.

    Fra 179 kr. /måned

    • Fri lytning til podcasts

    • Kun 39 kr. pr. ekstra konto

    • Ingen binding

    Dig + 1 familiemedlem2 konti

    179 kr. /måned

    Prøv gratis

    Flex

    For dig som vil prøve Mofibo.

    89 kr. /måned

    • Gem op til 100 ubrugte timer

    • Eksklusivt indhold hver uge

    • Fri lytning til podcasts

    • Ingen binding

    Prøv gratis