Lyt når som helst, hvor som helst

Nyd den ubegrænsede adgang til tusindvis af spændende e- og lydbøger - helt gratis

  • Lyt og læs så meget du har lyst til
  • Opdag et kæmpe bibliotek fyldt med fortællinger
  • Eksklusive titler + Mofibo Originals
  • Opsig når som helst
Start tilbuddet
DK - Details page - Device banner - 894x1036

Learning Spark SQL

Sprog
Engelsk
Format
Kategori

Fakta

Design, implement, and deliver successful streaming applications, machine learning pipelines and graph applications using Spark SQL API

About This Book

• Learn about the design and implementation of streaming applications, machine learning pipelines, deep learning, and large-scale graph processing applications using Spark SQL APIs and Scala.

• Learn data exploration, data munging, and how to process structured and semi-structured data using real-world datasets and gain hands-on exposure to the issues and challenges of working with noisy and "dirty" real-world data.

• Understand design considerations for scalability and performance in web-scale Spark application architectures.

Who This Book Is For

If you are a developer, engineer, or an architect and want to learn how to use Apache Spark in a web-scale project, then this is the book for you. It is assumed that you have prior knowledge of SQL querying. A basic programming knowledge with Scala, Java, R, or Python is all you need to get started with this book.

What You Will Learn

• Familiarize yourself with Spark SQL programming, including working with DataFrame/Dataset API and SQL

• Perform a series of hands-on exercises with different types of data sources, including CSV, JSON, Avro, MySQL, and MongoDB

• Perform data quality checks, data visualization, and basic statistical analysis tasks

• Perform data munging tasks on publically available datasets

• Learn how to use Spark SQL and Apache Kafka to build streaming applications

• Learn key performance-tuning tips and tricks in Spark SQL applications

• Learn key architectural components and patterns in large-scale Spark SQL applications

In Detail

In the past year, Apache Spark has been increasingly adopted for the development of distributed applications. Spark SQL APIs provide an optimized interface that helps developers build such applications quickly and easily. However, designing web-scale production applications using Spark SQL APIs can be a complex task. Hence, understanding the design and implementation best practices before you start your project will help you avoid these problems.

This book gives an insight into the engineering practices used to design and build real-world, Spark-based applications. The book's hands-on examples will give you the required confidence to work on any future projects you encounter in Spark SQL.

It starts by familiarizing you with data exploration and data munging tasks using Spark SQL and Scala. Extensive code examples will help you understand the methods used to implement typical use-cases for various types of applications. You will get a walkthrough of the key concepts and terms that are common to streaming, machine learning, and graph applications. You will also learn key performance-tuning details including Cost Based Optimization (Spark 2.2) in Spark SQL applications. Finally, you will move on to learning how such systems are architected and deployed for a successful delivery of your project.

Style and approach

This book is a hands-on guide to designing, building, and deploying Spark SQL-centric production applications at scale.

© 2017 Packt Publishing (E-bog): 9781785887352

Release date

E-bog: 7. september 2017

Andre kan også lide...

  1. Designing Data-Intensive Applications: The Big Ideas Behind Reliable, Scalable, and Maintainable Systems Martin Kleppmann
  2. Everybody Lies: Big Data, New Data, and What the Internet Can Tell Us About Who We Really Are Seth Stephens-Davidowitz
  3. Data Science For Dummies: 2nd Edition Lillian Pierson
  4. AI and Machine Learning for On-Device Development: A Programmer's Guide, 1st Edition Laurence Moroney
  5. Big Data Revolution: What farmers, doctors and insurance agents teach us about discovering big data patterns Rob Thomas
  6. Building Analytics Teams: Harnessing analytics and artificial intelligence for business improvement Douglas B. Laney
  7. Python for Beginners: Comprehensive Guide to the Basics of Programming, Machine Learning, Data Science and Analysis with Python. Alex Campbell
  8. Python for Data Science: Clear and Complete Guide to Data Science and Analysis with Python. Alex Campbell
  9. Data Mesh: Comprehensive Guide on How to Become Truly Data-Driven Alex Campbell
  10. Probability, Markov Chains, Queues, and Simulation: The Mathematical Basis of Performance Modeling William J. Stewart
  11. Learning SQL: Generate, Manipulate, and Retrieve Data, 3rd Edition Alan Beaulieu
  12. Computational Thinking Peter J. Denning
  13. The AI Factor: How to Apply Artificial Intelligence and Use Big Data to Grow Your Business Exponentially Asha Saxena
  14. Docker Essentials: Simplifying containerization : A Beginner's Guide Mike Wilson
  15. Ultimate SwiftUI Handbook for iOS Developers Duong Dinh Bao Thang
  16. Competing in the Age of AI: Strategy and Leadership When Algorithms and Networks Run the World Karim R. Lakhani
  17. Hyperfocus: How to Work Less and Achieve More Chris Bailey
  18. Machine Learning: Deep Learning, Text Analytics, and Reinforcement Learning with Big Data David Feldspar
  19. 97 Principles for Software Architects: Axioms for software architecture and development written by industry practitioners Multiple Authors
  20. Robot-Proof: Higher Education in the Age of Artificial Intelligence Joseph E. Aoun
  21. Deep Learning: Guide to Machine Learning and Artificial Intelligence David Feldspar
  22. Building Microservices: Designing Fine-Grained Systems Sam Newman
  23. Talk Like TED: The 9 Public Speaking Secrets of the World's Top Minds Carmine Gallo
  24. Artificial Intelligence For Dummies Luca Massaron
  25. Humans Need Not Apply: A Guide to Wealth and Work in the Age of Artificial Intelligence Jerry Kaplan
  26. Get Smart: How to Think and Act Like the Most Successful and Highest-Paid People in Every Field Brian Tracy
  27. The 7 Habits of Highly Effective People Stephen R. Covey
  28. Deep Learning John D. Kelleher
  29. Homo Deus: A Brief History of Tomorrow Yuval Noah Harari
  30. Fundamentals of Software Architecture: An Engineering Approach Neal Ford

Vælg dit abonnement

  • Over 600.000 titler

  • Download og nyd titler offline

  • Eksklusive titler + Mofibo Originals

  • Børnevenligt miljø (Kids Mode)

  • Det er nemt at opsige når som helst

Flex

For dig som vil prøve Mofibo.

89 kr. /måned
  • 1 konto

  • 20 timer/måned

  • Gem op til 100 ubrugte timer

  • Eksklusivt indhold hver uge

  • Fri lytning til podcasts

  • Ingen binding

Prøv gratis
Den mest populære

Premium

For dig som lytter og læser ofte.

129 kr. /måned
  • 1 konto

  • 100 timer/måned

  • Eksklusivt indhold hver uge

  • Fri lytning til podcasts

  • Ingen binding

Start tilbuddet

Unlimited

For dig som lytter og læser ubegrænset.

149 kr. /måned
  • 1 konto

  • Ubegrænset adgang

  • Eksklusivt indhold hver uge

  • Fri lytning til podcasts

  • Ingen binding

Start tilbuddet

Family

For dig som ønsker at dele historier med familien.

Fra 179 kr. /måned
  • 2-6 konti

  • 100 timer/måned pr. konto

  • Fri lytning til podcasts

  • Kun 39 kr. pr. ekstra konto

  • Ingen binding

2 konti

179 kr. /måned
Prøv gratis