Foundations of Data Intensive Applications: Large Scale Data Analytics under the Hood

  • 6h 9m
  • Saliya Ekanayake, Supun Kamburugamuve
  • John Wiley & Sons (US)
  • 2021

PEEK “UNDER THE HOOD” OF BIG DATA ANALYTICS

The world of big data analytics grows ever more complex. And while many people can work superficially with specific frameworks, far fewer understand the fundamental principles of large-scale, distributed data processing systems and how they operate. In Foundations of Data Intensive Applications: Large Scale Data Analytics under the Hood, renowned big-data experts and computer scientists Drs. Supun Kamburugamuve and Saliya Ekanayake deliver a practical guide to applying the principles of big data to software development for optimal performance.

The authors discuss foundational components of large-scale data systems and walk readers through the major software design decisions that define performance, application type, and usability. You???ll learn how to recognize problems in your applications resulting in performance and distributed operation issues, diagnose them, and effectively eliminate them by relying on the bedrock big data principles explained within.

Moving beyond individual frameworks and APIs for data processing, this book unlocks the theoretical ideas that operate under the hood of every big data processing system.

Ideal for data scientists, data architects, dev-ops engineers, and developers, Foundations of Data Intensive Applications: Large Scale Data Analytics under the Hood shows readers how to:

  • Identify the foundations of large-scale, distributed data processing systems
  • Make major software design decisions that optimize performance
  • Diagnose performance problems and distributed operation issues
  • Understand state-of-the-art research in big data
  • Explain and use the major big data frameworks and understand what underpins them
  • Use big data analytics in the real world to solve practical problems

In this Book

  • Introduction
  • Data Intensive Applications
  • Data and Storage
  • Computing Resources
  • Data Structures
  • Programming Models
  • Messaging
  • Parallel Tasks
  • Case Studies
  • Fault Tolerance
  • Performance and Productivity
SHOW MORE
FREE ACCESS