Amazon no longer offers textbook rentals. We do!

We're the #1 textbook rental company. Let us show you why.

Foundations of Data Intensive Applications: Large Scale Data Analytics under the Hood

by Supun Kamburugamuve; Saliya Ekanayake
  • ISBN13: 9781119713029
  • ISBN10: 1119713021

  • Edition: 1st
  • Format: Paperback
  • Copyright: 2021-09-08
  • Publisher: Wiley
List Price: $55.00 Save up to $17.87
  • Buy New
    $53.35
    Free Shipping

    USUALLY SHIPS IN 3-4 BUSINESS DAYS

Summary

PEEK “UNDER THE HOOD” OF BIG DATA ANALYTICS

The world of big data analytics grows ever more complex. And while many people can work superficially with specific frameworks, far fewer understand the fundamental principles of large-scale, distributed data processing systems and how they operate. In Foundations of Data Intensive Applications: Large Scale Data Analytics under the Hood, renowned big-data experts and computer scientists Drs. Supun Kamburugamuve and Saliya Ekanayake deliver a practical guide to applying the principles of big data to software development for optimal performance.

The authors discuss foundational components of large-scale data systems and walk readers through the major software design decisions that define performance, application type, and usability. You'll learn how to recognize problems in your applications that cause performance and distributed operation issues, diagnose them, and effectively eliminate them by relying on the bedrock big data principles explained within.

Moving beyond individual frameworks and APIs for data processing, this book unlocks the theoretical ideas that operate under the hood of every big data processing system.

Ideal for data scientists, data architects, dev-ops engineers, and developers, Foundations of Data Intensive Applications: Large Scale Data Analytics under the Hood shows readers how to:

  • Identify the foundations of large-scale, distributed data processing systems
  • Make major software design decisions that optimize performance
  • Diagnose performance problems and distributed operation issues
  • Understand state-of-the-art research in big data
  • Explain and use the major big data frameworks and understand what underpins them
  • Use big data analytics in the real world to solve practical problems

Author Biography

SUPUN KAMBURUGAMUVE has a Ph.D. in computer science specializing in high-performance data analytics. He is the architect behind the high-performance data analytics system Twister2 and is a principal software engineer at the Digital Science Center of Indiana University, where he researches and leads efforts to develop data analytics applications and frameworks. He is a frequent speaker at research and technical conferences including Strata NY, Big Data Conference, and ApacheCon.

SALIYA EKANAYAKE is a postdoctoral fellow at Berkeley Lab, specializing in improving the performance of large-scale machine learning systems. He received his Ph.D. in Computer Science from Indiana University, Bloomington, where his research contributed to the development of SPIDAL, a scalable parallel interoperable data analytics library that outperformed existing big data systems on several machine learning applications. His work has appeared in over twenty publications.

Table of Contents

Chapter 1: Introduction
Chapter 2: Large Data
Chapter 3: Going Distributed
Chapter 4: Distributing Applications
Chapter 5: Messaging is the Key
Chapter 6: CPUs or GPUs
Chapter 7: In Memory Data Structures
Chapter 8: Programming Abstractions
Chapter 9: Handling Faults
Chapter 10: Performance and Productivity

Supplemental Materials

What is included with this book?

The New copy of this book will include any supplemental materials advertised. Please check the title of the book to determine if it should include any access cards, study guides, lab manuals, CDs, etc.

The Used, Rental and eBook copies of this book are not guaranteed to include any supplemental materials. Typically, only the book itself is included. This is true even if the title states it includes any access cards, study guides, lab manuals, CDs, etc.
