Big Data Analytics with Microsoft HDInsight in 24 Hours, Sams Teach Yourself

by ;
  • ISBN13:


  • ISBN10:


  • Edition: 1st
  • Format: Paperback
  • Copyright: 2019-11-21
  • Publisher: Sams Publishing
  • Purchase Benefits
  • Free Shipping On Orders Over $35!
    Your order must be $35 or more to qualify for free economy shipping. Bulk sales, PO's, Marketplace items, eBooks and apparel do not qualify for this offer.
  • Get Rewarded for Ordering Your Textbooks! Enroll Now
List Price: $44.99 Save up to $5.40
  • Buy New
    Add to Cart Free Shipping


Supplemental Materials

What is included with this book?


Sams Teach Yourself Big Data Analytics with Microsoft HDInsight in 24 Hours

In just 24 lessons of one hour or less, Sams Teach Yourself Big Data Analytics with Microsoft HDInsight in 24 Hours helps you leverage Hadoop’s power on a flexible, scalable cloud platform using Microsoft’s newest business intelligence, visualization, and productivity tools.

This book’s straightforward, step-by-step approach shows you how to provision, configure, monitor, and troubleshoot HDInsight and use Hadoop cloud services to solve real analytics problems. You’ll gain more of Hadoop’s benefits, with less complexity–even if you’re completely new to Big Data analytics. Every lesson builds on what you’ve already learned, giving you a rock-solid foundation for real-world success.

Practical, hands-on examples show you how to apply what you learn

Quizzes and exercises help you test your knowledge and stretch your skills

Notes and tips point out shortcuts and solutions


Learn how to…

·         Master core Big Data and NoSQL concepts, value propositions, and use cases

·         Work with key Hadoop features, such as HDFS2 and YARN

·         Quickly install, configure, and monitor Hadoop (HDInsight) clusters in the cloud

·         Automate provisioning, customize clusters, install additional Hadoop projects, and administer clusters

·         Integrate, analyze, and report with Microsoft BI and Power BI

·         Automate workflows for data transformation, integration, and other tasks

·         Use Apache HBase on HDInsight

·         Use Sqoop or SSIS to move data to or from HDInsight

·         Perform R-based statistical computing on HDInsight datasets

·         Accelerate analytics with Apache Spark

·         Run real-time analytics on high-velocity data streams

·         Write MapReduce, Hive, and Pig programs


Register your book at for convenient access to downloads, updates, and corrections as they become available. 

Author Biography

Arshad Ali has more than 13 years of experience in the computer industry. As a DB/DW/BI consultant in an end-to-end delivery role, he has been working on several enterprise-scale data warehousing and analytics projects for enabling and developing business intelligence and analytic solutions. He specializes in database, data warehousing, and business intelligence/analytics application design, development, and deployment at the enterprise level. He frequently works with SQL Server, Microsoft Analytics Platform System (APS, or formally known as SQL Server Parallel Data Warehouse [PDW]), HDInsight (Hadoop, Hive, Pig, HBase, and so on), SSIS, SSRS, SSAS, Service Broker, MDS, DQS, SharePoint, and PPS. In the past, he has also handled performance optimization for several projects, with significant performance gain.

Arshad is a Microsoft Certified Solutions Expert (MCSE)–SQL Server 2012 Data Platform, and Microsoft Certified IT Professional (MCITP) in Microsoft SQL Server 2008–Database Development, Data Administration, and Business Intelligence. He is also certified on ITIL 2011 foundation.

He has worked in developing applications in VB, ASP, .NET, ASP.NET, and C#. He is a Microsoft Certified Application Developer (MCAD) and Microsoft Certified Solution Developer (MCSD) for the .NET platform in Web, Windows, and Enterprise.

Arshad has presented at several technical events and has written more than 200 articles related to DB, DW, BI, and BA technologies, best practices, processes, and performance optimization techniques on SQL Server, Hadoop, and related technologies. His articles have been published on several prominent sites.

On the educational front, Arshad holds a Master in Computer Applications degree and a Master in Business Administration in IT degree.

Arshad can be reached at, or visit to connect with him.


Manpreet Singh is a consultant and author with extensive expertise in architecture, design, and implementation of business intelligence and Big Data analytics solutions. He is passionate about enabling businesses to derive valuable insights from their data.

Manpreet has been working on Microsoft technologies for more than 8 years, with a strong focus on Microsoft Business Intelligence Stack, SharePoint BI, and Microsoft’s Big Data Analytics Platforms (Analytics Platform System and HDInsight). He also specializes in Mobile Business Intelligence solution development and has helped businesses deliver a consolidated view of their data to their mobile workforces.

Manpreet has coauthored books and technical articles on Microsoft technologies, focusing on the development of data analytics and visualization solutions with the Microsoft BI Stack and SharePoint. He holds a degree in computer science and engineering from Panjab University, India.

Manpreet can be reached at 

Table of Contents

1. Introduction of NoSQL, Big Data, Hadoop and Business Value Proposition
2. Hadoop Architecture, Ecosystem and Technology Stack
3. Hadoop Distributed File System (HDFS)
4. MapReduce Job Framework
5. Introducing Microsoft Windows Azure
6. Provisioning Your HDInsight Service Cluster, Automating HDInsight Cluster Provisioning
7. Exploring the HDInsight Name Node, Seconday Name Node and Data Nodes
8. Storing data in Windows Azure Blob Storage (WABS) vs HDFS
9. Using Windows Azure HDInsight Emulator
10. Programming MapReduce jobs (Mapper, Reducer and Driver)
11. Configuring and executing MapReduce jobs to Your HDInsight Cluster
12. Getting started with Hive
13. Creating Table, Views, Indexes and Loading Data
14. Accessing HDInsight over Hive and ODBC
15. Consuming HDInsight from Microsoft BI Tools
16. Integrating HDInsight with SQL Server Integration Services
17. Introducing PIG (Pig Latin and Runtime)
18. Writing PIG Queries for Loading, Filtering, Grouping, Combining Storing Etc.
19. Introducing and using Sqoop for Data Movement Between RDBMS (SQL Server) and HDInsight
20. Introducing and Using SSIS (SQL Server Integration Services) for Data Movement Between RDBMS (SQL Server) and HDInsight
21. Understand the different HDInsight Services and Configuration Files
22. Administer and Manage HDInsight Using Hadoop Command Prompt
23. Administer and Manage HDInsight Using Powershell
24. Logging in HDInsight
25. Troubleshooting Cluster Deployments, Troubleshooting Job Failures
26. Case Study - Sentiment Analysis with StreamInsight and HDInsight service
27. Hadoop 2.0 - What's New

Rewards Program

Write a Review