之前有人要的,补一个……
ISBN-13: 978-0-672-33727-7
ISBN-10: 0-672-33727-4
Library of Congress Control Number: 2015914167
Printed in the United States of America
First Printing November 2015
Contents at a Glance
Introduction
Part I: Understanding Big Data, Hadoop 1.0, and 2.0
HOUR 1 Introduction of Big Data, NoSQL, and Business Value Proposition
2 Introduction to Hadoop, Its Architecture, Ecosystem, and Microsoft Offerings
3 Hadoop Distributed File System Versions 1.0 and 2.0
4 The MapReduce Job Framework and Job Execution Pipeline
5 MapReduce—Advanced Concepts and YARN
Part II: Getting Started with HDInsight and Understanding Its Different
Components
HOUR 6 Getting Started with HDInsight, Provisioning Your HDInsight Service Cluster,
and Automating HDInsight Cluster Provisioning
7 Exploring Typical Components of HDFS Cluster
8 Storing Data in Microsoft Azure Storage Blob
9 Working with Microsoft Azure HDInsight Emulator
Part III: Programming MapReduce and HDInsight Script Action
HOUR 10 Programming MapReduce Jobs
11 Customizing the HDInsight Cluster with Script Action
Part IV: Querying and Processing Big Data in HDInsight
HOUR 12 Getting Started with Apache Hive and Apache Tez in HDInsight
13 Programming with Apache Hive, Apache Tez in HDInsight, and Apache
HCatalog
14 Consuming HDInsight Data from Microsoft BI Tools over Hive ODBC Driver:
Part 1
15 Consuming HDInsight Data from Microsoft BI Tools over Hive ODBC Driver:
Part 2
16 Integrating HDInsight with SQL Server Integration Services
17 Using Pig for Data Processing
18 Using Sqoop for Data Movement Between RDBMS and HDInsight
Part V: Managing Workflow and Performing Statistical Computing
HOUR 19 Using Oozie Workflows and Job Orchestration with HDInsight
20 Performing Statistical Computing with R
Part VI: Performing Interactive Analytics and Machine Learning
HOUR 21 Performing Big Data Analytics with Spark
22 Microsoft Azure Machine Learning
Part VII: Performing Real-time Analytics
HOUR 23 Performing Stream Analytics with Storm
24 Introduction to Apache HBase on HDInsight