作者: Chuck Lam
出版年: 2010-12-22
内容简介
——————————
IGHLIGHT Hadoop in Action is an example-rich tutorial that showsdevelopers how to implement data-intensive distributed computing using Hadoopand the Map- Reduce framework. DESCRIPTION Hadoop is an open sourceimplementation of Google's MapReduce framework for scalable, distributed dataprocessing. Hadoop in Action is for programmers, architects, and projectmanagers who have to process large amounts of data offline. The book beginswith several simple examples that illustrate the basic idea behind Hadoop.Later chapters explain the core framework components and demonstrate Hadoop ina variety of data analysis tasks. Throughout the book, readers will learn bestpractices and design patterns, and how to write meaningful programs in aMapReduce framework. KEY POINTS Explains distributed computing, MapReduce, andthe Hadoop framework Focuses on most-used features and rapid developmentsolutions Numerous hands-on examples to illustrate abstract ideas Concise,developer-centric, In Action style Multiple case studies demonstrate real-worldHadoop uses Covers popular Hadoop extensions that ease development and extendfunctionality
作者简介
——————————
Chuck Lam 目前建立了一个名为RollCall的移动社交网络公司,让活跃的个体用户拥有了一个社交助理。他以前曾是RockYou的高级技术组长,开发了社交应用 程序和数据处理基础架构,能够支撑上亿的用户。在斯坦福大学攻读博士的时候,Chuck就对大数据产生了兴趣。他的论文“Computational DataAcquisition”首创了可用于机器学习的数据采集方法,吸纳了来自开源软件和网络游戏等领域的思想。