Beginning Apache Pig
Big Data Processing Made Easy
Author: Balaswamy Vaddeman
• The only Pig book that covers scheduling Pig jobs with Oozie
• The only Pig book that covers submitting Pig jobs through Hue
• A one-stop shop for all Apache Pig needs
Learn to use Apache Pig to develop lightweight big data applications easily and quickly. This book shows you many optimization techniques and covers every context where Pig is used in big data analytics. Beginning Apache Pig shows you how Pig is easy to learn and requires relatively little time to develop big data applications.
The book is divided into four parts: the complete features of Apache Pig; integration with other tools; how to solve complex business problems; and optimization of tools.
You'll discover topics such as MapReduce and why it cannot meet every business need; the features of Pig Latin such as data types for each load, store, joins, groups, and ordering; how Pig workflows can be created; submitting Pig jobs using Hue; and working with Oozie. You'll also see how to extend the framework by writing UDFs and custom load, store, and filter functions. Finally, you'll cover different optimization techniques such as gathering statistics about a Pig script, joining strategies, parallelism, and the role of data formats in good performance.
What You Will Learn
• Use all the features of Apache Pig
• Integrate Apache Pig with other tools
• Extend Apache Pig
• Optimize Pig Latin code
• Solve different use cases for Pig Latin
Table of contents (17 chapters)
Front Matter
MapReduce and Its Abstractions
Data Types
Grunt
Pig Latin Fundamentals
Joins and Functions
Creating and Scheduling Workflows Using Apache Oozie
HCatalog
Pig Latin in Hue
Pig Latin Scripts in Apache Falcon
Macros
User-Defined Functions
Writing Eval Functions
Writing Load and Store Functions
Troubleshooting
Data Formats
Optimization
Hadoop Ecosystem Tools
Back Matter
Original PDF + EPUB:
PDF archive: BAP (pdf).zip (3.73 MB), 15 forum coins
- Beginning Apache Pig_Big Data Processing Made Easy.pdf
EPUB archive: BAP (epub).zip (1.76 MB), 15 forum coins
- Beginning Apache Pig_Big Data Processing Made Easy.epub
PDF + EPUB archive: BAP (pdf epub).zip (5.49 MB), 30 forum coins
- Beginning Apache Pig_Big Data Processing Made Easy.pdf
- Beginning Apache Pig_Big Data Processing Made Easy.epub
If you like the books I share, please follow me:
https://bbs.pinggu.org/z_guanzhu.php?action=add&fuid=5975757
Subscribe to my collections:
[Finance + Economics + Business + International Politics]
https://bbs.pinggu.org/forum.php?mod=collection&action=view&ctid=3257
[Mathematics + Statistics + Computer Programming]
https://bbs.pinggu.org/forum.php?mod=collection&action=view&ctid=3258
[History + Psychology + Social and Natural Sciences]
https://bbs.pinggu.org/forum.php?mod=collection&action=view&ctid=3259