SQL on Big Data
Technology, Architecture, and Innovation
Authors: Sumit Pal
 This is a rapidly changing and evolving space and the book both summarizes and elaborates on the merits and drawbacks of each of themThe audience for the book is broad—business analysts, BI engineers, developers, data scientists and architects, and quality assurance professionalsThere is no book on this topic—there is a lot of information (e.g., whitepapers, blogs, webinars), but all focus one solution rather than putting it all together and stitching a story around it
This is a rapidly changing and evolving space and the book both summarizes and elaborates on the merits and drawbacks of each of themThe audience for the book is broad—business analysts, BI engineers, developers, data scientists and architects, and quality assurance professionalsThere is no book on this topic—there is a lot of information (e.g., whitepapers, blogs, webinars), but all focus one solution rather than putting it all together and stitching a story around it
Learn various commercial and open source products that perform SQL on Big Data platforms. You will understand the architectures of the various SQL engines being used and how the tools work internally in terms of execution, data movement, latency, scalability, performance, and system requirements.
This book consolidates in one place solutions to the challenges associated with the requirements of speed, scalability, and the variety of operations needed for data integration and SQL operations. After discussing the history of the how and why of SQL on Big Data, the book provides in-depth insight into the products, architectures, and innovations happening in this rapidly evolving space.
SQL on Big Data discusses in detail the innovations happening, the capabilities on the horizon, and how they solve the issues of performance and scalability and the ability to handle different data types. The book covers how SQL on Big Data engines are permeating the OLTP, OLAP, and Operational analytics space and the rapidly evolving HTAP systems.
You will learn the details of:
• 
Batch Architectures—an understanding of the internals and how the existing Hive engine is built and how it is evolving continually to support new features and provide lower latency on queries
• 
Interactive Architectures—an understanding of how SQL engines are architected to support low latency on large data sets
• 
Streaming Architectures—an understanding of how SQL engines are architected to support queries on data in motion using in-memory and lock-free data structures
• 
Operational Architectures—an understanding of how SQL engines are architected for transactional and operational systems to support transactions on Big Data platforms
• 
Innovative Architectures—an exploration of the rapidly evolving newer SQL engines on Big Data with innovative ideas and concepts
Table of contents
Front Matter
Pages i-xvii
Why SQL on Big Data?
Pages 1-15
SQL-on-Big-Data Challenges & Solutions
Pages 17-33
Batch SQL—Architecture
Pages 35-59
Interactive SQL—Architecture
Pages 61-95
SQL for Streaming, Semi-Structured, and Operational Analytics
Pages 97-126
Innovations and the Road Ahead
Pages 127-145
Appendix
Pages 147-152
Back Matter
Pages 153-157
原版 PDF + EPUB:
本帖隐藏的内容
原版 PDF:
EPUB:
PDF + EPUB 压缩包: