Reference textbook: Modern Data Engineering with Apache Spark: A Hands-On Guide for Building Mission-Critical Streaming Applications
Author(s): S. Haines
Description:
This coursebook is an introduction to building consistent, “mission-critical” streaming applications using Apache Spark. You will not start writing streaming applications on page 1; instead, you will work hands-on, solving small problems with Spark and a wide array of supporting tools. Each chapter introduces a critical foundation, a new tool in your data engineering toolbox, and as the book progresses, you will gain exposure to many of the common data systems and services that work well with Apache Spark. By the end of the book, you will have written and deployed a fully tested Spark Structured Streaming application on Kubernetes. You will also have an entire containerized local data platform at your disposal, so you can take the ideas and implementations covered in this coursebook with you to your next project.