全部版块 我的主页
论坛 数据科学与人工智能 大数据分析 spark高速集群计算平台
2445 1
2019-09-15
Book Description:

Leverage machine and deep learning models to build applications on real-time data using PySpark. This book is perfect for those who want to learn to use this language to perform exploratory data analysis and solve an array of business challenges.

You’ll start by reviewing PySpark fundamentals, such as Spark’s core architecture, and see how to use PySpark for big data processing like data ingestion, cleaning, and transformations techniques. This is followed by building workflows for analyzing streaming data using PySpark and a comparison of various streaming platforms.


You’ll then see how to schedule different spark jobs using Airflow with PySpark and book examine tuning machine and deep learning models for real-time predictions. This book concludes with a discussion on graph frames and performing network analysis using graph algorithms in PySpark. All the code presented in the book will be available in Python scripts on Github.


What You’ll Learn

  • Develop pipelines for streaming data processing using PySpark
  • Build Machine Learning & Deep Learning models using PySpark latest offerings
  • Use graph analytics using PySpark
  • Create Sequence Embeddings from Text data

Who This Book is For


Data Scientists, machine learning and deep learning engineers who want to learn and use PySpark for real time analysis on streaming data.


附件列表

Learn PySpark.pdf

大小:10.29 MB

只需: 5 个论坛币  马上下载

二维码

扫码加我 拉你入群

请注明:姓名-公司-职位

以便审核进群资格,未注明则拒绝

全部回复
2019-9-15 11:50:28
franky_sas 发表于 2019-9-15 11:10
Book Description:Leverage machine and deep learning models to build applications on real-time data u ...

二维码

扫码加我 拉你入群

请注明:姓名-公司-职位

以便审核进群资格,未注明则拒绝

栏目导航
热门文章
推荐文章

说点什么

分享

扫码加好友,拉您进群
各岗位、行业、专业交流群