全部版块 我的主页
论坛 数据科学与人工智能 大数据分析 spark高速集群计算平台
5524 40
2016-01-01
hal143jWchkl5IbjkTo4mh45yuYqL9c2.jpeg
Spark for Python Developers by Amit Nandi
English | Feb. 4, 2016 | ISBN: 1784399698 | 146 Pages | AZW3/EPUB/PDF (conv) | 15.03 MB                                                

Looking for a cluster computing system that provides high-level APIs? Apache Spark is your answer―an open source, fast, and general purpose cluster computing system. Spark's multi-stage memory primitives provide performance up to 100 times faster than Hadoop, and it is also well-suited for machine learning algorithms.

Key Features

Set up real-time streaming and batch data intensive infrastructure using Spark and Python
Deliver insightful visualizations in a web app using Spark (PySpark)
Inject live data using Spark Streaming with real-time events

Book Description

Are you a Python developer inclined to work with Spark engine? If so, this book will be your companion as you create data-intensive app using Spark as a processing engine, Python visualization libraries, and web frameworks such as Flask.

To begin with, you will learn the most effective way to install the Python development environment powered by Spark, Blaze, and Bookeh. You will then find out how to connect with data stores such as MySQL, MongoDB, Cassandra, and Hadoop.

You'll expand your skills throughout, getting familiarized with the various data sources (Github, Twitter, Meetup, and Blogs), their data structures, and solutions to effectively tackle complexities. You'll explore datasets using iPython Notebook and will discover how to optimize the data models and pipeline. Finally, you'll get to know how to create training datasets and train the machine learning models.

By the end of the book, you will have created a real-time and insightful trend tracker data-intensive app with Spark.

What you will learn

Create a Python development environment powered by Spark (PySpark), Blaze, and Bookeh
Build a real-time trend tracker data intensive app
Visualize the trends and insights gained from data using Bookeh
Generate insights from data using machine learning through Spark MLLIB
Juggle with data using Blaze
Create training data sets and train the Machine Learning models
Test the machine learning models on test datasets
Deploy the machine learning algorithms and models and scale it for real-time events

About the Author

Amit Nandi studied physics at the Free University of Brussels in Belgium, where he did his research on computer generated holograms. Computer generated holograms are the key components of an optical computer, which is powered by photons running at the speed of light. He then worked with the university Cray supercomputer, sending batch jobs of programs written in Fortran. This gave him a taste for computing, which kept growing. He has worked extensively on large business reengineering initiatives, using SAP as the main enabler. He focused for the last 15 years on start-ups in the data space, pioneering new areas of the information technology landscape. He is currently focusing on large-scale data-intensive applications as an enterprise architect, data engineer, and software developer. He understands and speaks seven human languages. Although Python is his computer language of choice, he aims to be able to write fluently in seven computer languages too.

Table of Contents

Setting Up a Spark Virtual Environment
Building Batch and Streaming Apps with Spark
Juggling Data with Spark
Learning from Data Using Spark
Streaming Live Data with Spark
Visualizing Insights and Trends

本帖隐藏的内容

Spark for Python Developers - Amit Nandi.rar
大小:(8.88 MB)

只需: 5 个论坛币  马上下载




二维码

扫码加我 拉你入群

请注明:姓名-公司-职位

以便审核进群资格,未注明则拒绝

全部回复
2016-1-1 20:27:15
二维码

扫码加我 拉你入群

请注明:姓名-公司-职位

以便审核进群资格,未注明则拒绝

2016-1-1 22:03:38
提示: 作者被禁止或删除 内容自动屏蔽
二维码

扫码加我 拉你入群

请注明:姓名-公司-职位

以便审核进群资格,未注明则拒绝

2016-1-1 22:06:22
提示: 作者被禁止或删除 内容自动屏蔽
二维码

扫码加我 拉你入群

请注明:姓名-公司-职位

以便审核进群资格,未注明则拒绝

2016-1-2 04:55:29
Thanks for sharing
二维码

扫码加我 拉你入群

请注明:姓名-公司-职位

以便审核进群资格,未注明则拒绝

2016-1-2 05:56:59
thanks
二维码

扫码加我 拉你入群

请注明:姓名-公司-职位

以便审核进群资格,未注明则拒绝

点击查看更多内容…
相关推荐
栏目导航
热门文章
推荐文章

说点什么

分享

扫码加好友,拉您进群
各岗位、行业、专业交流群