Setup
Chapter 1 Theory
Agile Big Data
Big Words Defined
Agile Big Data Teams
Agile Big Data Process
Code Review and Pair Programming
Agile Environments: Engineering Productivity
Realizing Ideas with Large-Format Printing
Chapter 2 Data
Email
Working with Raw Data
SQL
NoSQL
Data Perspectives
Chapter 3 Agile Tools
Scalability = Simplicity
Agile Big Data Processing
Setting Up a Virtual Environment for Python
Serializing Events with Avro
Collecting Data
Data Processing with Pig
Publishing Data with MongoDB
Searching Data with ElasticSearch
Reflecting on Our Workflow
Lightweight Web Applications
Presenting Our Data
Conclusion
Chapter 4 To the Cloud!
Introduction
GitHub
dotCloud
Amazon Web Services
Instrumentation
Climbing the Pyramid
Chapter 5 Collecting and Displaying Records
Putting It All Together
Collect and Serialize Our Inbox
Process and Publish Our Emails
Presenting Emails in a Browser
Agile Checkpoint
Listing Emails
Searching Our Email
Conclusion
Chapter 6 Visualizing Data with Charts
Good Charts
Extracting Entities: Email Addresses
Visualizing Time
Conclusion
Chapter 7 Exploring Data with Reports
Building Reports with Multiple Charts
Linking Records
Extracting Keywords from Emails with TF-IDF
Conclusion
Chapter 8 Making Predictions
Predicting Response Rates to Emails
Personalization
Conclusion
Chapter 9 Driving Actions
Properties of Successful Emails
Better Predictions with Naive Bayes
P(Reply | From & To)
P(Reply | Token)
Making Predictions in Real Time
Logging Events
Conclusion
Colophon