Learning Spark: Lightning-Fast Big Data Analysis
Data in all domains is getting bigger. How can you work with it efficiently? Recently updated for Spark 1.3, this book introduces Apache Spark, the open source cluster computing system that makes data analytics fast to write and fast to run. With Spark, you can tackle big datasets quickly through simple APIs in Python, Java,...
Fast Data Processing with Spark - Second Edition
Perform real-time analytics using Spark in a fast, distributed, and scalable way
About This Book
Develop a machine learning system with Spark's MLlib and scalable algorithms
Deploy Spark jobs to various clusters such as Mesos, EC2, Chef, YARN, EMR, and so on
This is a...