Learning Spark: Lightning-Fast Big Data Analysis
Data in all domains is getting bigger. How can you work with it efficiently? Recently updated for Spark 1.3, this book introduces Apache Spark, the open source cluster computing system that makes data analytics fast to write and fast to run. With Spark, you can tackle big datasets quickly through simple APIs in Python, Java,...
Fast Data Processing with Spark - Second Edition
Perform real-time analytics using Spark in a fast, distributed, and scalable way
About This Book
Develop a machine learning system with Spark's MLlib and scalable algorithms
Deploy Spark jobs to various clusters such as Mesos, EC2, Chef, YARN, and EMR
This is a...
Practical Apache Spark: Using the Scala API
Work with Apache Spark using Scala to deploy and set up single-node, multi-node, and high-availability clusters. This book discusses various components of Spark such as Spark Core, DataFrames, Datasets and SQL, Spark Streaming, Spark MLlib, and R on Spark with the help of practical code snippets for each topic....