Spark for Python Developers

Spark for Python Developers, 9781784399696 (1784399698), Packt Publishing, 2015

Key Features

  • Set up real-time streaming and batch data-intensive infrastructure using Spark and Python
  • Deliver insightful visualizations in a web app using Spark (PySpark)
  • Inject live data using Spark Streaming with real-time events

Book Description

Looking for a cluster computing system that provides high-level APIs? Apache Spark is your answer: an open source, fast, and general-purpose cluster computing system. Spark's multi-stage in-memory primitives provide performance up to 100 times faster than Hadoop, and it is also well suited to machine learning algorithms.

Are you a Python developer inclined to work with the Spark engine? If so, this book will be your companion as you create data-intensive apps using Spark as a processing engine, Python visualization libraries, and web frameworks such as Flask.

To begin with, you will learn the most effective way to install the Python development environment powered by Spark, Blaze, and Bokeh. You will then find out how to connect with data stores such as MySQL, MongoDB, Cassandra, and Hadoop.

You'll expand your skills throughout, becoming familiar with the various data sources (GitHub, Twitter, Meetup, and blogs), their data structures, and ways to tackle their complexities effectively. You'll explore datasets using IPython Notebook and discover how to optimize the data models and pipeline. Finally, you'll learn how to create training datasets and train machine learning models.

By the end of the book, you will have created a real-time and insightful trend tracker data-intensive app with Spark.

What you will learn

  • Create a Python development environment powered by Spark (PySpark), Blaze, and Bokeh
  • Build a real-time trend tracker data-intensive app
  • Visualize the trends and insights gained from data using Bokeh
  • Generate insights from data using machine learning through Spark MLlib
  • Juggle with data using Blaze
  • Create training datasets and train the machine learning models
  • Test the machine learning models on test datasets
  • Deploy the machine learning algorithms and models and scale them for real-time events

About the Author

Amit Nandi studied physics at the Free University of Brussels in Belgium, where he did his research on computer-generated holograms. Computer-generated holograms are the key components of an optical computer, which is powered by photons running at the speed of light. He then worked with the university Cray supercomputer, sending batch jobs of programs written in Fortran. This gave him a taste for computing, which kept growing. He has worked extensively on large business reengineering initiatives, using SAP as the main enabler. For the last 15 years, he has focused on start-ups in the data space, pioneering new areas of the information technology landscape. He is currently focusing on large-scale data-intensive applications as an enterprise architect, data engineer, and software developer. He understands and speaks seven human languages. Although Python is his computer language of choice, he aims to be able to write fluently in seven computer languages too.

Table of Contents

  1. Setting Up a Spark Virtual Environment
  2. Building Batch and Streaming Apps with Spark
  3. Juggling Data with Spark
  4. Learning from Data Using Spark
  5. Streaming Live Data with Spark
  6. Visualizing Insights and Trends

Yii Application Development Cookbook - Second Edition

This book is the perfect way to add the capabilities of Yii to your PHP5 development skills. Dealing with practical solutions through real-life recipes and screenshots, it enables you to write applications more efficiently.

Overview

  • Learn how to use Yii even more efficiently
  • Full of...
Geometry: The Third Book of Foundations (The Foundations Trilogy)

In this third installment of his classic 'Foundations' trilogy, Michel Serres takes on the history of geometry and mathematics. Even more broadly, Geometry is the beginnings of things and also how these beginnings have shaped how we continue to think philosophically and critically. Serres rejects a traditional history of mathematics...

Quantum Mechanics of Fundamental Systems: The Quest for Beauty and Simplicity: Claudio Bunster Festschrift
This is a collection of contributed papers by former collaborators and colleagues of Claudio Bunster (formerly Teitelboim). The topics include General Relativity, Quantum Gravity, String Theory; from historical reviews to current research; from mathematical structures underlying the fundamental interactions, to cosmological scenarios describing the...

Micro-Cutting: Fundamentals and Applications (Microsystem and Nanotechnology)

The state of the art of research and engineering practice in micro cutting, which is becoming increasingly important in modern micro manufacturing. Micro-Cutting is a comprehensive presentation of the state-of-the-art of research and engineering practice in micro cutting, a technique that is becoming increasingly important in modern micro...

Cisco LAN Switching Fundamentals

The essential guide for understanding Ethernet switched networks

  • Understand various Ethernet technologies from 10BASE-T to Gigabit Ethernet

  • Learn about common switching modes, paths, and architectures

  • ...
Photographers: A Sourcebook for Historical Research
Completely revised edition, featuring Richard Rudisill's Directories of Photographers, an annotated international bibliography, and six new essays on photography research. Included are David Haynes' "how to" essay (expanded to include new resources provided by the advent of the computer and the World Wide Web); Linda Ries' sequel to her...