Spark for Python Developers

Spark for Python Developers, 9781784399696 (1784399698), Packt Publishing, 2015

Key Features

  • Set up real-time streaming and batch data-intensive infrastructure using Spark and Python
  • Deliver insightful visualizations in a web app using Spark (PySpark)
  • Inject live data using Spark Streaming with real-time events

Book Description

Looking for a cluster computing system that provides high-level APIs? Apache Spark is your answer: an open source, fast, general-purpose cluster computing system. Spark's multi-stage in-memory primitives can deliver performance up to 100 times faster than Hadoop MapReduce for certain applications, and it is also well suited to machine learning algorithms.

Are you a Python developer inclined to work with the Spark engine? If so, this book will be your companion as you create a data-intensive app using Spark as a processing engine, Python visualization libraries, and web frameworks such as Flask.

To begin with, you will learn the most effective way to install the Python development environment powered by Spark, Blaze, and Bokeh. You will then find out how to connect with data stores such as MySQL, MongoDB, Cassandra, and Hadoop.

You'll expand your skills throughout, becoming familiar with the various data sources (GitHub, Twitter, Meetup, and blogs), their data structures, and solutions to effectively tackle their complexities. You'll explore datasets using IPython Notebook and discover how to optimize the data models and pipeline. Finally, you'll learn how to create training datasets and train machine learning models.

By the end of the book, you will have created a real-time, data-intensive trend tracker app with Spark.

What you will learn

  • Create a Python development environment powered by Spark (PySpark), Blaze, and Bokeh
  • Build a real-time, data-intensive trend tracker app
  • Visualize the trends and insights gained from data using Bokeh
  • Generate insights from data using machine learning through Spark MLlib
  • Juggle with data using Blaze
  • Create training datasets and train the machine learning models
  • Test the machine learning models on test datasets
  • Deploy the machine learning algorithms and models and scale them for real-time events

About the Author

Amit Nandi studied physics at the Free University of Brussels in Belgium, where he did his research on computer-generated holograms. Computer-generated holograms are the key components of an optical computer, which is powered by photons running at the speed of light. He then worked with the university's Cray supercomputer, sending batch jobs of programs written in Fortran. This gave him a taste for computing, which kept growing. He has worked extensively on large business reengineering initiatives, using SAP as the main enabler. For the last 15 years, he has focused on start-ups in the data space, pioneering new areas of the information technology landscape. He is currently focusing on large-scale data-intensive applications as an enterprise architect, data engineer, and software developer. He understands and speaks seven human languages. Although Python is his computer language of choice, he aims to be able to write fluently in seven computer languages too.

Table of Contents

  1. Setting Up a Spark Virtual Environment
  2. Building Batch and Streaming Apps with Spark
  3. Juggling Data with Spark
  4. Learning from Data Using Spark
  5. Streaming Live Data with Spark
  6. Visualizing Insights and Trends
