Spark for Python Developers

Spark for Python Developers, 9781784399696 (1784399698), Packt Publishing, 2015

Key Features

  • Set up real-time streaming and batch data-intensive infrastructure using Spark and Python
  • Deliver insightful visualizations in a web app using Spark (PySpark)
  • Inject live data using Spark Streaming with real-time events

Book Description

Looking for a cluster computing system that provides high-level APIs? Apache Spark is your answer: an open source, fast, and general-purpose cluster computing system. Spark's multi-stage in-memory primitives provide performance up to 100 times faster than Hadoop, and it is also well suited to machine learning algorithms.

Are you a Python developer inclined to work with the Spark engine? If so, this book will be your companion as you create a data-intensive app using Spark as a processing engine, Python visualization libraries, and web frameworks such as Flask.

To begin with, you will learn the most effective way to install the Python development environment powered by Spark, Blaze, and Bokeh. You will then find out how to connect with data stores such as MySQL, MongoDB, Cassandra, and Hadoop.

You'll expand your skills throughout, becoming familiar with the various data sources (GitHub, Twitter, Meetup, and blogs), their data structures, and solutions for tackling their complexities effectively. You'll explore datasets using IPython Notebook and discover how to optimize the data models and pipeline. Finally, you'll learn how to create training datasets and train machine learning models.

By the end of the book, you will have created a real-time, insightful, data-intensive trend tracker app with Spark.

What you will learn

  • Create a Python development environment powered by Spark (PySpark), Blaze, and Bokeh
  • Build a real-time trend tracker data-intensive app
  • Visualize the trends and insights gained from data using Bokeh
  • Generate insights from data using machine learning through Spark MLlib
  • Juggle with data using Blaze
  • Create training datasets and train the machine learning models
  • Test the machine learning models on test datasets
  • Deploy the machine learning algorithms and models and scale them for real-time events

About the Author

Amit Nandi studied physics at the Free University of Brussels in Belgium, where he did his research on computer-generated holograms. Computer-generated holograms are the key components of an optical computer, which is powered by photons running at the speed of light. He then worked with the university's Cray supercomputer, sending batch jobs of programs written in Fortran. This gave him a taste for computing, which kept growing. He has worked extensively on large business reengineering initiatives, using SAP as the main enabler. For the last 15 years, he has focused on start-ups in the data space, pioneering new areas of the information technology landscape. He is currently focusing on large-scale data-intensive applications as an enterprise architect, data engineer, and software developer. He understands and speaks seven human languages. Although Python is his computer language of choice, he aims to be able to write fluently in seven computer languages too.

Table of Contents

  1. Setting Up a Spark Virtual Environment
  2. Building Batch and Streaming Apps with Spark
  3. Juggling Data with Spark
  4. Learning from Data Using Spark
  5. Streaming Live Data with Spark
  6. Visualizing Insights and Trends

Nikon D7000 For Dummies

Learn all about the Nikon D7000, the fun and friendly For Dummies way!

Whether you're a digital camera beginner or an experienced photographer, this is the book you need to get the most out of the Nikon D7000, the update to Nikon's popular D90 model. The helpful tips and tricks in this fun and easy guide will get you quickly...

Knowledge Networks: Innovation Through Communities of Practice
Knowledge Networks: Innovation Through Communities of Practice draws on the experience of people who have worked with CoPs in the real world and presents their combined wisdom in a form that is accessible to a wide audience. CoPs are examined from a practical, rather than a purely academic, point of view. The book also examines the benefits that...
Design Patterns For Dummies (Computer/Tech)
There's a pattern here, and here's how to use it!

Find out how the 23 leading design patterns can save you time and trouble

Ever feel as if you've solved this programming problem before? You — or someone — probably did, and that's why there's a design pattern to help...


Scripting Intelligence: Web 3.0 Information, Gathering and Processing
This book covers Web 3.0 technologies from a software developer’s point of view. While nontechies can use web services and portals that other people create, developers have the ability to be creators and consumers at the same time—by integrating their work with other people’s efforts.

The Meaning of Web 3.0
...
Primer of Diagnostic Imaging: Expert Consult - Online and Print

Widely known as THE survival guide for radiology residents, fellows, and junior faculty, the "purple book" provides comprehensive, up-to-date coverage of diagnostic imaging in an easy-to-read, bulleted format. Focusing on the core information you need for learning and practice, this portable resource combines...

Facebook Marketing For Dummies

Discover how to leverage the power of the Facebook community to achieve your business marketing goals

Facebook boasts an extremely devoted user base, with more than 65 billion page visits per month. With Facebook, an organization can market and promote their brand, products, or services via the network's built-in...
