Home | Amazing | Today | Tags | Publishers | Years | Account | Search 
Python Data Analysis Cookbook

Buy
Python Data Analysis Cookbook, 9781785282287 (178528228X), Packt Publishing, 2016

Key Features

  • Analyze Big Data sets, create attractive visualizations, and manipulate and process various data types
  • Packed with rich recipes to help you learn and explore amazing algorithms for statistics and machine learning
  • Authored by Ivan Idris, expert in python programming and proud author of eight highly reviewed books

Book Description

Data analysis is a rapidly evolving field and Python is a multi-paradigm programming language suitable for object-oriented application development and functional design patterns. As Python offers a range of tools and libraries for all purposes, it has slowly evolved as the primary language for data science, including topics on: data analysis, visualization, and machine learning.

Python Data Analysis Cookbook focuses on reproducibility and creating production-ready systems. You will start with recipes that set the foundation for data analysis with libraries such as matplotlib, NumPy, and pandas. You will learn to create visualizations by choosing color maps and palettes then dive into statistical data analysis using distribution algorithms and correlations. You’ll then help you find your way around different data and numerical problems, get to grips with Spark and HDFS, and then set up migration scripts for web mining.

In this book, you will dive deeper into recipes on spectral analysis, smoothing, and bootstrapping methods. Moving on, you will learn to rank stocks and check market efficiency, then work with metrics and clusters. You will achieve parallelism to improve system performance by using multiple threads and speeding up your code.

By the end of the book, you will be capable of handling various data analysis techniques in Python and devising solutions for problem scenarios.

What You Will Learn

  • Set up reproducible data analysis
  • Clean and transform data
  • Apply advanced statistical analysis
  • Create attractive data visualizations
  • Web scrape and work with databases, Hadoop, and Spark
  • Analyze images and time series data
  • Mine text and analyze social networks
  • Use machine learning and evaluate the results
  • Take advantage of parallelism and concurrency

About the Author

Ivan Idris was born in Bulgaria to Indonesian parents. He moved to the Netherlands and graduated in experimental physics. His graduation thesis had a strong emphasis on applied computer science. After graduating, he worked for several companies as a software developer, data warehouse developer, and QA analyst.

His professional interests are business intelligence, big data, and cloud computing. He enjoys writing clean, testable code and interesting technical articles. He is the author of NumPy Beginner's Guide, NumPy Cookbook, Learning NumPy, and Python Data Analysis, all by Packt Publishing.

Table of Contents

  1. Laying the Foundation for Reproducible Data Analysis
  2. Creating Attractive Data Visualizations
  3. Statistical Data Analysis and Probability
  4. Dealing with Data and Numerical Issues
  5. Web Mining, Databases, and Big Data
  6. Signal Processing and Timeseries
  7. Selecting Stocks with Financial Data Analysis
  8. Text Mining and Social Network Analysis
  9. Ensemble Learning and Dimensionality Reduction
  10. Evaluating Classifi ers, Regressors, and Clusters
  11. Analyzing Images
  12. Parallelism and Performance
  13. Glossary
  14. Function Reference
(HTML tags aren't allowed.)

Running Linux
Running Linux

Linux is the most exciting development today in the UNIX world -- and some would say in the world of the PC-compatible. A complete, UNIX-compatible operating system developed by volunteers on the Internet, Linux is distributed freely in electronic form and for low cost from many vendors. Its software packages include the X Window...

Engineering with Mathcad: Using Mathcad to Create and Organize your Engineering Calculations
Engineering with Mathcad: Using Mathcad to Create and Organize your Engineering Calculations
A must-have resource demonstrating the power of Mathcad to create calculations for solving complex engineering problems

Using the author's considerable experience of applying Mathcad to engineering problems, Engineering with Mathcad identifies the most powerful functions and features of the software and
...
Legislative XML for the Semantic Web: Principles, Models, Standards for Document Management
Legislative XML for the Semantic Web: Principles, Models, Standards for Document Management

The Internet already is the broadest and most used source of legal information: in the web one can find most statutory texts (all in some jurisdictions), a vast amount of case law, huge repertoires of doctrinal contributions, many blogs and fora discussing different legal issues. Moreover, as legal activities (legislative,...


Networked Neighbourhoods: The Connected Community in Context
Networked Neighbourhoods: The Connected Community in Context
The setting for this book is the networked community. The treatment of the subject matter is broad and interdisciplinary, with contributions from computer science, sociology, design, human factors and communication technology.

The chapter contributors, drawn from across Europe and North America, offer a varied prospectus of commentary,...

Project 2010 Project Management: Real World Skills for Certification and Beyond (Exam 70-178)
Project 2010 Project Management: Real World Skills for Certification and Beyond (Exam 70-178)

The ideal on-the-job reference guide for project managers who use Microsoft Project 2010

This must-have guide to using Microsoft Project 2010 is written from a real project manager's perspective and is packed with information you can use on the job. The book explores using Project 2010 during phases of project...

Continuous Delivery with Visual Studio ALM  2015
Continuous Delivery with Visual Studio ALM 2015

This book is the authoritative source on implementing Continuous Delivery practices using Microsoft’s Visual Studio and TFS 2015. Microsoft MVP authors Mathias Olausson and Jakob Ehn translate the theory behind this methodology and show step by step how to implement Continuous Delivery in a real world environment.

Building...

©2021 LearnIT (support@pdfchm.net) - Privacy Policy