Home | Amazing | Today | Tags | Publishers | Years | Account | Search 
Python Data Analysis Cookbook

Python Data Analysis Cookbook, 9781785282287 (178528228X), Packt Publishing, 2016

Key Features

  • Analyze Big Data sets, create attractive visualizations, and manipulate and process various data types
  • Packed with rich recipes to help you learn and explore amazing algorithms for statistics and machine learning
  • Authored by Ivan Idris, expert in python programming and proud author of eight highly reviewed books

Book Description

Data analysis is a rapidly evolving field and Python is a multi-paradigm programming language suitable for object-oriented application development and functional design patterns. As Python offers a range of tools and libraries for all purposes, it has slowly evolved as the primary language for data science, including topics on: data analysis, visualization, and machine learning.

Python Data Analysis Cookbook focuses on reproducibility and creating production-ready systems. You will start with recipes that set the foundation for data analysis with libraries such as matplotlib, NumPy, and pandas. You will learn to create visualizations by choosing color maps and palettes then dive into statistical data analysis using distribution algorithms and correlations. You’ll then help you find your way around different data and numerical problems, get to grips with Spark and HDFS, and then set up migration scripts for web mining.

In this book, you will dive deeper into recipes on spectral analysis, smoothing, and bootstrapping methods. Moving on, you will learn to rank stocks and check market efficiency, then work with metrics and clusters. You will achieve parallelism to improve system performance by using multiple threads and speeding up your code.

By the end of the book, you will be capable of handling various data analysis techniques in Python and devising solutions for problem scenarios.

What You Will Learn

  • Set up reproducible data analysis
  • Clean and transform data
  • Apply advanced statistical analysis
  • Create attractive data visualizations
  • Web scrape and work with databases, Hadoop, and Spark
  • Analyze images and time series data
  • Mine text and analyze social networks
  • Use machine learning and evaluate the results
  • Take advantage of parallelism and concurrency

About the Author

Ivan Idris was born in Bulgaria to Indonesian parents. He moved to the Netherlands and graduated in experimental physics. His graduation thesis had a strong emphasis on applied computer science. After graduating, he worked for several companies as a software developer, data warehouse developer, and QA analyst.

His professional interests are business intelligence, big data, and cloud computing. He enjoys writing clean, testable code and interesting technical articles. He is the author of NumPy Beginner's Guide, NumPy Cookbook, Learning NumPy, and Python Data Analysis, all by Packt Publishing.

Table of Contents

  1. Laying the Foundation for Reproducible Data Analysis
  2. Creating Attractive Data Visualizations
  3. Statistical Data Analysis and Probability
  4. Dealing with Data and Numerical Issues
  5. Web Mining, Databases, and Big Data
  6. Signal Processing and Timeseries
  7. Selecting Stocks with Financial Data Analysis
  8. Text Mining and Social Network Analysis
  9. Ensemble Learning and Dimensionality Reduction
  10. Evaluating Classifi ers, Regressors, and Clusters
  11. Analyzing Images
  12. Parallelism and Performance
  13. Glossary
  14. Function Reference
(HTML tags aren't allowed.)

Water and Biomolecules: Physical Chemistry of Life Phenomena (Biological and Medical Physics, Biomedical Engineering)
Water and Biomolecules: Physical Chemistry of Life Phenomena (Biological and Medical Physics, Biomedical Engineering)

Life is produced by the interplay of water and biomolecules. This book deals with the physicochemical aspects of such life phenomena produced by water and biomolecules, and addresses topics including "Protein Dynamics and Functions", "Protein and DNA Folding", and "Protein Amyloidosis". All sections have...

Learning C# 3.0
Learning C# 3.0

If you're new to C#, this popular book is the ideal way to get started. Completely revised for the latest version of the language, Learning C# 3.0 starts with the fundamentals and takes you through intermediate and advanced C# features -- including generics, interfaces, delegates, lambda expressions, and LINQ. You'll...

Deploying and Managing IP over WDM Networks
Deploying and Managing IP over WDM Networks
The integration of the Internet with optical networks and management of heterogeneous
and hybrid networks has been always a challenge for network and
service operators. Different frameworks and architectural approaches have been
proposed and investigated in the research literature and in the commercial
world. The purpose of this

Doing Business in 2005: Obstacles to Growth
Doing Business in 2005: Obstacles to Growth
Doing Business in 2005: Obstacles to Growth is the second in a series of annual reports investigating the scope and manner of regulations that enhance business activity and those that constrain it. New quantitative indicators on business regulations and their enforcement can be compared across more than 130 countries, and over time. The indicators...
Triumph Forsaken: The Vietnam War, 1954-1965 (v. 1)
Triumph Forsaken: The Vietnam War, 1954-1965 (v. 1)
Drawing on a wealth of new evidence from all sides, Triumph Forsaken overturns most of the historical orthodoxy on the Vietnam War. Through the analysis of international perceptions and power, it shows that South Vietnam was a vital interest of the United States. The book provides many new insights into the overthrow of Ngo Dinh Diem in 1963 and...
High-Speed DSP and Analog System Design
High-Speed DSP and Analog System Design

High-Speed DSP and Analog System Design is based on the author’s over 25 years of experience in high-speed DSP and computer systems and courses in both digital and analog systems design at Rice University. It provides hands-on, practical advice for working engineers, including: • Tips on cost-efficient design and system simulation...

©2020 LearnIT (support@pdfchm.net) - Privacy Policy