Python Data Analysis
Find, manipulate, and analyze your data using the Python 3.5 libraries
Perform advanced, high-performance linear algebra and mathematical calculations with clean and efficient Python code
An easy-to-follow guide with realistic examples of tasks frequently encountered in real-world data
Tap your unstructured Big Data and empower your business using the Hadoop distribution from Windows
Architect a Hadoop solution with a modular design for data collection, distributed processing, analysis, and reporting
Build a multi-node Hadoop cluster on Windows servers...
Hadoop Real World Solutions Cookbook
Ever felt you could use some no-nonsense, practical help when developing applications with Hadoop? Well, you've just found it. This real-world solutions cookbook is packed with handy recipes you can apply to your own everyday issues.
Solutions to common problems when working in the Hadoop...
Hadoop in Practice
Hadoop in Practice, Second Edition provides over 100 tested, instantly useful techniques that will help you conquer big data using Hadoop. This revised edition covers changes and new features in the Hadoop core architecture, including MapReduce 2. Brand-new chapters cover YARN and integrating...
Hadoop MapReduce v2 Cookbook Second Edition
Explore the Hadoop MapReduce v2 ecosystem to gain insights from very large datasets
About This Book
Process large and complex datasets using next generation Hadoop
Install, configure, and administer MapReduce programs and learn what's new in MapReduce v2
More than 90...
Learning Cloudera Impala
Perform interactive, real-time, in-memory analytics on large amounts of data using Cloudera Impala, a massively parallel processing engine
Step-by-step guidance to get you started with Impala on your Hadoop cluster
Manipulate your data rapidly by writing proper SQL statements...
Hadoop: The Definitive Guide
Discover how Apache Hadoop can unleash the power of your data. This comprehensive resource shows you how to build and maintain reliable, scalable, distributed systems with the Hadoop framework -- an open source implementation of MapReduce, the algorithm on which Google built its empire. Programmers will find details for analyzing...
This guide is an ideal learning tool and reference for Apache Pig, the open source engine for executing parallel data flows on Hadoop. With Pig, you can batch-process data without having to create a full-fledged application—making it easy for you to experiment with new datasets.
Programming Pig introduces...
Microsoft SQL Server 2012 with Hadoop
With the explosion of data, the open source Apache Hadoop ecosystem is gaining traction, thanks to the large set of tools that has arisen around the core functionalities of its distributed file system (HDFS) and MapReduce. Being able to have SQL Server talking to Hadoop has become increasingly important because the two are indeed...