Home | Amazing | Today | Tags | Publishers | Years | Account | Search 
Hadoop in Practice

Buy
Hadoop in Practice, 9781617292224 (1617292222), Manning Publications, 2014

Summary

Hadoop in Practice, Second Edition provides over 100 tested, instantly useful techniques that will help you conquer big data, using Hadoop. This revised new edition covers changes and new features in the Hadoop core architecture, including MapReduce 2. Brand new chapters cover YARN and integrating Kafka, Impala, and Spark SQL with Hadoop. You'll also get new and updated techniques for Flume, Sqoop, and Mahout, all of which have seen major new versions recently. In short, this is the most practical, up-to-date coverage of Hadoop available anywhere.

Purchase of the print book includes a free eBook in PDF, Kindle, and ePub formats from Manning Publications.

About the Book

It's always a good time to upgrade your Hadoop skills! Hadoop in Practice, Second Edition provides a collection of 104 tested, instantly useful techniques for analyzing real-time streams, moving data securely, machine learning, managing large-scale clusters, and taming big data using Hadoop. This completely revised edition covers changes and new features in Hadoop core, including MapReduce 2 and YARN. You'll pick up hands-on best practices for integrating Spark, Kafka, and Impala with Hadoop, and get new and updated techniques for the latest versions of Flume, Sqoop, and Mahout. In short, this is the most practical, up-to-date coverage of Hadoop available.

Readers need to know a programming language like Java and have basic familiarity with Hadoop.

What's Inside

  • Thoroughly updated for Hadoop 2
  • How to write YARN applications
  • Integrate real-time technologies like Storm, Impala, and Spark
  • Predictive analytics using Mahout and RR
  • Readers need to know a programming language like Java and have basic familiarity with Hadoop.

About the Author

Alex Holmes works on tough big-data problems. He is a software engineer, author, speaker, and blogger specializing in large-scale Hadoop projects.

Table of Contents

PART 1 BACKGROUND AND FUNDAMENTALS
PART 2 DATA LOGISTICS
PART 3 BIG DATA PATTERNS
PART 4 BEYOND MAPREDUCE
  1. Hadoop in a heartbeat
  2. Introduction to YARN
  3. Data serialization—working with text and beyond
  4. Organizing and optimizing data in HDFS
  5. Moving data into and out of Hadoop
  6. Applying MapReduce patterns to big data
  7. Utilizing data structures and algorithms at scale
  8. Tuning, debugging, and testing
  9. SQL on Hadoop
  10. Writing a YARN application
(HTML tags aren't allowed.)

Unique Chips and Systems (Computer Engineering Series)
Unique Chips and Systems (Computer Engineering Series)
Which came first, the system or the chip? While integrated circuits enable technology for the modern information age, computing, communication, and network chips fuel it. As soon as the integration ability of modern semiconductor technology offers presents opportunities, issues in power consumption, reliability, and form-factor present challenges....
Pro Ubuntu Server Administration
Pro Ubuntu Server Administration
Pro Ubuntu Server Administration teaches you advanced Ubuntu system building. After reading this book, you will be able to manage anything from simple file servers to multiple virtual servers to high–availability clusters. This is the capstone volume of the Apress Ubuntu trilogy that includes Beginning Ubuntu Linux, Third...
Algorithmic Bioprocesses (Natural Computing Series)
Algorithmic Bioprocesses (Natural Computing Series)

A fundamental understanding of algorithmic bioprocesses is key to learning how information processing occurs in nature at the cell level. The field is concerned with the interactions between computer science on the one hand and biology, chemistry, and DNA-oriented nanoscience on the other. In particular, this book offers a comprehensive overview...


Microsoft Visio 2010 Step by Step: The smart way to learn Microsoft Visio 2010-one step at a time!
Microsoft Visio 2010 Step by Step: The smart way to learn Microsoft Visio 2010-one step at a time!

Microsoft Visio 2010 is a bold new release. If you’re new to Visio, your timing is excellent! This version of Visio is easier to use than ever before and yet the diagrams you create can have more impact and style, and can present more real-world data than in any previous version.

If you’ve used prior versions...

Oracle® Database 10g INSIDER SOLUTIONS
Oracle® Database 10g INSIDER SOLUTIONS

Oracle Database 10g Insider Solutions is a must-have reference guide for all Oracle professionals. It provides much-needed information on best practices, tips, and techniques in debugging, installation, deployment, and tuning of the Oracle 10g database. You can draw upon the experience and...

Business Under Fire: How Israeli Companies Are Succeeding in the Face of Terror
Business Under Fire: How Israeli Companies Are Succeeding in the Face of Terror
American companies steeling themselves against the threat of terrorism can learn a lot from Israel's experience. Despite facing the constant grim reality of terrorism, the Israeli economy is surprisingly robust. How do businesses in Israel stay viable in a chaotic environment, and how do they rebuild in the wake of destruction? Based on in-depth...
©2020 LearnIT (support@pdfchm.net) - Privacy Policy