Home | Amazing | Today | Tags | Publishers | Years | Account | Search 
Learning Cloudera Impala

Buy
Learning Cloudera Impala, 9781783281275 (1783281278), Packt Publishing, 2013

Perform interactive, real-time in-memory analytics on large amounts of data using the massive parallel processing engine Cloudera Impala

Overview

  • Step-by-step guidance to get you started with Impala on your Hadoop cluster
  • Manipulate your data rapidly by writing proper SQL statements
  • Explore the concepts of Impala security, administration, and troubleshooting in detail to maintain your Impala cluster

In Detail

If you have always wanted to crunch billions of rows of raw data on Hadoop in a couple of seconds, then Cloudera Impala is the number one choice for you. Cloudera Impala provides fast, interactive SQL queries directly on your Apache Hadoop data stored in HDFS or HBase. In addition to using the same unified storage platform, Impala also uses the same metadata, SQL syntax (Hive SQL), ODBC driver, and user interface (Hue Beeswax) as Apache Hive. This provides a familiar and unified platform for batch-oriented or real-time queries.

In this practical, example-oriented book, you will learn everything you need to know about Cloudera Impala so that you can get started on your very own project. The book covers everything about Cloudera Impala from installation, administration, and query processing, all the way to connectivity with other third party applications. With this book in your hand, you will find yourself empowered to play with your data in Hadoop.

As a reader of this book, you will learn about the origin of Impala and the technology behind it that allows it to run on thousands of machines. You will learn how to install, run, manage, and troubleshoot Impala in your own Hadoop cluster using the step-by-step guidance provided in the book. The book covers tenets of data processing such as loading data stored in Hadoop into Impala tables and querying data using Impala SQL statements, all with various code illustrations and a real-world example.

The book is written to get you started with Impala by providing rich information so you can understand what Impala is, what it can do for you, and finally how you can use it to achieve your objective.

What you will learn from this book

  • Understand the various ways of installing Impala in your Hadoop cluster
  • Use the Impala shell API to interact with Impala components
  • Utilize Impala Query Language and built-in functions to play with data
  • Administrate and fine-tune Impala for high availability
  • Identify and troubleshoot problems in a variety of ways
  • Get acquainted with various input data formats in Hadoop and how to use them with Impala
  • Comprehend how third party applications can connect with Impala to provide data visualization and various other enhancements

Approach

This book is an easy-to-follow, step-by-step tutorial where each chapter takes your knowledge to the next level. The book covers practical knowledge with tips to implement this knowledge in real-world scenarios. A chapter with a real-life example is included to help you understand the concepts in full.

Who this book is written for

Using Cloudera Impala is for those who really want to take advantage of their Hadoop cluster by processing extremely large amounts of raw data in Hadoop at real-time speed. Prior knowledge of Hadoop and some exposure to HIVE and MapReduce is expected.

(HTML tags aren't allowed.)

Innovative Cryptography (Programming Series)
Innovative Cryptography (Programming Series)
Innovative Cryptography, Second Edition provides a cutting-edge evaluation and review of current findings in the area of cryptography and explores how to implement these new techniques efficiently. It covers current cryptographic problems and suggests practical solutions. The book also discusses the role of symmetric ciphers and symmetric block...
Think Java: How to Think Like a Computer Scientist
Think Java: How to Think Like a Computer Scientist

Currently used at many colleges, universities, and high schools, this hands-on introduction to computer science is ideal for people with little or no programming experience. The goal of this concise book is not just to teach you Java, but to help you think like a computer scientist. You’ll learn how to program—a useful...

Geographical Information Systems in Archaeology (Cambridge Manuals in Archaeology)
Geographical Information Systems in Archaeology (Cambridge Manuals in Archaeology)

Geographical Information Systems (GIS) is a rapidly developing archaeological method which is moving from the domain of the computer specialist into that of the broader archaeological community. This comprehensive manual on the use of GIS in archaeology explores the concept of GIS and illustrates how it can be adapted for practical use....


MCSE: The Core Exams in a Nutshell (In a Nutshell (O'Reilly))
MCSE: The Core Exams in a Nutshell (In a Nutshell (O'Reilly))

Microsoft's MCSE (Microsoft Certified Systems Engineer) program is a rigorous testing and certification program for Windows NT system and network administrators. To achieve certification, one must pass four required exams and two elective exams. Close to twenty potential elective exams exist, although only nine of them are...

Survivors of Childhood and Adolescent Cancer: A Multidisciplinary Approach (Pediatric Oncology)
Survivors of Childhood and Adolescent Cancer: A Multidisciplinary Approach (Pediatric Oncology)

It was not long ago that clinicians would say,“study ed at the 1975 meeting revealed. Among them was the late complications of cancer treatments we give to one based on data collected by the Late Effects Study children? You must be joking! We can start worrying Group, an international consortium that consisted about that when we start...

The Definitive Guide to iReport (Expert's Voice)
The Definitive Guide to iReport (Expert's Voice)
JasperForge.org is the open source development portal for the JasperSoft Business Intelligence Suite, the JasperSoft Business Intelligence solution that delivers comprehensive tools for data access, data integration, analysis, and reporting, including iReport. This definitive, authoritative book covers the following:
  • Covers iReport...
©2021 LearnIT (support@pdfchm.net) - Privacy Policy