Home | Amazing | Today | Tags | Publishers | Years | Account | Search 
Next-Generation Big Data: A Practical Guide to Apache Kudu, Impala, and Spark

Buy

Utilize this practical and easy-to-follow guide to modernize traditional enterprise data warehouse and business intelligence environments with next-generation big data technologies.

Next-Generation Big Data takes a holistic approach, covering the most important aspects of modern enterprise big data. The book covers not only the main technology stack but also the next-generation tools and applications used for big data warehousing, data warehouse optimization, real-time and batch data ingestion and processing, real-time data visualization, big data governance, data wrangling, big data cloud deployments, and distributed in-memory big data computing. Finally, the book has an extensive and detailed coverage of big data case studies from Navistar, Cerner, British Telecom, Shopzilla, Thomson Reuters, and Mastercard.

What You’ll Learn

  • Install Apache Kudu, Impala, and Spark to modernize enterprise data warehouse and business intelligence environments, complete with real-world, easy-to-follow examples, and practical advice
  • Integrate HBase, Solr, Oracle, SQL Server, MySQL, Flume, Kafka, HDFS, and Amazon S3 with Apache Kudu, Impala, and Spark
  • Use StreamSets, Talend, Pentaho, and CDAP for real-time and batch data ingestion and processing
  • Utilize Trifacta, Alteryx, and Datameer for data wrangling and interactive data processing
  • Turbocharge Spark with Alluxio, a distributed in-memory storage platform
  • Deploy big data in the cloud using Cloudera Director
  • Perform real-time data visualization and time series analysis using Zoomdata, Apache Kudu, Impala, and Spark
  • Understand enterprise big data topics such as big data governance, metadata management, data lineage, impact analysis, and policy enforcement, and how to use Cloudera Navigator to perform common data governance tasks
  • Implement big data use cases such as big data warehousing, data warehouse optimization, Internet of Things, real-time data ingestion and analytics, complex event processing, and scalable predictive modeling
  • Study real-world big data case studies from innovative companies, including Navistar, Cerner, British Telecom, Shopzilla, Thomson Reuters, and Mastercard
Who This Book Is For

BI and big data warehouse professionals interested in gaining practical and real-world insight into next-generation big data processing and analytics using Apache Kudu, Impala, and Spark; and those who want to learn more about other advanced enterprise topics
(HTML tags aren't allowed.)

Acquired Brain Injury: An Integrative Neuro-Rehabilitation Approach
Acquired Brain Injury: An Integrative Neuro-Rehabilitation Approach
Crimmins (2000) marveled at the greatness of the “three pound-blob” that is our brain and control system. As seasoned clinicians in the field of neuro-rehabilitation, we still marvel each day at the resilience of the brain and at the exciting recoveries that we attempt to facilitate in survivors of acquired brain...
Z-80 Microprocessor: Architecture, Interfacing, Programming and Design
Z-80 Microprocessor: Architecture, Interfacing, Programming and Design

This text is intended for microprocessor courses at the undergraduate level in technology and engineering. ll is a comprehensive treatment of the microprocessor. covering both hardware and software based on the Z80 microprocessor family. The text assumes a course in digital logic as a prerequisite; however, it does not assume a background in...

International Classification of HRCT for Occupational and Environmental Respiratory Diseases
International Classification of HRCT for Occupational and Environmental Respiratory Diseases

Many international experts collaborated in creating this groundbreaking work, a principal-coding system, and in developing reference films and imaging parameters for the International Classification of HRCT for Occupational and Environmental Respiratory Diseases. The book is an authoritative guide to the recognition of dust...


Microsoft SQL Server 2005 New Features
Microsoft SQL Server 2005 New Features

Get full details on all the innovative features and benefits available in the upcoming release of SQL Server 2005. This authoritative guide explains the new and improved enterprise data management capabilities, developer functions, and business intelligence tools. You’ll see how the new release offers enhanced scalability,...

Bootstrap for Rails
Bootstrap for Rails

A quick-start guide to developing beautiful web applications with the Bootstrap toolkit and Rails framework

About This Book

  • Enhance your applications with Bootstrap modals and carousels
  • Explore the usage of advanced Bootstrap components and plugins in Rails through various examples
  • ...
Extending Docker
Extending Docker

Key Features

  • Get the first book on the market that shows you how to extend the capabilities of Docker using plugins and third-party tools
  • Master the skills of creating various plugins and integrating great tools in order to enhance the functionalities of Docker
  • A practical and learning guide...
©2020 LearnIT (support@pdfchm.net) - Privacy Policy