Home | Amazing | Today | Tags | Publishers | Years | Search 
Web Crawling and Data Mining with Apache Nutch
Web Crawling and Data Mining with Apache Nutch

Apache Nutch helps you to create your own search engine and customize it according to your needs. You can integrate Apache Nutch very easily with your existing application and get the maximum benefit from it. It can be easily integrated with different components like Apache Hadoop, Eclipse, and MySQL.

"Web Crawling and Data...

Learning Cloudera Impala
Learning Cloudera Impala

Perform interactive, real-time in-memory analytics on large amounts of data using the massive parallel processing engine Cloudera Impala

Overview

  • Step-by-step guidance to get you started with Impala on your Hadoop cluster
  • Manipulate your data rapidly by writing proper SQL statements
  • ...
HBase Administration Cookbook
HBase Administration Cookbook

Master HBase configuration and administration for optimum database performance

  • Move large amounts of data into HBase and learn how to manage it efficiently
  • Set up HBase on the cloud, get it ready for production, and run it smoothly with high performance
  • Maximize the ability of HBase with the...
Programming Hive
Programming Hive
Programming Hive introduces Hive, an essential tool in the Hadoop ecosystem that provides an SQL (Structured Query Language) dialect for querying data stored in the Hadoop Distributed Filesystem (HDFS), other filesystems that integrate with Hadoop, such as MapR-FS and Amazon’s S3 and databases like HBase (the...
Seven Databases in Seven Weeks: A Guide to Modern Databases and the NoSQL Movement
Seven Databases in Seven Weeks: A Guide to Modern Databases and the NoSQL Movement
It has been said that data is the new oil. If this is so, then databases are the fields, the refineries, the drills, and the pumps. Data is stored in databases, and if you’re interested in tapping into it, then coming to grips with the modern equipment is a great start.

Databases are
...
Hadoop: The Definitive Guide
Hadoop: The Definitive Guide
Hadoop: The Definitive Guide helps you harness the power of your data. Ideal for processing large datasets, the Apache Hadoop framework is an open source implementation of the MapReduce algorithm on which Google built its empire. This comprehensive resource demonstrates how to use Hadoop to build reliable, scalable, distributed systems: programmers...
Mastering Apache Spark
Mastering Apache Spark

Gain expertise in processing and storing data by using advanced techniques with Apache Spark

About This Book

  • Explore the integration of Apache Spark with third party applications such as H20, Databricks and Titan
  • Evaluate how Cassandra and Hbase can be used for storage
  • An...
Processing Big Data with Azure HDInsight: Building Real-World Big Data Systems on Azure HDInsight Using the Hadoop Ecosystem
Processing Big Data with Azure HDInsight: Building Real-World Big Data Systems on Azure HDInsight Using the Hadoop Ecosystem
Get a jump start on using Azure HDInsight and Hadoop Ecosystem components. As most Hadoop and Big Data projects are written in either Java, Scala, or Python, this book minimizes the effort to learn another language and is written from the perspective of a .NET developer. Hadoop components are covered, including Hive, Pig, HBase, Storm,...
Hadoop MapReduce v2 Cookbook Second Edition
Hadoop MapReduce v2 Cookbook Second Edition

Explore the Hadoop MapReduce v2 ecosystem to gain insights from very large datasets

About This Book

  • Process large and complex datasets using next generation Hadoop
  • Install, configure, and administer MapReduce programs and learn what's new in MapReduce v2
  • More than 90...
Spring Data
Spring Data
The data access landscape over the past seven or so years has changed dramatically. Relational databases, the heart of storing and processing data in the enterprise for over 30 years, are no longer the only game in town. The past seven years have seen the birth —and in some cases the death—of many...
Big Data Analytics with Microsoft HDInsight in 24 Hours, Sams Teach Yourself
Big Data Analytics with Microsoft HDInsight in 24 Hours, Sams Teach Yourself

With Microsoft HDInsight, business professionals and data analysts can rapidly leverage the power of Hadoop on a flexible, scalable cloud-based platform, using Microsoft's accessible business intelligence, visualization, and productivity tools. Now, in just 24 lessons of one hour or less, you can learn all the skills and...

Learning Storm
Learning Storm

Create real-time stream processing applications with Apache Storm

About This Book

  • Integrate Storm with other Big Data technologies like Hadoop, HBase, and Apache Kafka
  • Explore log processing and machine learning using Storm
  • Step-by-step and easy-to-understand guide to...
Result Page: 3 2 1 
©2024 LearnIT (support@pdfchm.net) - Privacy Policy