Learn IT - Books tags hadoop

HBase in Action

Manning Publications, 2012

I got my start with HBase in the fall of 2008. It was a young project then, released only in the preceding year. As early releases go, it was quite capable, although not without its fair share of embarrassing warts. Not bad for an Apache subproject with fewer than 10 active committers to its name! That was the height of the NoSQL...

Data Warehousing in the Age of Big Data (The Morgan Kaufmann Series on Business Intelligence)

Morgan Kaufmann, 2013

Web 2.0 has changed the way we conduct business, interact with customers, share information with friends and family, measure success in terms of business revenue and customer wallet share, and define brand management, and, most importantly, it has created a revenue channel like none other. Whether you plan your vacation, buy the newest...

Network Security Through Data Analysis: Building Situational Awareness

O'Reilly, 2014

Traditional intrusion detection and logfile analysis are no longer enough to protect today’s complex networks. In this practical guide, security researcher Michael Collins shows you several techniques and tools for collecting and analyzing network traffic datasets. You’ll understand how your network is used, and what...

Getting Started with Greenplum for Big Data Analytics

Packt Publishing, 2013

Big Data started off as a technology buzzword and rapidly grew into the headline agenda of several corporate strategies across industry verticals. With the amount of structured and unstructured data available to organizations exploding, analysis of these large data sets is increasingly becoming a key basis of competition, productivity growth,...

Information Management: Strategies for Gaining a Competitive Advantage with Data

Morgan Kaufmann, 2013

Information Management: Gaining a Competitive Advantage with Data is about making smart decisions to make the most of company information. Expert author William McKnight develops the value proposition for information in the enterprise and succinctly outlines the numerous forms of data storage. Information Management will...

Web Crawling and Data Mining with Apache Nutch

Packt Publishing, 2013

Apache Nutch helps you to create your own search engine and customize it according to your needs. You can integrate Apache Nutch very easily with your existing application and get the maximum benefit from it. It can be easily integrated with different components like Apache Hadoop, Eclipse, and MySQL.

"Web Crawling and Data...

Storm Blueprints: Patterns for Distributed Real-time Computation

Packt Publishing, 2014

A blueprints book with 10 different projects built in 10 different chapters that demonstrate the various use cases of Storm for both beginner and intermediate users, grounded in real-world example applications.

Although the book focuses primarily on Java development with Storm, the patterns are more broadly applicable and...

Big Data Bootcamp: What Managers Need to Know to Profit from the Big Data Revolution

Apress, 2014

Investors and technology gurus have called big data one of the most important trends to come along in decades. Big Data Bootcamp explains what big data is and how you can use it in your company to become one of tomorrow’s market leaders. Along the way, it explains the very latest...

Disruptive Possibilities: How Big Data Changes Everything

O'Reilly, 2013

Big data has more disruptive potential than any information technology developed in the past 40 years. As author Jeffrey Needham points out in this revealing book, big data can provide unprecedented visibility into the operational efficiency of enterprises and agencies.

Disruptive Possibilities provides an...

Mastering Apache Spark

Packt Publishing, 2015

Gain expertise in processing and storing data by using advanced techniques with Apache Spark

About This Book

Explore the integration of Apache Spark with third-party applications such as H2O, Databricks, and Titan

Evaluate how Cassandra and HBase can be used for storage

An...

Python Data Analysis Cookbook

Packt Publishing, 2016

Key Features

Analyze Big Data sets, create attractive visualizations, and manipulate and process various data types

Packed with rich recipes to help you learn and explore amazing algorithms for statistics and machine learning

Authored by Ivan Idris, expert in Python programming and proud...

Professional NoSQL

Wrox Press, 2011

A hands-on guide to leveraging NoSQL databases

NoSQL databases are an efficient and powerful tool for storing and manipulating vast quantities of data. Most NoSQL databases scale well as data grows. In addition, they are often malleable and flexible enough to accommodate semi-structured and sparse data sets. This...
