Data-Intensive Text Processing with MapReduce

Our world is being revolutionized by data-driven methods: access to large amounts of data has generated new insights and opened exciting new opportunities in commerce, science, and computing applications. Processing the enormous quantities of data necessary for these advances requires large clusters, making distributed computing paradigms...

MapReduce Design Patterns: Building Effective Algorithms and Analytics for Hadoop and Other Systems
Welcome to MapReduce Design Patterns! This book will be unique in some ways and familiar in others. First and foremost, this book is obviously about design patterns, which are templates or general guides to solving problems. We took a look at other design patterns books that have been written in the past as...
Hadoop MapReduce v2 Cookbook Second Edition

Explore the Hadoop MapReduce v2 ecosystem to gain insights from very large datasets

About This Book

  • Process large and complex datasets using next generation Hadoop
  • Install, configure, and administer MapReduce programs and learn what's new in MapReduce v2
  • More than 90...
MongoDB and PHP
Once every decade or so, a technology comes along that is so revolutionary that it fundamentally alters the way we approach everything we do. The world itself has changed. As I think back to 1995 when I first started developing Internet applications, our data needs were relatively simple. For the next 10 years,...
Raspberry Pi Super Cluster

As a Raspberry Pi enthusiast, have you ever considered increasing its performance with parallel computing? Discover just how easy it can be with the right help: this guide takes you through the process from start to finish.

Overview

  • Learn about parallel computing by building your own system using...
Getting Started with Hazelcast - Second Edition

Get acquainted with the highly scalable data grid, Hazelcast, and learn how to bring its powerful in-memory features into your application

About This Book

  • Store and pass data in your application using Hazelcast's scalable and resilient collections, working with real code and examples to see what is...
Hadoop Backup and Recovery solutions

Learn the best strategies for data recovery from Hadoop backup clusters and troubleshoot problems

About This Book

  • Learn the fundamentals of Hadoop's backup needs, recovery strategy, and troubleshooting
  • Determine common failure points, intimate HBase, and explore different backup...
Seven Concurrency Models in Seven Weeks: When Threads Unravel (The Pragmatic Programmers)

Your software needs to leverage multiple cores, handle thousands of users and terabytes of data, and continue working in the face of both hardware and software failure. Concurrency and parallelism are the keys, and Seven Concurrency Models in Seven Weeks equips you for this new world. See how emerging technologies such as actors and...

Big Data Glossary

To help you navigate the large number of new data tools available, this guide describes 60 of the most recent innovations, from NoSQL databases and MapReduce approaches to machine learning and visualization tools. Descriptions are based on first-hand experience with these tools in a production environment.

This handy...

CouchDB: The Definitive Guide

Three of CouchDB's creators show you how to use this document-oriented database as a standalone application framework or with high-volume, distributed applications. With its simple model for storing, processing, and accessing data, CouchDB is ideal for web applications that handle huge amounts of loosely structured data. That...

Beginning Apache Cassandra Development

Beginning Apache Cassandra Development introduces you to one of the most robust and best-performing NoSQL database platforms on the planet. Apache Cassandra is a document database following the JSON document model. It is specifically designed to manage large amounts of data across many commodity servers without there being any single...

Apache Sqoop Cookbook
It’s been four years since, via a post to the Apache JIRA, the first version of Sqoop was released to the world as an addition to Hadoop. Since then, the project has taken several turns, most recently landing as a top-level Apache project. I’ve been amazed at how many people use this small tool for a...
©2024 LearnIT (support@pdfchm.net) - Privacy Policy