Home | Amazing | Today | Tags | Publishers | Years | Account | Search 
HBase: The Definitive Guide

Buy
HBase: The Definitive Guide, 9781449396107 (1449396100), O'Reilly, 2011

You may be reading this book for many reasons. It could be because you heard all about Hadoop and what it can do to crunch petabytes of data in a reasonable amount of time. While reading into Hadoop you found that, for random access to the accumulated data, there is something called HBase. Or it was the hype that is prevalent these days addressing a new kind of data storage architecture. It strives to solve large-scale data problems where traditional solutions may be either too involved or cost-prohibitive. A common term used in this area is NoSQL.

No matter how you have arrived here, I presume you want to know and learn—like I did not too long ago—how you can use HBase in your company or organization to store a virtually endless amount of data. You may have a background in relational database theory or you want to start fresh and this “column-oriented thing” is something that seems to fit your bill. You also heard that HBase can scale without much effort, and that alone is reason enough to look at it since you are building the next webscale system.

I was at that point in late 2007 when I was facing the task of storing millions of documents in a system that needed to be fault-tolerant and scalable while still being maintainable by just me. I had decent skills in managing a MySQL database system, and was using the database to store data that would ultimately be served to our website users. This database was running on a single server, with another as a backup. The issue was that it would not be able to hold the amount of data I needed to store for this new project. I would have to either invest in serious RDBMS scalability skills, or find something else instead.

Obviously, I took the latter route, and since my mantra always was (and still is) “How does someone like Google do it?” I came across Hadoop. After a few attempts to use Hadoop directly, I was faced with implementing a random access layer on top of it— but that problem had been solved already: in 2006, Google had published a paper titled “Bigtable”* and the Hadoop developers had an open source implementation of it called HBase (the Hadoop Database). That was the answer to all my problems. Or so it seemed...

(HTML tags aren't allowed.)

A Decision Framework for Integrated Wetland-River Basin Management in a Tropical and Data Scarce Environment: UNESCO-IHE PhD Thesis
A Decision Framework for Integrated Wetland-River Basin Management in a Tropical and Data Scarce Environment: UNESCO-IHE PhD Thesis

Traditionally, wetlands were considered separately from river basin systems. However, nowadays it is becoming common practice to follow an integrated approach in wetland-riverine watershed analysis and management. Such approach requires not only adequate representations of all relevant bio-physical parameters, but also of socio-political and...

Inside the Minds: Winning Antitrust Strategies - Leading Lawyers
Inside the Minds: Winning Antitrust Strategies - Leading Lawyers

This book provides a broad, yet comprehensive overview of legal discipline and elements outlining the current shape and future state of antitrust.

Inside the Minds: Winning Antitrust Strategies is an authoritative, insider’s perspective on the basic pillars that govern antitrust and...

Python Machine Learning
Python Machine Learning

Unlock deeper insights into Machine Leaning with this vital guide to cutting-edge predictive analytics

About This Book

  • Leverage Python's most powerful open-source libraries for deep learning, data wrangling, and data visualization
  • Learn effective strategies and best practices to...

CMS Design Using PHP and jQuery
CMS Design Using PHP and jQuery

PHP and jQuery are two of the most famous open source frameworks used for web development. This book will explain how to leverage their power by building a core CMS which can be used for most projects without needing to be written, and how to add custom plugins that can then be tailored to the individual project.

This book walks...

Synthetische Biologie: Die Geburt einer neuen Technikwissenschaft (acatech DISKUTIERT) (German Edition)
Synthetische Biologie: Die Geburt einer neuen Technikwissenschaft (acatech DISKUTIERT) (German Edition)

Eine wirtschaftliche und einfache Herstellung komplexer Moleküle für maßgeschneiderte Werkstoffe, Arzneimittel oder auch Treibstoffe, dazu neue Methoden für Forschungszweige wie Biomedizin, Materialwissenschaft oder Katalyseforschung: Die Liste potenzieller Anwendungen der Synthetischen Biologie, einer Forschungsrichtung...

Using Galaxy Tab: An Android Tablet
Using Galaxy Tab: An Android Tablet

Learn to use your Galaxy Tab with ease, no jargon. Clear, concise, short and to the point... Using Galaxy Tab is the essential guide to getting around Samsung's flagship and best selling Android-based tablet and e-book reader. This book explores setting up your new Tab and registering; navigating around the interface and what all...

©2021 LearnIT (support@pdfchm.net) - Privacy Policy