Home | Amazing | Today | Tags | Publishers | Years | Account | Search 
Big Data Made Easy: A Working Guide to the Complete Hadoop Toolset

Buy

Many corporations are finding that the size of their data sets are outgrowing the capability of their systems to store and process them. The data is becoming too big to manage and use with traditional tools. The solution: implementing a big data system.

As Big Data Made Easy: A Working Guide to the Complete Hadoop Toolset shows, Apache Hadoop offers a scalable, fault-tolerant system for storing and processing data in parallel. It has a very rich toolset that allows for storage (Hadoop), configuration (YARN and ZooKeeper), collection (Nutch and Solr), processing (Storm, Pig, and Map Reduce), scheduling (Oozie), moving (Sqoop and Avro), monitoring (Chukwa, Ambari, and Hue), testing (Big Top), and analysis (Hive).

The problem is that the Internet offers IT pros wading into big data many versions of the truth and some outright falsehoods born of ignorance. What is needed is a book just like this one: a wide-ranging but easily understood set of instructions to explain where to get Hadoop tools, what they can do, how to install them, how to configure them, how to integrate them, and how to use them successfully. And you need an expert who has worked in this area for a decade—someone just like author and big data expert Mike Frampton.

Big Data Made Easy approaches the problem of managing massive data sets from a systems perspective, and it explains the roles for each project (like architect and tester, for example) and shows how the Hadoop toolset can be used at each system stage. It explains, in an easily understood manner and through numerous examples, how to use each tool. The book also explains the sliding scale of tools available depending upon data size and when and how to use them. Big Data Made Easy shows developers and architects, as well as testers and project managers, how to:

  • Store big data
  • Configure big data
  • Process big data
  • Schedule processes
  • Move data among SQL and NoSQL systems
  • Monitor data
  • Perform big data analytics
  • Report on big data processes and projects
  • Test big data systems

Big Data Made Easy also explains the best part, which is that this toolset is free. Anyone can download it and—with the help of this book—start to use it within a day. With the skills this book will teach you under your belt, you will add value to your company or client immediately, not to mention your career.

What you’ll learn

  • How to install and employ Hadoop
  • How to install and use Hadoop-related tools like Hive, Storm, Pig, Solr, Oozie, Ambari, and many others
  • How to set up and test a big data system
  • How to scale the system for the amount of data at hand and the data you expect to accumulate
  • How those who have spent their careers in the SQL database world can apply their skills to building big data systems

Who this book is for

This book is for developers, architects, IT project managers, database administrators, and others charged with developing or supporting a big data system. It is also for a general IT audience, anyone interested in Hadoop or big data, and those experiencing problems with data size. It’s also for anyone who would like to further their career in this area by adding big data skills.

(HTML tags aren't allowed.)

Pour Your Heart Into It: How Starbucks Built a Company One Cup at a Time
Pour Your Heart Into It: How Starbucks Built a Company One Cup at a Time

The success of Starbucks Coffee Company is one of the most amazing business stories in decades. What started as a single store on Seattle’s waterfront has grown into a company with over sixteen hundred stores worldwide and a new one opening every single business day. Just as remarkable as this incredible growth is the fact that...

Alzheimer: 100 Years and Beyond (Research and Perspectives in Alzheimer's Disease)
Alzheimer: 100 Years and Beyond (Research and Perspectives in Alzheimer's Disease)

Few medical or scientific addresses have so unmistakeably made history as the presentation delivered by Alois Alzheimer on November 4, 1906 in Tübingen.

The celebratory event "Alzheimer 100 Years and Beyond" was organized through the Alzheimer community in Germany and worldwide, in collaboration with the Fondation...

Outsourcing: The Definitive View, Applications, and Implications
Outsourcing: The Definitive View, Applications, and Implications
Outsourcing is the hottest topic in business, and it will likely stay that way. Managers, workers, executives, and almost anyone else involved in any large business will probably have to deal with it one day, in one way or another. Outsourcing is a business issue first and foremost, but it's also a political, personal, and cultural issue...

Encyclopedia of Twentieth-Century Photography (3 Volumes)
Encyclopedia of Twentieth-Century Photography (3 Volumes)
Photography is an international cultural form, practice, and profession. Encompassing art, advertising, journalism, fashion, commercial, political, and everyday photography, the field of photography also includes chemical processes, mechanical inventions, equipment, industries, movements, techniques, terms and concepts. In addition, photography has...
MySpace For Dummies (Computers)
MySpace For Dummies (Computers)
Activate the built-in tools to use MySpace safely

Trick out your profile, meet new people, and promote your talents on MySpace

MySpace is the place to connect with friends online and this book helps you make the most of it. Here's how to customize your profile so it stands out in a crowd,...

Nanoscale Materials in Chemistry
Nanoscale Materials in Chemistry
Nanotechnology is almost a household word now-a-days, or at least some word with ‘‘nano’’ in it, such as nanoscale, nanoparticle, nanophase, nanocrystal, or nanomachine. This field now enjoys worldwide attention and a National Nanotechnology Initiative (NNI) is about to be launched.

This
...
©2020 LearnIT (support@pdfchm.net) - Privacy Policy