If you are ready to dive into the MapReduce framework for processing large datasets, this practical book takes you step by step through the algorithms and tools you need to build distributed MapReduce applications with Apache Hadoop or Apache Spark. Each chapter provides a recipe for solving a massive computational problem, such as building a recommendation system. You’ll learn how to implement the appropriate MapReduce solution with code that you can use in your projects.
Dr. Mahmoud Parsian covers basic design patterns, optimization techniques, and data mining and machine learning solutions for problems in bioinformatics, genomics, statistics, and social network analysis. This book also includes an overview of MapReduce, Hadoop, and Spark.
Market basket analysis for a large set of transactions
Data mining algorithms (K-means, KNN, and Naive Bayes)
Using huge genomic data to sequence DNA and RNA
Naive Bayes theorem and Markov chains for data and market prediction
Recommendation algorithms and pairwise document similarity
Linear regression, Cox regression, and Pearson correlation
Allelic frequency and mining DNA
Social network analysis (recommendation systems, counting triangles, sentiment analysis)
Making TeX Work (A Nutshell handbook)
TeX is a powerful tool for creating professional quality typeset text and is unsurpassed at typesetting mathematical equations, scientific text, and multiple languages. Many books describe how you use TeX to construct sentences, paragraphs, and chapters. Until now, no book has described all the software that actually lets you build,...
Web Developer's Reference Guide
A one-stop guide to the essentials of web development including popular frameworks such as jQuery, Bootstrap, AngularJS, and Node.js
About This Book
Walk through three of the...
Digitally Assisted Pipeline ADCs Digitally Assisted Pipeline ADCs: Theory and Implementation explores the opportunity to reduce ADC power dissipation by leveraging digital signal processing capabilities in fine line integrated circuit technology. The described digitally assisted pipelined ADC uses a statistics-based system identification technique as an enabling...
OpenGL SuperBible: Comprehensive Tutorial and Reference (5th Edition)
OpenGL® SuperBible, Fifth Edition is the definitive programmer’s guide, tutorial, and reference for the world’s leading 3D API for real-time computer graphics, OpenGL 3.3. The best all-around introduction to OpenGL for developers at all levels of experience, it clearly explains both the API...
Joel on Software
This is a selection of essays from the author's Web site, http://www.joelonsoftware.com. Joel Spolsky started the web log in March 2000 in order to offer his insights, based on years of experience, on how to improve the world of programming. His extraordinary writing skills, technical knowledge, and caustic wit have made him a programming guru....
Nietzsche's Protestant Fathers: A Study in Prodigal Christianity
Nietzsche was famously an atheist, despite coming from a strongly Protestant family. This heritage influenced much of his thought, but was it in fact the very thing that led him to his atheism? This work provides a radical re-assessment of Protestantism by documenting and extrapolating Nietzsche’s view that Christianity dies...