Spark for Python Developers
Set up real-time streaming and batch data intensive infrastructure using Spark and Python
Deliver insightful visualizations in a web app using Spark (PySpark)
Inject live data using Spark Streaming with real-time events
In this fast-paced book on the Docker open standards platform for developing, packaging and running portable distributed applications, Deepak Vorha
discusses how to build, ship and run applications on any platform such as a PC, the cloud, data center or a virtual machine. He describes how to install and...
An Inconstant Landscape: The Maya Kingdom of El Zotz, Guatemala
Presenting the results of six years of archaeological survey and excavation in and around the Maya kingdom of El Zotz, An Inconstant Landscape
paints a complex picture of a dynamic landscape over the course of almost 2,000 years of occupation. El Zotz was a dynastic seat of the Classic period in Guatemala. Located between...
Complete Guide to Open Source Big Data Stack
See a Mesos-based big data stack created and the components used. You will use currently available Apache full and incubating systems. The components are introduced by example and you learn how they work together.
In the Complete Guide to Open Source Big Data Stack, the author begins by creating a private...
Programming Hive introduces Hive, an essential tool in the Hadoop ecosystem that
provides an SQL (Structured Query Language) dialect for querying data stored in the
Hadoop Distributed Filesystem (HDFS), other filesystems that integrate with Hadoop,
such as MapR-FS and Amazon’s S3 and databases like HBase (the...
Mastering Apache Spark
Gain expertise in processing and storing data by using advanced techniques with Apache Spark
About This Book
Explore the integration of Apache Spark with third party applications such as H20, Databricks and Titan
Evaluate how Cassandra and Hbase can be used for storage
OpenStack Trove is your step-by-step guide to set up and run a secure and scalable cloud Database as a Service (DBaaS) solution. The book shows you how to set up and configure the Trove DBaaS framework, use prepackaged or custom database implementations, and provision and operate a variety of databasesâincluding MySQL,...
Cassandra High Performance Cookbook
Apache Cassandra is a fault-tolerant, distributed data store which offers linear scalability allowing it to be a storage platform for large high volume websites.
This book provides detailed recipes that describe how to use the features of Cassandra and improve its performance. Recipes cover topics ranging from setting up Cassandra...
NoSQL For Dummies
Get up to speed on the nuances of NoSQL databases and what they mean for your organization
This easy to read guide to NoSQL databases provides the type of no-nonsense overview and analysis that you need to learn, including what NoSQL is and which database is right for you. Featuring specific evaluation criteria for NoSQL databases,...
|Result Page: 3 2 1 |