“Eric and Russell were early adopters of Cassandra at SimpleReach. In Practical Cassandra, you benefit from their experience in the trenches administering Cassandra, developing against it, and building one of the first CQL drivers. If you are deploying Cassandra soon, or you inherited a Cassandra cluster to tend, spend some time with the deployment, performance tuning, and maintenance chapters… If you are new to Cassandra, I highly recommend the chapters on data modeling and CQL.”
–From the Foreword by Jonathan Ellis, Apache Cassandra Chair
Build and Deploy Massively Scalable, Super-fast Data Management Applications with Apache Cassandra
Practical Cassandra is the first hands-on developer’s guide to building Cassandra systems and applications that deliver breakthrough speed, scalability, reliability, and performance. Fully up to date, it reflects the latest versions of Cassandra, including Cassandra Query Language (CQL), which dramatically lowers the learning curve for Cassandra developers.
Pioneering Cassandra developers and DataStax MVPs Russell Bradberry and Eric Lubow walk you through every step of building a real production application that can store enormous amounts of structured, semi-structured, and unstructured data. Drawing on their exceptional expertise, Bradberry and Lubow share practical insights into issues ranging from querying to deployment, management, maintenance, monitoring, and troubleshooting.
The authors cover key issues, from architecture to migration, and guide you through crucial decisions about configuration and data modeling. They provide tested sample code, detailed explanations of how Cassandra works “under the covers,” and new case studies from three cutting-edge users: Ooyala, Hailo, and eBay.
Foreword by Jonathan Ellis
Foreword by Paul Dix
About the Authors
Chapter 1: Introduction to Cassandra
A Greek Story
What Is NoSQL?
There’s No Such Thing as “Web Scale”
ACID, CAP, and BASE
Where Cassandra Fits In
What Is Cassandra?
Cassandra Terminology
Our Hope
Chapter 2: Installation
Cluster Setup
Chapter 3: Data Modeling
The Cassandra Data Model
Model Queries—Not Data
Chapter 4: CQL
A Familiar Way of Doing Things
Chapter 5: Deployment and Provisioning
Keyspace Creation
Replication Strategies
Node Layout
Chapter 6: Performance Tuning
System Tuning
Solid-State Drives
JVM Tuning
Chapter 7: Maintenance
Understanding nodetool
Ring Information
ColumnFamily Statistics
Thread Pool Statistics
Backup and Restore
CommitLog Archiving
Chapter 8: Monitoring
JMX and MBeans
Health Checks
Chapter 9: Drivers and Sample Code
Chapter 10: Troubleshooting
Common Problems
Chapter 11: Architecture
Meta Keyspaces
Gossip Protocol
Failure Detection
Bloom Filters
Chapter 12: Case Studies
Appendix A: Getting Help
Preparing Information
Mailing Lists
Appendix B: Enterprise Cassandra
Titan by Aurelius