This eBook includes the following formats, accessible from your Account page after purchase:
EPUB The open industry format known for its reflowable content and usability on supported mobile devices.
MOBI The eBook format compatible with the Amazon Kindle and Amazon Kindle applications.
PDF The popular standard, used most often with the free Adobe® Reader® software.
This eBook requires no passwords or activation to read. We customize your eBook by discreetly watermarking it with your name, making it uniquely yours.
Also available in other formats.
Register your product to gain access to bonus material or receive a coupon.
As adoption of Hadoop accelerates in the enterprise and beyond, there's soaring demand for those who can solve real world problems by applying advanced data science techniques in Hadoop environments. Now there's a complete and up-to-date guide to data science with Hadoop: high-level concepts, deep-dive techniques, practical applications, hands-on tutorials, and real-world use cases. Drawing on their immense experience with Hadoop in enterprise Big Data environments, this book's authors bring together all the practical knowledge you'll need to do real, useful data science with Hadoop. Coverage includes:
Part 1: Data Science with Hadoop - An Overview
1. Introduction to Data Science
2. Data Science Use-Cases
3. Hadoop and Data Science
Part 2: The Process of Data Science with Hadoop
4. The Process of Data Science
5. Getting the Data into Hadoop
6. Data Preparation
7. Data Modeling
Part 3: Real World Examples
9. Building a Recommender System With Mahout
10. Customer Segmentation with Kmeans
11. Analyzing Sentiment
12. Predictive Risk Modeling
Part 4: The Road Ahead
13. Advanced Topics
14. The Data Science Journey