Home > Store

Data Analytics with Spark Using Python

Data Analytics with Spark Using Python

eBook (Watermarked)

  • Your Price: $28.79
  • List Price: $35.99
  • Estimated Release: Jun 1, 2018
  • Includes EPUB, MOBI, and PDF
  • About eBook Formats
  • This eBook includes the following formats, accessible from your Account page after purchase:

    ePub EPUB The open industry format known for its reflowable content and usability on supported mobile devices.

    MOBI MOBI The eBook format compatible with the Amazon Kindle and Amazon Kindle applications.

    Adobe Reader PDF The popular standard, used most often with the free Adobe® Reader® software.

    This eBook requires no passwords or activation to read. We customize your eBook by discreetly watermarking it with your name, making it uniquely yours.

Also available in other formats.

Register your product to gain access to bonus material or receive a coupon.


  • Copyright 2018
  • Dimensions: 7" x 9-1/8"
  • Pages: 400
  • Edition: 1st
  • eBook (Watermarked)
  • ISBN-10: 0-13-484609-5
  • ISBN-13: 978-0-13-484609-5

Spark for Data Professionals introduces and solidifies the concepts behind Spark 2.x, teaching working developers, architects, and data professionals exactly how to build practical Spark solutions. Jeffrey Aven covers all aspects of Spark development, including basic programming to SparkSQL, SparkR, Spark Streaming, Messaging, NoSQL and Hadoop integration. Each chapter presents practical exercises deploying Spark to your local or cloud environment, plus programming exercises for building real applications. Unlike other Spark guides, Spark for Data Professionals explains crucial concepts step-by-step, assuming no extensive background as an open source developer. It provides a complete foundation for quickly progressing to more advanced data science and machine learning topics. This guide will help you:

  • Understand Spark basics that will make you a better programmer and cluster “citizen”
  • Master Spark programming techniques that maximize your productivity
  • Choose the right approach for each problem
  • Make the most of built-in platform constructs, including broadcast variables, accumulators, effective partitioning, caching, and checkpointing
  • Leverage powerful tools for managing streaming, structured, semi-structured, and unstructured data

Sample Content

Table of Contents

1. Introducing Big Data and Apache Spark
2. Learning Spark Programming Basics
3. Advanced Programming using the Spark Core API
4. SQL and NoSQL Programming with Spark
5. Stream Processing and Messaging using Spark
6. Beginning Data Science and Machine Learning using Spark
7. Administering Spark


Submit Errata

More Information

Unlimited one-month access with your purchase
Free Safari Membership