This chapter has set the tone for the book by discussing the limitations of Hadoop along the lines of the seven giants. It has also brought out the three dimensions along which thinking beyond Hadoop is necessary:
- Real-time analytics: Storm and Spark streaming are the choices.
- Analytics involving iterative ML: Spark is the technology of choice.
- Specialized data structures and processing requirements for these: GraphLab is an important paradigm to process large graphs.
These are elaborated in the subsequent chapters of this book. Happy reading!