- Introduction to HBase, the NoSQL Database for Hadoop: Programming HBase with Java
- Nov 4, 2014
- The first article in this series (“Introduction to HBase”) presented HBase, also known as the Hadoop database and described how to set up a local environment and manipulate data using the HBase shell. This article continues by demonstrating how to interact with HBase using Java. You learn how to put data into HBase, get data out of HBase, delete data from HBase, and how to perform a table scan to extract a range of records. Finally, you see how to set up an HBase project using Maven.
|
- Introduction to HBase, the NoSQL Database for Hadoop
- Oct 27, 2014
- HBase is called the Hadoop database because it is a NoSQL database that runs on top of Hadoop. It combines the scalability of Hadoop by running on the Hadoop Distributed File System (HDFS), with real-time data access as a key/value store and deep analytic capabilities of Map Reduce. This article introduces HBase and describes how it organizes and manages data and then demonstrates how to set up a local HBase environment and interact with data using the HBase shell.
|
- Introduction to Oracle Databases on Virtual Infrastructure
- Oct 22, 2014
- 99.9% of all database or data management systems should be considered candidates for virtualization on vSphere. In this chapter from Virtualizing Oracle Databases on vSphere, the authors argue that Oracle databases and software are prime candidates to consider migrating to virtualized infrastructure.
|
- Introducing NoSQL and MongoDB
- Sep 18, 2014
- In this chapter from NoSQL with MongoDB in 24 Hours, Sams Teach Yourself, learn about the design considerations to review before deciding how to implement the structure of data and configuration of a MongoDB database. You'll also learn which design questions to ask and then how to explore the mechanisms built into MongoDB to answer those questions.
|
- Hit the Ground Running with MongoDB and Python
- Sep 16, 2014
- Stephen B. Morris describes how to get started with MongoDB and Python. As usual with Python, you can get productive quickly, without worrying about complex IDEs. MongoDB has a simple data model and easy-to-understand semantics, giving you a handy on-ramp to this interesting technology.
|
- Introduction to Understanding Big Data Scalability
- Aug 18, 2014
- This introduction describes the goals of the Big Data Scalability four-volume series, focusing on the underlying growth of databases (the "data explosion") and providing some background into big data's relevance.
|
- SQL Queries for Mere Mortals: Thinking in Sets
-
By
Michael J. Hernandez
- Jul 1, 2014
- This chapter introduces the concept of an SQL set. It discusses each of the major set operations implemented in SQL in detail (intersection, difference, and union), and shows how to use set diagrams to visualize the problem you’re trying to solve. Finally, it introduces the basic SQL syntax and keywords (INTERSECT, EXCEPT, and UNION) for all three operations.
|
- Why Big Data and Analytics?
- Jun 12, 2014
- Get the inside story of how analytics is being used across the IBM enterprise in this introduction to Analytics Across the Enterprise: How IBM Realizes Business Value from Big Data and Analytics.
|
- Ten Tips to Realize Value from Big Data and Analytics
- Jun 10, 2014
- What does it really take to derive value from Big Data and Analytics? Co-authors of Analytics Across the Enterprise: How IBM Realizes Business Value from Big Data and Analytics, Brenda Dietrich, Emily Plachy and Maureen Norton, identify 10 top tips based on their years of experience at IBM “eating their own cooking.” Interviews with more than 70 executives, managers and analytic practitioners across IBM yielded 31 case studies across 9 different business functions which show the breadth challenges, outcomes, analytics techniques, and lessons learned to make your analytics journey to realize business value successful.
|
- Preface to Apache Hadoop YARN: Moving beyond MapReduce and Batch Processing with Apache Hadoop 2
- May 20, 2014
- While the power of YARN is easily comprehensible, the ability to exploit that power requires the user to understand the intricacies of building such a system in conjunction with YARN. This book aims to reconcile that dichotomy, as the authors explain in the preface to Apache Hadoop YARN: Moving beyond MapReduce and Batch Processing with Apache Hadoop 2.
|
- Apache Hadoop YARN: A Brief History and Rationale
- Mar 24, 2014
- This chapter provides a historical account of why and how Apache Hadoop YARN came about.
|
- Data Just Right Video Tutorials: How to Use Hadoop, Hive, Shark, R, Apache Pig, Mahout, and Google BigQuery
- Feb 26, 2014
- Michael Manoochehri provides viewers with an introduction to implementing practical solutions for common data problems. This excerpt from Data Just Right LiveLessonscontains three sample videos: 1. Loading Data into Hive, 2. Writing a Multistep MapReduce Job Using the mrjob Python Library, and 3. Using the Pandas Library for Analyzing Time Series Data.
|
- Data Reshaping in R
- Feb 3, 2014
- Jared P. Lander considers when the data needs to be rearranged from column oriented to row oriented (or the opposite) and when the data are in multiple, separate sets and need to be combined into one.
|
- The Basics of Monitoring Cassandra
- Jan 28, 2014
- This chapter covers the basics of monitoring Cassandra. These include file-based logging, inspection of the JVM, and monitoring of Cassandra itself.
|
- Four Rules for Data Success
- Jan 1, 2014
- Database technology is a fast-moving field filled with innovations. Michael Manoochehri describes the current state of the field and discusses the four rules for data success.
|
- What's New in SQL Server 2012
- Dec 13, 2013
- This chapter introduces the major new features provided in SQL Server 2012 and covers a number of the enhancements to previously available features.
|
- Working with Big Data: How to Set up a Basic Hadoop Installation
- Nov 25, 2013
- In this video, Paul Dix provides an overview of Hadoop, a framework of tools, libraries, and methodologies for big data analysis, and shows how to set up a basic Hadoop installation.
|
- Using Database Tools and Utilities in DB2
- Nov 19, 2013
- How do you work with DB2? How do you issue SQL and/or XQuery statements and enter DB2 commands? Are there graphical tools that can make your administration tasks easier? This chapter provides the answers to all of these questions. DB2 provides a wide range of tools, both graphical and command-driven, to help you work with DB2.
|
- Oracle Solaris 11 System Administration: Administering Storage Devices
- Aug 13, 2013
- System administrators need to know how to specify device names when using commands to manage disks, file systems, and other devices. This chapter describes disk device management in detail. It also describes disk device naming conventions as well as adding, configuring, and displaying information about disk devices attached to your system.
|
- Using SQL to Manage Data
- Apr 29, 2013
- SQL is the means by which you tell the server how to perform data management operations, and fluency with it is necessary for effective communication. This chapter covers how to use SQL to manage data, including
changing the SQL mode to affect server behavior, referring to elements of databases, using multiple character sets, creating and destroying databases, tables, and indexes, obtaining information about databases and their contents, retrieving data using joins, subqueries, and unions, using multiple-table deletes and updates, performing transactions that enable statements to be grouped or canceled, setting up foreign key relationships, and using the FULLTEXT search engine.
|