- Hadoop as a Data Lake
- The Hadoop Distributed File System (HDFS)
- Direct File Transfer to Hadoop HDFS
- Importing Data from Files into Hive Tables
- Importing Data into Hive Tables Using Spark
- Using Apache Sqoop to Acquire Relational Data
- Using Apache Flume to Acquire Data Streams
- Manage Hadoop Work and Data Flows with Apache Oozie
- Apache Falcon
- What's Next in Data Ingestion?
- Summary
Direct File Transfer to Hadoop HDFS
The easiest way to move data into and out of HDFS is to use the native HDFS commands. These commands are wrappers that interact with the HDFS file system; local commands, such as cp, ls, or mv, work only on local files. To copy a file (test) from your local file system to HDFS, the following put command can be used:
$ hdfs dfs -put test
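By default, put places the file in your HDFS home directory (typically /user/<username>). To copy the file into a specific HDFS directory instead, supply a destination path. The directory used here (/user/username/data) is only an example; it can be created first with mkdir:

$ hdfs dfs -mkdir -p /user/username/data
$ hdfs dfs -put test /user/username/data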
To view files in HDFS, use the following command. The result is a full listing similar to that of a locally executed ls -l command:
$ hdfs dfs -ls
-rw-r--r--   2 username hdfs        497 2016-05-11 14:32 test
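The ls command also accepts a path, so a specific directory or file can be listed. For example, assuming the example data directory created above:

$ hdfs dfs -ls /user/username/data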
To copy a file (another-test) from HDFS to your local file system, use the following get command:
$ hdfs dfs -get another-test
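To inspect a file without copying it back, or to place the copy in a particular local directory, the cat command or an explicit local destination can be used. Assuming another-test is a plain-text file:

$ hdfs dfs -cat another-test
$ hdfs dfs -get another-test /tmp/another-test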
Other HDFS commands will be introduced in the examples. Appendix B, “HDFS Quick Start,” provides basic command examples, including listing, copying, and removing files in HDFS.
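As a brief preview of the removal commands covered in the appendix, a single file or an entire directory can be deleted from HDFS with rm; the directory shown here is the example path used earlier:

$ hdfs dfs -rm test
$ hdfs dfs -rm -r /user/username/data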