forked from docs/doc-exports
Reviewed-by: Hasko, Vladimir <vladimir.hasko@t-systems.com> Co-authored-by: Yang, Tong <yangtong2@huawei.com> Co-committed-by: Yang, Tong <yangtong2@huawei.com>
4.6 KiB
How Do I Get My Data into OBS or HDFS?
MRS can process data in OBS and HDFS. You can get your data into OBS or HDFS as follows:
- Upload local data to OBS.
  - Log in to the OBS console.
  - Create a parallel file system named userdata on OBS and create the program, input, output, and log folders in the file system.
    - Choose Parallel File System > Create Parallel File System, and create a file system named userdata.
    - In the OBS file system list, click the file system name userdata, choose Files > Create Folder, and create the program, input, output, and log folders.
  - Upload data to the userdata file system.
    - Go to the program folder and click Upload File.
    - Click Add File and select a user program.
    - Click Upload.
    - Upload the user data file to the input folder using the same method.
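The folder layout described above can be sketched as a small helper that maps local files to their target locations in the userdata file system. This is only an illustration under stated assumptions: the obs:// URI form, the function names obs_uri and plan_uploads, and the sample file names are hypothetical and do not come from this page. The sketch does not contact OBS; the upload itself is still done on the console.

```python
from pathlib import PurePosixPath

# Folder names taken from the steps above.
FOLDERS = ("program", "input", "output", "log")


def obs_uri(file_system, folder, local_path=""):
    """Build an obs:// URI for a folder, or for a file placed in that folder.

    The obs://<file system>/<folder>/ form is an assumption for illustration.
    """
    if folder not in FOLDERS:
        raise ValueError(f"unknown folder: {folder}")
    uri = f"obs://{file_system}/{folder}/"
    if local_path:
        # Keep only the file name; the local directory is not mirrored on OBS.
        uri += PurePosixPath(local_path).name
    return uri


def plan_uploads(file_system, program_files, input_files):
    """Return (local_path, target_uri) pairs. Nothing is uploaded here."""
    plan = [(p, obs_uri(file_system, "program", p)) for p in program_files]
    plan += [(p, obs_uri(file_system, "input", p)) for p in input_files]
    return plan


if __name__ == "__main__":
    # Hypothetical local files, for illustration only.
    uploads = plan_uploads(
        "userdata",
        program_files=["/home/me/wordcount.jar"],
        input_files=["/home/me/data.txt"],
    )
    for local, target in uploads:
        print(local, "->", target)
```

Writing the plan down first makes it easy to confirm that programs go to the program folder and data files to the input folder before uploading anything. The helper expects POSIX-style local paths.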
- Import OBS data to HDFS.
  You can import OBS data to HDFS only when Kerberos authentication is disabled and the cluster is running.
  - Log in to the MRS console.
  - Click the name of the cluster.
  - On the page displayed, select the Files tab page and click HDFS File List.
  - Select a data directory, for example, bd_app1.
    The bd_app1 directory is only an example. You can use any directory on the page or create a new one.
  - Click Import Data and click Browse to select an OBS path and an HDFS path.
  - Click OK.
    You can view the file upload progress on the File Operation Records tab page.
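The two preconditions for importing (Kerberos authentication disabled, cluster running) can be checked before opening the Import Data dialog. The sketch below is a minimal illustration, not an MRS API: the ImportRequest class, the import_blockers function, the "Running" state label, and the path format checks are all assumptions introduced here.

```python
from dataclasses import dataclass


@dataclass
class ImportRequest:
    kerberos_enabled: bool
    cluster_state: str  # hypothetical state label, for example "Running"
    obs_path: str       # assumed to look like obs://userdata/input/
    hdfs_path: str      # assumed to be an absolute path such as /bd_app1


def import_blockers(req):
    """Return the reasons an import cannot proceed; an empty list means none."""
    problems = []
    # Preconditions stated in the text above.
    if req.kerberos_enabled:
        problems.append("Kerberos authentication is enabled")
    if req.cluster_state.lower() != "running":
        problems.append("cluster is not running")
    # Path format checks are assumptions added for illustration.
    if not req.obs_path.startswith("obs://"):
        problems.append("OBS path should start with obs://")
    if not req.hdfs_path.startswith("/"):
        problems.append("HDFS path should be absolute")
    return problems


if __name__ == "__main__":
    request = ImportRequest(
        kerberos_enabled=False,
        cluster_state="Running",
        obs_path="obs://userdata/input/",
        hdfs_path="/bd_app1",
    )
    blockers = import_blockers(request)
    print("ready to import" if not blockers else blockers)
```

Returning a list of reasons rather than a single boolean lets every unmet condition be reported at once.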