forked from docs/doc-exports
Reviewed-by: Pruthi, Vineet <vineet.pruthi@t-systems.com> Co-authored-by: Yang, Tong <yangtong2@huawei.com> Co-committed-by: Yang, Tong <yangtong2@huawei.com>
7.1 KiB
7.1 KiB
Interconnecting Flume with OBS
This section applies to MRS 3.x or later.
Before performing the following operations, ensure that you have configured a storage-compute decoupled cluster by referring to Configuring a Storage-Compute Decoupled Cluster (Agency) or Configuring a Storage-Compute Decoupled Cluster (AK/SK).
- Configure an agency.
- Log in to the MRS console. In the navigation pane on the left, choose Clusters > Active Clusters.
- Click the name of a cluster to go to the cluster details page.
- On the Dashboard page, click Synchronize on the right of IAM User Sync to synchronize IAM users.
- Click Manage Agency on the right of Agency, select the target agency, and click OK.
- Create an OBS file system for storing data.
- Log in to the OBS console.
- In the navigation pane on the left, choose Parallel File Systems. On the displayed page, click Create Parallel File System.
- Enter the file system name, for example, esdk-c-test-pfs1, and set other parameters as required. Click Create Now.
- In the parallel file system list on the OBS console, click the created file system name to go to its details page.
- In the navigation pane on the left, choose Files and click Create Folder to create the testFlumeOutput folder.
- Prepare the properties.properties file and upload it to the /opt/flumeInput directory.
- Prepare the properties.properties file on the local host. Its content is as follows:
# source server.sources = r1 # channels server.channels = c1 # sink server.sinks = obs_sink # ----- define net source ----- server.sources.r1.type = seq server.sources.r1.spooldir = /opt/flumeInput # ---- define OBS sink ---- server.sinks.obs_sink.type = hdfs server.sinks.obs_sink.hdfs.path = obs://esdk-c-test-pfs1/testFlumeOutput server.sinks.obs_sink.hdfs.filePrefix = %[localhost] server.sinks.obs_sink.hdfs.useLocalTimeStamp = true # set file size to trigger roll server.sinks.obs_sink.hdfs.rollSize = 0 server.sinks.obs_sink.hdfs.rollCount = 0 server.sinks.obs_sink.hdfs.rollInterval = 5 #server.sinks.obs_sink.hdfs.threadsPoolSize = 30 server.sinks.obs_sink.hdfs.fileType = DataStream server.sinks.obs_sink.hdfs.writeFormat = Text server.sinks.obs_sink.hdfs.fileCloseByEndEvent = false # define channel server.channels.c1.type = memory server.channels.c1.capacity = 1000 # transaction size server.channels.c1.transactionCapacity = 1000 server.channels.c1.byteCapacity = 800000 server.channels.c1.byteCapacityBufferPercentage = 20 server.channels.c1.keep-alive = 60 server.sources.r1.channels = c1 server.sinks.obs_sink.channel = c1
- Log in to the node where the Flume client is installed as user root.
- Create the /opt/flumeInput directory and create a customized .txt file in this directory.
- Log in to MRS Manager.
- Choose Cluster > Name of the target cluster > Services > Flume. On the displayed page, click Configurations and then Upload File in the Value column corresponding to the flume.config.file parameter, upload the properties.properties file prepared in 3.a, and click Save.
- Prepare the properties.properties file on the local host. Its content is as follows:
- View the result in the OBS system.
- Log in to the OBS console.
- Click Parallel File Systems and go to the folder created in 2 to view the result.
Parent topic: Interconnecting MRS with OBS Using an Agency