
Step 2: Set up your data
Once you have your virtual machine configured, you need to install the tutorial sample data, which can be downloaded here.
Note: Before you set up a provider, make sure that Hunk has the following permissions:
- Read-only access to the HDFS directory where your virtual index data resides.
- Read-write access to the HDFS directory where your Splunk instance is installed. (This is usually your splunkMR directory, for example: User/hue/splunk_mr/dispatch.)
- Read-write access to the DataNode where your /tmp directory resides. This is the temp directory that the vix.splunk.home.datanode value points to in your Provider settings.
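If you need to verify or adjust these permissions, you can do so as the hdfs user with the standard hadoop fs commands. The paths below are placeholders based on the example values used later in this tutorial; substitute the directories and modes from your own Provider configuration. A minimal sketch:
# Check ownership and permissions on the virtual index data directory
su - hdfs -c "hadoop fs -ls hdfs://localhost:8020/data"
# Grant read access to the virtual index data directory
su - hdfs -c "hadoop fs -chmod -R 755 hdfs://localhost:8020/data"
# Grant read-write access to the splunkMR dispatch directory
su - hdfs -c "hadoop fs -chmod -R 775 hdfs://localhost:8020/user/hue/splunk_mr"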
1. SSH to your virtual machine, and move Hunkdata.json.gz and your Splunk download to the HDFS user's home directory. (If you are using the Cloudera quickstart VM, the password for the root user is "cloudera".)
For example:
scp Hunkdata.json.gz root@172.16.220.166:~
scp splunk-6.0-<version number>-Linux-x86_64.tgz root@172.16.220.166:~
ssh root@172.16.220.166
mv Hunkdata.json.gz ~hdfs    (this moves the data to the hdfs user's home directory)
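To confirm that the file landed in the hdfs user's home directory before loading it into HDFS, you can list it from the same SSH session. This is an optional check, not part of the original steps:
ls -l ~hdfs/Hunkdata.json.gz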
2. Put the data into HDFS as the hdfs user. For example:
su - hdfs -c "hadoop fs -mkdir hdfs://localhost:8020/data"
su - hdfs -c "hadoop fs -put ~/Hunkdata.json.gz hdfs://localhost:8020/data"
su - hdfs -c "hadoop fs -ls hdfs://localhost:8020/data"
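If you want to spot-check the uploaded sample data, you can stream a few decompressed lines directly from HDFS. This is an optional sanity check that assumes the same path as above:
su - hdfs -c "hadoop fs -text hdfs://localhost:8020/data/Hunkdata.json.gz" | head -5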