Configure your HDFS source
Splunk Analytics for Hadoop reaches End of Life on January 31, 2025.
1. In the Home page, click Explore Data.
2. Select a Provider and Virtual Index. The list of available indexes are based on the provider you select in the Provider drop down.
3. Click Next.
4. In the list of directories and source files, select the item you want to explore. You can drill down to the file level of available directories.
5. If you are done exploring your selected source and do not wish to view or modify the source configuration, you can close the window now, or click the back arrow to explore a different file. To view and modify the source configuration, click Next.
6. In the "Preview data" page you can view and modify your file settings for event processing to make sure that your data is processed correctly.
- View summary information: Click "View Event Summary" on the left-hand side of the page to see:
- The size of the sample data, in bytes.
- The number of events.
- The chart that represents the distribution of the events over time.
- A breakdown of the number of lines each event.
- Adjust time stamps and event breaks: Edit timestamps and event breaks, save the edits as a new source type and apply it to the data. See Assign the right source type to your data in the Getting Data In manual for information about sourcetypes.
Note: CSV and Avro data in Splunk Analytics for Hadoop is parsed as JSON. If you have problems viewing your data, try changing sourcetype to JSON.
- Add advanced settings: Add new attributes to your source data. For more information about adding attributes, see Provider configuration variables for more information about these attributes.
7. Click Next to save any changes.
8. In the "Enter Context Settings" page, assign App Context and App Sharing to your source. This determines the apps and users that will use the configuration settings for that file.
9. Click Next.
10. Review your configuration information in the Review Settings page. You can also optionally change the source file. For example, you can add wildcards to widen the source searched or restrict it further. Optionally use the Source File Clipboard window to copy your source configuration to other files in your directories.
10. Click Finish. You can now search your configured source or explore another source file.
Explore and configure Hadoop source files in the HDFS Explorer | How distributable and non-distributable commands work in Splunk Analytics for Hadoop (and what works best) |
This documentation applies to the following versions of Splunk® Enterprise: 7.0.0, 7.0.1, 7.0.2, 7.0.3, 7.0.4, 7.0.5, 7.0.6, 7.0.7, 7.0.8, 7.0.9, 7.0.10, 7.0.11, 7.0.13, 7.1.0, 7.1.1, 7.1.2, 7.1.3, 7.1.4, 7.1.5, 7.1.6, 7.1.7, 7.1.8, 7.1.9, 7.1.10, 7.2.0, 7.2.1, 7.2.2, 7.2.3, 7.2.4, 7.2.5, 7.2.6, 7.2.7, 7.2.8, 7.2.9, 7.2.10, 7.3.0, 7.3.1, 7.3.2, 7.3.3, 7.3.4, 7.3.5, 7.3.6, 7.3.7, 7.3.8, 7.3.9, 8.0.0, 8.0.1, 8.0.2, 8.0.3, 8.0.4, 8.0.5, 8.0.6, 8.0.7, 8.0.8, 8.0.9, 8.0.10, 8.1.0, 8.1.1, 8.1.2, 8.1.3, 8.1.4, 8.1.5, 8.1.6, 8.1.7, 8.1.8, 8.1.9, 8.1.10, 8.1.11, 8.1.12, 8.1.13, 8.1.14, 8.2.0, 8.2.1, 8.2.2, 8.2.3, 8.2.4, 8.2.5, 8.2.6, 8.2.7, 8.2.8, 8.2.9, 8.2.10, 8.2.11, 8.2.12, 9.0.0, 9.0.1, 9.0.2, 9.0.3, 9.0.4, 9.0.5, 9.0.6, 9.0.7, 9.0.8, 9.0.9, 9.0.10, 9.1.0, 9.1.1, 9.1.2, 9.1.3, 9.1.4, 9.1.5, 9.1.6, 9.2.0, 9.2.1, 9.2.2, 9.2.3, 9.3.0, 9.3.1
Feedback submitted, thanks!