Search a virtual index

Splunk Analytics for Hadoop reaches End of Life on January 31, 2025.

After you install and configure your virtual indexes, you can create reports and visualize data as you would with data in a traditional Splunk index. When you use virtual indexes alongside traditional Splunk Enterprise indexes, you can gather data from the virtual index alone, or you can query both local and virtual indexes for a single report.

For the most part, you can create reports for virtual indexes much as you would for local indexes. For more information about creating reports, see the Splunk Enterprise Search Manual.

Due to the size and the nature of Hadoop datastores, there are certain Splunk Enterprise index behaviors that cannot be duplicated:

Splunk Analytics for Hadoop currently doesn't support real-time searching of Hadoop data, although preview functionality is available.
Searches against a virtual index do not always return data as quickly as searches against a local index.

Because events in a virtual index are not sorted, any search command that depends on implicit time order (for example, head, delta, or transaction) does not work exactly the way you would expect. As a result, a few search commands operate differently when used on virtual indexes, mostly because of the way Hadoop reports timestamps.

You can still use these commands, and may particularly want to when creating a single report for local and virtual indexes, but you should be aware of how they operate and return data differently.
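
The effect of unsorted events on order-dependent commands can be illustrated with a minimal Python sketch. This is not Splunk code: the event list, the field names, and the `head` and `delta` functions are hypothetical stand-ins that only model "take the first N events as they arrive" and "subtract the previous event's value from the current one".

```python
from typing import Dict, List

Event = Dict[str, int]

# Hypothetical events in the order a virtual index might return them.
# "_time" is an epoch timestamp; "bytes" is a made-up sample field.
events_unsorted: List[Event] = [
    {"_time": 1700000200, "bytes": 30},
    {"_time": 1700000400, "bytes": 50},
    {"_time": 1700000100, "bytes": 20},
    {"_time": 1700000300, "bytes": 40},
]

# The same events in descending time order, as a local index would return them.
events_sorted: List[Event] = sorted(
    events_unsorted, key=lambda e: e["_time"], reverse=True
)


def head(events: List[Event], n: int) -> List[Event]:
    """Model of head: keep the first n events in arrival order."""
    return events[:n]


def delta(events: List[Event], field: str) -> List[int]:
    """Model of delta: current value minus the previous event's value."""
    values = [e[field] for e in events]
    return [cur - prev for prev, cur in zip(values, values[1:])]


# With descending time order, head returns the two most recent events.
head_sorted = [e["_time"] for e in head(events_sorted, 2)]
# Without guaranteed order, head returns whichever two events arrive first.
head_unsorted = [e["_time"] for e in head(events_unsorted, 2)]

delta_sorted = delta(events_sorted, "bytes")
delta_unsorted = delta(events_unsorted, "bytes")

print(head_sorted, head_unsorted)
print(delta_sorted, delta_unsorted)
```

In this sketch, the same four events produce different `head` and `delta` results depending only on the order in which they arrive, which is why a report that combines local and virtual indexes can return unexpected values for these commands.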

How Splunk Analytics for Hadoop uses search language

For the most part, you can use the Splunk Enterprise search language to create your reports. However, because Splunk Analytics for Hadoop does not guarantee a strict order of events, there are a few differences.

The following commands are not supported when the search includes a virtual index:

transaction
localize

The following commands work on virtual indexes, but their results may differ from the results you get on a local Splunk Enterprise index. This is because in Splunk Analytics for Hadoop, descending time order of events is not guaranteed:

streamstats
head
delta
tail
reverse
eventstats
dedup (Because the command cannot determine the order of events within an HDFS directory, Splunk Analytics for Hadoop chooses the item to remove based on modified time or file order.)
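
The dedup behavior described above can be sketched in Python. This is an illustrative model only, not Splunk code: the `dedup` function, the events, and the field names are hypothetical, and the sketch assumes that the first event seen for each field value is the one that is kept.

```python
from typing import Dict, List, Union

Event = Dict[str, Union[int, str]]


def dedup(events: List[Event], field: str) -> List[Event]:
    """Model of dedup: keep the first event seen for each value of field."""
    seen = set()
    kept: List[Event] = []
    for event in events:
        key = event[field]
        if key not in seen:
            seen.add(key)
            kept.append(event)
    return kept


# Hypothetical events in file order, which is not time order.
events_by_file_order: List[Event] = [
    {"_time": 1700000100, "host": "web01", "status": 500},
    {"_time": 1700000300, "host": "web01", "status": 200},
    {"_time": 1700000200, "host": "web02", "status": 404},
]

# The same events in descending time order, as a local index would return them.
events_by_time_desc: List[Event] = sorted(
    events_by_file_order, key=lambda e: e["_time"], reverse=True
)

kept_file_order = dedup(events_by_file_order, "host")
kept_time_desc = dedup(events_by_time_desc, "host")

# For web01, file order keeps the older event and time order keeps the newer one.
print([e["status"] for e in kept_file_order])
print([e["status"] for e in kept_time_desc])
```

In this sketch, both runs keep one event per host, but the surviving event for `web01` differs because the duplicate that is dropped depends on which event is encountered first.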
