Splunk® Enterprise

Splunk Analytics for Hadoop

Splunk Enterprise version 9.0 will no longer be supported as of June 14, 2024. See the Splunk Software Support Policy for details. For information about upgrading to a supported version, see How to upgrade Splunk Enterprise.

About data model acceleration

Splunk Analytics for Hadoop reaches End of Life on January 31, 2025.

For more information about how Data Model Acceleration works in Splunk Enterprise, see Accelerate data manuals in the Splunk Enterprise documentation.

Data model acceleration lets you create data models for data that includes virtual indexes. Splunk Analytics for Hadoop Data Model Acceleration uses cached information that can map extremely large datasets to accelerate searches.

Data model information that is stored in Splunk Enterprise indexes uses tsidx files. Splunk Analytics for Hadoop data models access data for virtual indexes that points to data in Hadoop, so you can create data models on any file types that a virtual index can point to. Splunk Analytics for Hadoop stores the data model acceleration files in Parquet and ORC.

For more information about how Data Model Acceleration works in Splunk Enterprise, see Accelerate data manuals in the Splunk Enterprise documentation.

How data model acceleration works in Splunk Analytics for Hadoop

1. You create a new data model Configure data model acceleration

2. Splunk Analytics for Hadoop creates a data model acceleration summary file for each raw data file:

  • Splunk Analytics for Hadoop maintains information about the data model acceleration summary files in the KV Store (this allows fast lookups).
  • Splunk Analytics for Hadoop stores the actual data model acceleration summary files in Hadoop.

3. In the case of a data model that covers a large time range, such as "one year" or "all time". You can partition data model files into buckets the same way you would in Splunk Analytics for Hadoop Report Acceleration. This makes both updating lookup files at model creation time and loading lookup files at search time faster. For more information see Accelerate Reports.

4. You can run a search on data in Splunk Analytics for Hadoop virtual indexes that can also include Splunk Enterprise indexes. If a data model exists for any Splunk Enterprise data, data model acceleration will be applied as described In Accelerate data models in the Splunk Knowledge Manager Manual.

5. For tstats/pivot searches on data models that are based off of Virtual Indexes, Splunk Analytics for Hadoop uses the KV Store to verify if an acceleration summary file exists for a raw data split.

6. If an acceleration summary file is not found for the raw data split, Splunk Analytics for Hadoop falls back to the original data file and does a regular search of the virtual index.

7. If an acceleration summary file is found, then the summary file is used instead of the raw data file.

8. The existing saved summary data is used to return data faster for a search that is comprised of all or part of that data model.

Last modified on 15 April, 2024
Manage report acceleration   Configure data model acceleration

This documentation applies to the following versions of Splunk® Enterprise: 7.0.0, 7.0.1, 7.0.2, 7.0.3, 7.0.4, 7.0.5, 7.0.6, 7.0.7, 7.0.8, 7.0.9, 7.0.10, 7.0.11, 7.0.13, 7.1.0, 7.1.1, 7.1.2, 7.1.3, 7.1.4, 7.1.5, 7.1.6, 7.1.7, 7.1.8, 7.1.9, 7.1.10, 7.2.0, 7.2.1, 7.2.2, 7.2.3, 7.2.4, 7.2.5, 7.2.6, 7.2.7, 7.2.8, 7.2.9, 7.2.10, 7.3.0, 7.3.1, 7.3.2, 7.3.3, 7.3.4, 7.3.5, 7.3.6, 7.3.7, 7.3.8, 7.3.9, 8.0.0, 8.0.1, 8.0.2, 8.0.3, 8.0.4, 8.0.5, 8.0.6, 8.0.7, 8.0.8, 8.0.9, 8.0.10, 8.1.0, 8.1.1, 8.1.2, 8.1.3, 8.1.4, 8.1.5, 8.1.6, 8.1.7, 8.1.8, 8.1.9, 8.1.10, 8.1.11, 8.1.12, 8.1.13, 8.1.14, 8.2.0, 8.2.1, 8.2.2, 8.2.3, 8.2.4, 8.2.5, 8.2.6, 8.2.7, 8.2.8, 8.2.9, 8.2.10, 8.2.11, 8.2.12, 9.0.0, 9.0.1, 9.0.2, 9.0.3, 9.0.4, 9.0.5, 9.0.6, 9.0.7, 9.0.8, 9.0.9, 9.0.10, 9.1.0, 9.1.1, 9.1.2, 9.1.3, 9.1.4, 9.1.5, 9.1.6, 9.2.0, 9.2.1, 9.2.2, 9.2.3, 9.3.0, 9.3.1


Was this topic useful?







You must be logged into splunk.com in order to post comments. Log in now.

Please try to keep this discussion focused on the content covered in this documentation topic. If you have a more general question about Splunk functionality or are experiencing a difficulty with Splunk, consider posting a question to Splunkbase Answers.

0 out of 1000 Characters