Splunk® Data Stream Processor

Connect to Data Sources and Destinations with DSP

DSP 1.2.0 is impacted by the CVE-2021-44228 and CVE-2021-45046 security vulnerabilities from Apache Log4j. To fix these vulnerabilities, you must upgrade to DSP 1.2.4. See Upgrade the Splunk Data Stream Processor to 1.2.4 for upgrade instructions.

On October 30, 2022, all 1.2.x versions of the Splunk Data Stream Processor will reach its end of support date. See the Splunk Software Support Policy for details.
This documentation does not apply to the most recent version of Splunk® Data Stream Processor. For documentation on the most recent version, go to the latest release.

Performance expectations for sending data from DSP pipelines to Splunk Enterprise

This page provides reference information about the performance testing of the performed by Splunk, Inc when sending data to a Splunk index with the Send to a Splunk Index with Batching or the Send to a Splunk Index sink functions. Use this information to optimize your Splunk Enterprise pipeline performance.

Many factors affect performance results, including file compression, event size, number of concurrent pipelines, deployment architecture, and hardware. These results represent reference information and do not represent performance in all environments.

To go beyond these general recommendations, contact Splunk Services to work on optimizing performance in your specific environment.

Improve performance

To maximize your performance, consider taking the following actions:

  • Enable batching. If you are using the Send to a Splunk Index with Batching function, then batching is already done for you. Otherwise, use either the Batch Bytes or Batch Records functions in your pipeline.
  • Do not use an SSL-enabled Splunk Enterprise server.
  • Disable HEC acknowledgments in the Send to a Splunk Index with Batching or the Send to a Splunk Index function.
  • Enable async = true in Send to a Splunk Index with Batching or the Send to a Splunk Index function.
  • Run DSP on a 5 GigE full duplex network.
  • Parallelize DSP with your data source. Parallelization of DSP jobs is determined by the number of partitions or shards in the upstream source.
    • When using Kafka as a data source, use multiple partitions (example: 16) in the Kafka topic that your DSP pipeline reads from.
    • When using Kinesis as a data source, use multiple shards (example: 16) in the Kinesis stream that your DSP pipeline reads from.

Your Send to a Splunk Index with Batching or Send to a Splunk Index sink functions should have the following additional parameters for performance optimization.

This screen image shows a screenshot of the Send to a Splunk Index sink function with the appropriate parameters filled out.

Last modified on 25 March, 2022
Format metrics data in DSP for Splunk indexes   Connecting SignalFx to your DSP pipeline

This documentation applies to the following versions of Splunk® Data Stream Processor: 1.2.0, 1.2.1-patch02, 1.2.1, 1.2.2-patch02, 1.2.4, 1.2.5


Was this topic useful?







You must be logged into splunk.com in order to post comments. Log in now.

Please try to keep this discussion focused on the content covered in this documentation topic. If you have a more general question about Splunk functionality or are experiencing a difficulty with Splunk, consider posting a question to Splunkbase Answers.

0 out of 1000 Characters