About lookup cache quotas

You can cache the contents of a lookup to improve lookup performance, however there are some limitations that you should be aware of. These limitations only apply when you are using the Lookup function, not the Write Thru KV Store function.

The lookup cache is subject to a quota, or a maximum amount of data that can be contained, per pipeline. The following table describes the cache quota that applies for each type of lookup.

Lookup type	Default cache quota per pipeline
CSV	50 MiB
KV Store	200 MiB

Although there are different cache quotas for each lookup type, the percentage of cache quotas that can be used is shared between CSV and KV Store lookups. For example, if 30% of the CSV cache quota is used, then only 70% of the KV Store cache quota remains available. As another example, if you have the following:

A pipeline with four lookup functions: two lookups to CSV files and two lookups to KV Stores.
CSV files that are sizes 10MiB and 20MiB in size.

In this case, you have used 30MiB of your 50MiB total, or 60% of your total quota. That means that you have 40% or 80MiB of the cache quota remaining for KV Store lookups (0.4*200MiB = 80MiB). In order to stay under the cache size limitations, the cache_size parameter for your KV Store connection should be 40MiB. Since you have two KV Store lookups in your pipeline, this adds up to 80MiB or 83886080 bytes.

Configure the maximum lookup cache quota

To ensure that your pipelines using a lookup are not cancelled, best practices are to ensure that all lookup results fit into the cache. There are two settings that control the lookup cache quota.

The lookup_quota_max_static_mb setting specifies the maximum cache quota per pipeline for CSV lookups.
The lookup_quota_max_cached_bytes setting specifies the maximum cache quota per pipeline for KV Store lookups.

In addition to the two settings above, you may need to update the Kubernetes memory settings to make sure that you have enough memory to support the desired cache quotas.

The tm_mem_limit setting specifies the minimum amount of memory assigned to the pod.
The tm_mem_request setting specifies the maximum amount of memory assigned to the pod.

Configure the cache quota for CSV lookups

Do the following steps to increase the cache quota for CSV lookups. This allows you to upload CSV files larger than 50MiB using the Streams API.

Navigate to the working directory of a DSP controller node.
Configure the cache quota for CSV lookups by running the following command in the command-line. The value must be in mebibytes (MiB).
```
./dsp config set streams lookup_quota_max_static_mb=<value>
```
Since you are increasing the cache quota, you should also increase the minimum amount of memory allocated to a pod accordingly. For a list of accepted memory sizes, see the "Managing Resources for Containers" section in the Kubernetes documentation.
```
./dsp config set flink tm_mem_request=<value>
```
Since you are increasing the cache quota, you should also increase the maximum amount of memory allocated to a pod accordingly. For a list of accepted memory sizes, see the "Managing Resources for Containers" section in the Kubernetes documentation.
```
./dsp config set flink tm_mem_limit=<value>
```
Deploy your changes.
```
./dsp deploy streams flink
```

Even if you increase the CSV cache quota, there is still a maximum file size of 50MB when uploading a CSV file using the UI. If you want to upload a larger CSV file, you'll need to upload the CSV file using the Streams API. See Upload a CSV file to the to enrich data with a lookup.

Configure the cache quota for KV Store lookups

Navigate to the working directory of a DSP controller node.
Configure the cache quota for KV Store lookups by running the following command in the command-line. The value must in bytes.
```
./dsp config set streams lookup_quota_max_cached_bytes=<value>
```
Since you are increasing the cache quota, you should also increase the minimum amount of memory allocated to a pod accordingly. For a list of accepted memory sizes, see the "Managing Resources for Containers" section in the Kubernetes documentation.
```
./dsp config set flink tm_mem_request=<value>
```
Since you are increasing the cache quota, you should also increase the maximum amount of memory allocated to a pod accordingly. For a list of accepted memory sizes, see the "Managing Resources for Containers" section in the Kubernetes documentation.
```
./dsp config set flink tm_mem_limit=<value>
```
Deploy your changes.
```
./dsp deploy streams flink
```

Related answers from Splunk Community

About lookup cache quotas

Configure the maximum lookup cache quota

Configure the cache quota for CSV lookups

Configure the cache quota for KV Store lookups

Comments

About lookup cache quotas

Was this topic useful?