character set encoding

noun

The standard method used to encode characters received from IT data sources into a standardized character set. By default, Splunk applies UTF-8 encoding to any source that isn't already UTF-8 encoded or that is a non-ASCII file. If you would rather use a different character set, you can specify it with the CHARSET key in props.conf.

Splunk supports 71 languages, including 20 that aren't UTF-8 encoded. You can manually specify character sets for sources, or have Splunk identify them automatically.
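As a minimal sketch of both approaches, the props.conf stanzas below set CHARSET once manually and once for automatic detection. The host names GreekSource and my-foreign-docs are hypothetical placeholders, and ISO-8859-7 is only an illustrative choice of encoding; substitute the host, source, or source type and the character set that match your own data.

```
# props.conf (sketch; stanza names are hypothetical examples)

# Manually specify the character set for data from one host.
[host::GreekSource]
CHARSET = ISO-8859-7

# Have Splunk try to identify the character set automatically.
[host::my-foreign-docs]
CHARSET = AUTO
```

A manual setting is the more predictable option when the encoding of a source is known in advance; AUTO is a convenience for sources whose encoding varies or is unknown.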

For more information

In the Getting Data In manual:
