How the Edge Processor solution transforms data

You can use the Edge Processor solution to transform your data in a wide variety of ways, including but not limited to:

Breaking multiline data into distinct events
Assigning event timestamps
Extracting raw data values into top-level event fields
Masking or updating data values

Some of these transformations can also be applied by other Splunk software that is part of your end-to-end data ingestion pathway, such as forwarders and indexers. When you use an Edge Processor to process and route data, the way that your data is ultimately transformed varies depending on the specific combination of data sources and destinations that are involved. For example, in some cases, the data source assigns the event timestamps. In other cases, the Edge Processor assigns the timestamps, and you might need to configure a pipeline to extract date and time information from the raw data in order to create the event timestamp.

This page explains how and where your data is transformed as it passes through an Edge Processor.

For a high-level summary of how an Edge Processor transforms various types of data as it receives that data, passes it through pipelines, and then exports the data to a destination, see Data transformation overview.
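For details about how your data is transformed for each specific combination of data sources and destinations that the Edge Processor supports, as well as guidance on how to configure certain transformations, see the following sections:

Heavy forwarder
Universal forwarder without INDEXED_EXTRACTIONS
Universal forwarder with INDEXED_EXTRACTIONS
HTTP client using the services/collector HEC endpoint
HTTP client using the services/collector/raw HEC endpoint
Syslog devices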

Data transformation overview

When an Edge Processor processes data, the data goes through 3 phases of transformation. The exact transformations that take place depend on the type of data.

The 3 phases are as follows:

Receiver phase: The Edge Processor receives the data from a data source, and completes preliminary transformations to prepare the data before passing it to the pipelines.
Pipeline phase: The pipelines that are applied to the Edge Processor route and transform the data according to SPL2 configurations.
Exporter phase: The Edge Processor completes finalizing changes to ensure that the data is compatible for storage in the specified destination, and then sends the transformed data out to that destination.

During the receiver phase, the specific transformations that take place are determined by the type of data that the Edge Processor received. The supported data can be categorized into the following types:

Unparsed S2S: Data that comes from other Splunk software and is not fully parsed. This corresponds to data from a universal forwarder that does not have the INDEXED_EXTRACTIONS property configured.
Parsed S2S: Data that comes from other Splunk software and is fully parsed. This corresponds to data from a heavy forwarder or a universal forwarder that has the INDEXED_EXTRACTIONS property configured.
RFC-compliant syslog: Syslog data that is formatted in a way that complies with the specified RFC protocol. Edge Processors can be configured to use RFC 3164, 5424, or 6587.
Non-RFC-compliant syslog: Syslog data that does not comply with the specified RFC protocol.
HEC raw: Data sent by an HTTP client to the Edge Processor through the services/collector/raw HEC endpoint.
HEC event: Data sent by an HTTP client to the Edge Processor through the services/collector HEC endpoint.
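
The categories above can be summarized as a small lookup. The following Python sketch is only an illustration of the list, not part of the Edge Processor solution: the function name and the source labels such as `heavy_forwarder` and `hec_raw` are hypothetical names chosen for this example.

```python
def classify_input(source: str,
                   indexed_extractions: bool = False,
                   rfc_compliant: bool = True) -> str:
    """Map a data source to the data type categories listed above.

    The source labels are hypothetical; they are not Edge Processor settings.
    """
    if source == "heavy_forwarder":
        # Heavy forwarders always send fully parsed data.
        return "Parsed S2S"
    if source == "universal_forwarder":
        # INDEXED_EXTRACTIONS makes the universal forwarder parse the data.
        return "Parsed S2S" if indexed_extractions else "Unparsed S2S"
    if source == "syslog":
        # Compliance is judged against the RFC protocol selected for the port.
        return "RFC-compliant syslog" if rfc_compliant else "Non-RFC-compliant syslog"
    if source == "hec_raw":
        return "HEC raw"  # services/collector/raw endpoint
    if source == "hec_event":
        return "HEC event"  # services/collector endpoint
    raise ValueError(f"unsupported source: {source}")


if __name__ == "__main__":
    print(classify_input("universal_forwarder"))
    print(classify_input("universal_forwarder", indexed_extractions=True))
```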

The following flowchart outlines the data transformations that take place as data passes through an Edge Processor.

Heavy forwarder

If your Edge Processor receives the data from a heavy forwarder, then your data is transformed as follows:

Data transformation	How the transformation happens	Configuration tips	Documentation
Break data into events	The heavy forwarder breaks the data into events based on the configurations in the props.conf file.	Configure line breaking and merging in the props.conf file of the heavy forwarder.	props.conf in the Splunk Enterprise Admin Manual Configure event line breaking in the Splunk Cloud Platform Getting Data In manual
Extract data values into event fields	First, the heavy forwarder extracts the indexed fields specified by the props.conf and transforms.conf files. Then, the Edge Processor extracts additional fields based on the configurations in the applied pipelines. Finally, if the data is sent to a Splunk platform HEC destination, then the Splunk platform extracts fields based on the configurations in the props.conf and transforms.conf files, including ingest actions. If a Splunk platform S2S destination is used instead, then the Splunk platform extracts fields based on ingest actions only.	Depending on your particular business requirements and use cases, you might need to configure field extractions at different points of the data ingestion pathway. Consider consolidating your custom field extractions into the Edge Processor pipelines where possible. This approach lets you use the broader range of transformations supported by SPL2, and allows you to manage your configurations from one centralized location.	The Working with pipelines chapter Extract fields from event data using an Edge Processor
Assign event timestamps	The heavy forwarder assigns event timestamps based on the configurations in the props.conf file.	Configure timestamp assignment in the props.conf file of the heavy forwarder. Make sure that the pipelines applied to the Edge Processor do not modify the `_time` field. If any pipelines modify the `_time` field, the Edge Processor overwrites the timestamps assigned by the heavy forwarder.	props.conf in the Splunk Enterprise Admin Manual The Configure timestamps chapter in the Splunk Cloud Platform Getting Data In manual
Other data transformations	First, the heavy forwarder transforms data based on the configurations in the props.conf and transforms.conf files, including ingest actions. Then, the Edge Processor transforms data based on the configurations in the applied pipelines. Finally, if the data is sent to a Splunk platform HEC destination, then the Splunk platform transforms data based on the configurations in the props.conf and transforms.conf files, including ingest actions. If a Splunk platform S2S destination is used instead, then the Splunk platform only applies ingest actions.	Depending on your particular business requirements and use cases, you might need to configure data transformations at different points of the data ingestion pathway. Consider consolidating your data transformations into the Edge Processor pipelines where possible. This approach lets you use the broader range of transformations supported by SPL2, and allows you to manage your configurations from one centralized location.	The Working with pipelines chapter The Process data using pipelines chapter

The following diagram summarizes how data is transformed when it moves through a heavy forwarder and an Edge Processor before being sent to a destination:

Universal forwarder without INDEXED_EXTRACTIONS

If your Edge Processor receives the data from a universal forwarder that does not have the INDEXED_EXTRACTIONS property configured in the props.conf file, then the exact transformations that your data goes through depend on the kind of destination that the Edge Processor sends the data to.

Refer to the section corresponding to your data destination:

Splunk platform S2S destination

If your Edge Processor receives data from a universal forwarder without the INDEXED_EXTRACTIONS property and sends that data to a Splunk platform S2S destination, then the data is transformed as follows:

Data transformation	How the transformation happens	Configuration tips	Documentation
Break data into events	If the data's source type matches a source type defined in the Edge Processor service, then the Edge Processor breaks the data into events based on the configuration settings in the source type. Then, the Splunk platform attempts to break the data again based on the configurations in the props.conf file.	Maintain the same line breaking configurations in both the Edge Processor service and the Splunk platform.	Add source types for Edge Processors props.conf in the Splunk Enterprise Admin Manual Configure event line breaking in the Splunk Cloud Platform Getting Data In manual
Extract data values into event fields	First, the Edge Processor extracts fields based on the configurations in the applied pipelines. Then, the Splunk platform extracts additional fields based on the configurations in the props.conf and transforms.conf files.	Depending on your particular business requirements and use cases, you might need to configure field extractions at different points of the data ingestion pathway. Consider consolidating your custom field extractions into the Edge Processor pipelines where possible. This approach lets you use the broader range of transformations supported by SPL2, and allows you to manage your configurations from one centralized location.	The Working with pipelines chapter Extract fields from event data using an Edge Processor
Assign event timestamps	The Splunk platform assigns event timestamps based on the configuration settings in the props.conf file.	Configure timestamp assignment in the props.conf file of the Splunk platform.	props.conf in the Splunk Enterprise Admin Manual The Configure timestamps chapter in the Splunk Cloud Platform Getting Data In manual
Other data transformations	First, the Edge Processor transforms data based on the configurations in the applied pipelines. Then, the Splunk platform transforms data based on the configurations in the props.conf and transforms.conf files, including ingest actions.	Depending on your particular business requirements and use cases, you might need to configure data transformations at different points of the data ingestion pathway. Consider consolidating your data transformations into the Edge Processor pipelines where possible. This approach lets you use the broader range of transformations supported by SPL2, and allows you to manage your configurations from one centralized location.	The Working with pipelines chapter The Process data using pipelines chapter
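
To see why matching line breaking configurations matter when data is broken twice, consider the following Python sketch. It is a simplified model, not the actual Edge Processor or Splunk platform implementation. It assumes the pattern `([\r\n]+)`, which this page cites elsewhere as the default line breaking value, and it assumes that the text matched by the first capturing group is discarded at each event boundary. The function name is hypothetical.

```python
import re
from typing import List

# Assumed pattern: the default line breaking value cited on this page.
LINE_BREAKER = r"([\r\n]+)"


def break_events(raw: str, line_breaker: str = LINE_BREAKER) -> List[str]:
    """Split raw text into events, discarding the text matched by group 1."""
    pattern = re.compile(line_breaker)
    events = []
    start = 0
    for match in pattern.finditer(raw):
        # The event ends where the first capturing group begins,
        # and the next event starts where that group ends.
        events.append(raw[start:match.start(1)])
        start = match.end(1)
    events.append(raw[start:])
    # Drop empty strings left over from leading or trailing delimiters.
    return [event for event in events if event]


if __name__ == "__main__":
    raw = "event one\nevent two\r\nevent three\n"
    first_pass = break_events(raw)
    # A second pass with the same pattern leaves the events unchanged.
    second_pass = [e for event in first_pass for e in break_events(event)]
    print(first_pass == second_pass)
```

In this model, a second pass with the same pattern does not change the events, whereas a second pass with a different pattern could split them further.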

The following diagram summarizes how data is transformed when it moves through a universal forwarder without INDEXED_EXTRACTIONS and an Edge Processor before being sent to a Splunk platform S2S destination:

Splunk platform HEC or Amazon S3 destination

If your Edge Processor receives data from a universal forwarder without the INDEXED_EXTRACTIONS property and sends that data to a Splunk platform HEC or Amazon S3 destination, then the data is transformed as follows:

Data transformation	How the transformation happens	Configuration tips	Documentation
Break data into events	If the data's source type matches a source type defined in the Edge Processor service, then the Edge Processor breaks the data into events based on the configuration settings in the source type.	Make sure that a matching source type with the appropriate line breaking and merging configurations is defined in the Edge Processor service.	Add source types for Edge Processors
Extract data values into event fields	First, the Edge Processor extracts fields based on the configurations in the applied pipelines. Then, if the Edge Processor sends the data to a Splunk platform HEC destination, the Splunk platform extracts additional fields based on the configurations in the props.conf and transforms.conf files.	Depending on your particular business requirements and use cases, you might need to configure field extractions at different points of the data ingestion pathway. Consider consolidating your custom field extractions into the Edge Processor pipelines where possible. This approach lets you use the broader range of transformations supported by SPL2, and allows you to manage your configurations from one centralized location.	The Working with pipelines chapter Extract fields from event data using an Edge Processor
Assign event timestamps	By default, the Edge Processor sets a timestamp of "0". You must configure a pipeline to assign event timestamps, or else the Splunk platform will use the current time as the event timestamp. The Edge Processor renames the `_time` field to `time` when formatting events to be sent to the destination.	Use an Edge Processor pipeline to assign event timestamps by using the SPL2 command `eval _time = strptime(<field>, <format>)` to create the `_time` field.	Configure a pipeline to assign event timestamps
Other data transformations	First, the Edge Processor transforms data based on the configurations in the applied pipelines. Then, if the Edge Processor sends the data to a Splunk platform HEC destination, the Splunk platform transforms data based on the configurations in the props.conf and transforms.conf files, including ingest actions.	Depending on your particular business requirements and use cases, you might need to configure data transformations at different points of the data ingestion pathway. Consider consolidating your data transformations into the Edge Processor pipelines where possible. This approach lets you use the broader range of transformations supported by SPL2, and allows you to manage your configurations from one centralized location.	The Working with pipelines chapter The Process data using pipelines chapter
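
The timestamp behavior in the table above can be modeled with a short Python sketch. This is an analogue of the SPL2 `eval _time = strptime(<field>, <format>)` expression, not SPL2 itself, and the format specifiers that SPL2 accepts might differ from Python's. The function names, the `ts` field, and the assumption that the source time is in UTC are all hypothetical choices made for this example.

```python
from datetime import datetime, timezone


def assign_time(event: dict, field: str, fmt: str) -> dict:
    """Parse a date-time string from `field` and store it as epoch seconds in `_time`."""
    # Assumption for this sketch: the source time is expressed in UTC.
    parsed = datetime.strptime(event[field], fmt).replace(tzinfo=timezone.utc)
    result = dict(event)
    result["_time"] = parsed.timestamp()
    return result


def export_event(event: dict) -> dict:
    """Rename `_time` to `time`, using 0 when no pipeline assigned a timestamp."""
    result = dict(event)
    result["time"] = result.pop("_time", 0)
    return result


if __name__ == "__main__":
    raw_event = {"ts": "2024-01-02 03:04:05", "_raw": "user login ok"}
    with_time = assign_time(raw_event, "ts", "%Y-%m-%d %H:%M:%S")
    print(export_event(with_time))
    # Without a pipeline step, the exported timestamp stays at 0.
    print(export_event(raw_event))
```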

The following diagram summarizes how data is transformed when it moves through a universal forwarder without INDEXED_EXTRACTIONS and an Edge Processor before being sent to a Splunk platform HEC or Amazon S3 destination:

Universal forwarder with INDEXED_EXTRACTIONS

If your Edge Processor receives the data from a universal forwarder that has the INDEXED_EXTRACTIONS property configured in the props.conf file, then your data is transformed as follows:

Data transformation	How the transformation happens	Configuration tips	Documentation
Break data into events	The universal forwarder breaks the data into events based on the configurations in the props.conf file.	Configure line breaking and merging in the props.conf file of the universal forwarder.	props.conf in the Splunk Enterprise Admin Manual Configure event line breaking in the Splunk Cloud Platform Getting Data In manual
Extract data values into event fields	First, the universal forwarder extracts the indexed fields specified by the props.conf and transforms.conf files. Then, the Edge Processor extracts additional fields based on the configurations in the applied pipelines. Finally, if the data is sent to a Splunk platform HEC destination, then the Splunk platform extracts fields based on the configurations in the props.conf and transforms.conf files, including ingest actions. If a Splunk platform S2S destination is used instead, then the Splunk platform extracts fields based on ingest actions only.	Depending on your particular business requirements and use cases, you might need to configure field extractions at different points of the data ingestion pathway. Consider consolidating your custom field extractions into the Edge Processor pipelines where possible. This approach lets you use the broader range of transformations supported by SPL2, and allows you to manage your configurations from one centralized location.	The Working with pipelines chapter Extract fields from event data using an Edge Processor
Assign event timestamps	The universal forwarder assigns event timestamps based on the configurations in the props.conf file.	Configure timestamp assignment in the props.conf file of the universal forwarder. Make sure that the pipelines applied to the Edge Processor do not modify the `_time` field. If any pipelines modify the `_time` field, the Edge Processor overwrites the timestamps assigned by the universal forwarder.	props.conf in the Splunk Enterprise Admin Manual The Configure timestamps chapter in the Splunk Cloud Platform Getting Data In manual
Other data transformations	First, the universal forwarder transforms data based on the configurations in the props.conf and transforms.conf files. Then, the Edge Processor transforms data based on the configurations in the applied pipelines. Finally, if the data is sent to a Splunk platform HEC destination, then the Splunk platform transforms data based on the configurations in the props.conf and transforms.conf files, including ingest actions. If a Splunk platform S2S destination is used instead, then the Splunk platform only applies ingest actions.	Depending on your particular business requirements and use cases, you might need to configure data transformations at different points of the data ingestion pathway. Consider consolidating your data transformations into the Edge Processor pipelines where possible. This approach lets you use the broader range of transformations supported by SPL2, and allows you to manage your configurations from one centralized location.	The Working with pipelines chapter The Process data using pipelines chapter

The following diagram summarizes how data is transformed when it moves through a universal forwarder that has INDEXED_EXTRACTIONS configured and an Edge Processor before being sent to a destination:

HTTP client using the services/collector HEC endpoint

If your Edge Processor receives the data from an HTTP client through the services/collector HTTP Event Collector (HEC) endpoint, then the exact transformations that your data goes through depend on the kind of destination that the Edge Processor sends the data to.

Refer to the section corresponding to your data destination:

Splunk platform S2S destination

If your Edge Processor receives data through the services/collector HEC endpoint and sends that data to a Splunk platform S2S destination, then the data is transformed as follows:

Data transformation	How the transformation happens	Configuration tips	Documentation
Break data into events	The Edge Processor treats each top-level JSON object in the body of the HEC request as one distinct event. Then, the Splunk platform breaks the data into events based on the configurations in the props.conf file.	Configure line breaking and merging in the props.conf file of the Splunk platform.	props.conf in the Splunk Enterprise Admin Manual Configure event line breaking in the Splunk Cloud Platform Getting Data In manual Using the services/collector endpoint in Edge Processors
Extract data values into event fields	First, if the JSON object in the body of the HEC request contains any of these keys, the Edge Processor extracts them into event fields: `fields`, `host`, `index`, `source`, `sourcetype`, and `time`. Then, the Edge Processor extracts additional fields based on the configurations in the applied pipelines. Finally, the Splunk platform extracts additional fields based on the configurations in the props.conf and transforms.conf files.	Depending on your particular business requirements and use cases, you might need to configure field extractions at different points of the data ingestion pathway. Consider consolidating your custom field extractions into the Edge Processor pipelines where possible. This approach lets you use the broader range of transformations supported by SPL2, and allows you to manage your configurations from one centralized location.	Using the services/collector endpoint in Edge Processors The Working with pipelines chapter Extract fields from event data using an Edge Processor
Assign event timestamps	The Splunk platform assigns event timestamps based on the configuration settings in the props.conf file.	Configure timestamp assignment in the props.conf file of the Splunk platform.	props.conf in the Splunk Enterprise Admin Manual The Configure timestamps chapter in the Splunk Cloud Platform Getting Data In manual
Other data transformations	First, the Edge Processor transforms data based on the configurations in the applied pipelines. Then, the Splunk platform transforms data based on the configurations in the props.conf and transforms.conf files, including ingest actions.	Depending on your particular business requirements and use cases, you might need to configure data transformations at different points of the data ingestion pathway. Consider consolidating your data transformations into the Edge Processor pipelines where possible. This approach lets you use the broader range of transformations supported by SPL2, and allows you to manage your configurations from one centralized location.	The Working with pipelines chapter The Process data using pipelines chapter

The following diagram summarizes how data is transformed when it moves through the services/collector HEC endpoint and an Edge Processor before being sent to a Splunk platform S2S destination:

Splunk platform HEC or Amazon S3 destination

If your Edge Processor receives data through the services/collector HEC endpoint and sends that data to a Splunk platform HEC or Amazon S3 destination, then the data is transformed as follows:

Data transformation	How the transformation happens	Configuration tips	Documentation
Break data into events	The Edge Processor treats each top-level JSON object in the body of the HEC request as one distinct event. Then, if the Edge Processor sends the data to a Splunk platform HEC destination, the Splunk platform treats each line of data as one distinct event.	None. You cannot change how the Edge Processor or the Splunk platform breaks data into events when receiving data through the services/collector HEC endpoint.	Using the services/collector endpoint in Edge Processors
Extract data values into event fields	First, if the JSON object in the body of the HEC request contains any of these keys, the Edge Processor extracts them into event fields: `fields`, `host`, `index`, `source`, `sourcetype`, and `time`. Then, the Edge Processor extracts additional fields based on the configurations in the applied pipelines. Finally, if the Edge Processor sends the data to a Splunk platform HEC destination, the Splunk platform extracts additional fields based on the configurations in the props.conf and transforms.conf files.	Depending on your particular business requirements and use cases, you might need to configure field extractions at different points of the data ingestion pathway. Consider consolidating your custom field extractions into the Edge Processor pipelines where possible. This approach lets you use the broader range of transformations supported by SPL2, and allows you to manage your configurations from one centralized location.	Using the services/collector endpoint in Edge Processors The Working with pipelines chapter Extract fields from event data using an Edge Processor
Assign event timestamps	The services/collector HEC endpoint assigns event timestamps based on the `time` key in the HTTP request body. If the `time` key is not specified, then the endpoint does not assign timestamps. If the data passes through an Edge Processor pipeline that modifies the `_time` field, then the modified `_time` value is used as the event timestamp. The Edge Processor renames the `_time` field to `time` when sending events out to the destination. If none of the above occur, then the Splunk platform will use the current time as the timestamp.	Configure your HTTP application to send a timestamp in UNIX time format as the value of the `time` key. If that is not possible, then use an Edge Processor pipeline to assign event timestamps by using the SPL2 command `eval _time = strptime(<field>, <format>)` to create the `_time` field.	Using the services/collector endpoint in Edge Processors Configure a pipeline to assign event timestamps
Other data transformations	First, the Edge Processor transforms data based on the configurations in the applied pipelines. Then, if the Edge Processor sends the data to a Splunk platform HEC destination, the Splunk platform transforms data based on the configurations in the props.conf and transforms.conf files, including ingest actions.	Depending on your particular business requirements and use cases, you might need to configure data transformations at different points of the data ingestion pathway. Consider consolidating your data transformations into the Edge Processor pipelines where possible. This approach lets you use the broader range of transformations supported by SPL2, and allows you to manage your configurations from one centralized location.	The Working with pipelines chapter The Process data using pipelines chapter
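
As an illustration of the event breaking and field extraction rows above, the following Python sketch splits a request body that contains several top-level JSON objects and picks out the keys that the table lists. It is a rough model rather than the Edge Processor implementation. The function names and the sample payload, including the `event` key that carries the event data, are assumptions made for this example.

```python
import json
from typing import List

# Keys that the table above lists as extracted into event fields.
METADATA_KEYS = ("fields", "host", "index", "source", "sourcetype", "time")


def split_hec_body(body: str) -> List[dict]:
    """Treat each top-level JSON object in the body as one distinct event."""
    decoder = json.JSONDecoder()
    events = []
    pos = 0
    while pos < len(body):
        # raw_decode does not skip leading whitespace, so skip it here.
        while pos < len(body) and body[pos].isspace():
            pos += 1
        if pos >= len(body):
            break
        obj, pos = decoder.raw_decode(body, pos)
        events.append(obj)
    return events


def extract_metadata(obj: dict) -> dict:
    """Return only the listed keys that are present in the JSON object."""
    return {key: obj[key] for key in METADATA_KEYS if key in obj}


if __name__ == "__main__":
    body = (
        '{"time": 1700000000, "host": "web-01", "event": "login ok"}'
        '{"event": "no timestamp supplied"}'
    )
    for obj in split_hec_body(body):
        print(extract_metadata(obj))
```

The second object in the sample has no `time` key, which corresponds to the case where the endpoint does not assign a timestamp.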

The following diagram summarizes how data is transformed when it moves through the services/collector HEC endpoint and an Edge Processor before being sent to a Splunk platform HEC or Amazon S3 destination:

HTTP client using the services/collector/raw HEC endpoint

If your Edge Processor receives the data from an HTTP client through the services/collector/raw HTTP Event Collector (HEC) endpoint, then the exact transformations that your data goes through depend on the kind of destination that the Edge Processor sends the data to.

When working with JSON-formatted event data, use the services/collector endpoint instead of the services/collector/raw endpoint. Otherwise, your data might not be transformed as expected.

Refer to the section corresponding to your data destination:

Splunk platform S2S destination

If your Edge Processor receives data through the services/collector/raw HEC endpoint and sends that data to a Splunk platform S2S destination, then the data is transformed as follows:

Data transformation	How the transformation happens	Configuration tips	Documentation
Break data into events	If the data's source type matches a source type defined in the Edge Processor service, then the Edge Processor breaks the data into events based on the configuration settings in the source type. Then, the Splunk platform attempts to break the data again based on the configurations in the props.conf file.	Maintain the same line breaking configurations in both the Edge Processor service and the Splunk platform.	Add source types for Edge Processors props.conf in the Splunk Enterprise Admin Manual Configure event line breaking in the Splunk Cloud Platform Getting Data In manual
Extract data values into event fields	First, if the HEC request contains any of these query string parameters, the Edge Processor extracts them into event fields: `host`, `index`, `source`, `sourcetype`, and `time`. Then, the Edge Processor completes additional field extractions based on the configurations in the applied pipelines. Finally, the Splunk platform extracts additional fields based on the configurations in the props.conf and transforms.conf files.	Depending on your particular business requirements and use cases, you might need to configure field extractions at different points of the data ingestion pathway. Consider consolidating your custom field extractions into the Edge Processor pipelines where possible. This approach lets you use the broader range of transformations supported by SPL2, and allows you to manage your configurations from one centralized location.	Using the services/collector/raw endpoint in Edge Processors The Working with pipelines chapter Extract fields from event data using an Edge Processor
Assign event timestamps	The Splunk platform assigns event timestamps based on the configuration settings in the props.conf file.	Configure timestamp assignment in the props.conf file of the Splunk platform.	props.conf in the Splunk Enterprise Admin Manual The Configure timestamps chapter in the Splunk Cloud Platform Getting Data In manual
Other data transformations	First, the Edge Processor transforms data based on the configurations in the applied pipelines. Then, the Splunk platform transforms data based on the configurations in the props.conf and transforms.conf files, including ingest actions.	Depending on your particular business requirements and use cases, you might need to configure data transformations at different points of the data ingestion pathway. Consider consolidating your data transformations into the Edge Processor pipelines where possible. This approach lets you use the broader range of transformations supported by SPL2, and allows you to manage your configurations from one centralized location.	The Working with pipelines chapter The Process data using pipelines chapter

The following diagram summarizes how data is transformed when it moves through the services/collector/raw HEC endpoint and an Edge Processor before being sent to a Splunk platform S2S destination:

Splunk platform HEC or Amazon S3 destination

If your Edge Processor receives data through the services/collector/raw HEC endpoint and sends that data to a Splunk platform HEC or Amazon S3 destination, then the data is transformed as follows:

Data transformation	How the transformation happens	Configuration tips	Documentation
Break data into events	If the data's source type matches a source type defined in the Edge Processor service, then the Edge Processor breaks the data into events based on the configuration settings in the source type.	Make sure that a matching source type with the appropriate line breaking and merging configurations is defined in the Edge Processor service.	Add source types for Edge Processors
Extract data values into event fields	First, if the HEC request contains any of these query string parameters, the Edge Processor extracts them into event fields: `host`, `index`, `source`, `sourcetype`, and `time`. Then, the Edge Processor completes additional field extractions based on the configurations in the applied pipelines. Finally, if the Edge Processor sends the data to a Splunk platform HEC destination, the Splunk platform extracts additional fields based on the configurations in the props.conf and transforms.conf files.	Depending on your particular business requirements and use cases, you might need to configure field extractions at different points of the data ingestion pathway. Consider consolidating your custom field extractions into the Edge Processor pipelines where possible. This approach lets you use the broader range of transformations supported by SPL2, and allows you to manage your configurations from one centralized location.	Using the services/collector/raw endpoint in Edge Processors The Working with pipelines chapter Extract fields from event data using an Edge Processor
Assign event timestamps	The services/collector/raw HEC endpoint assigns event timestamps based on the `time` query string parameter. If the `time` query string parameter is not specified, then the endpoint sets a timestamp of "0". If the data passes through an Edge Processor pipeline that modifies the `_time` field, then the modified `_time` value is used as the event timestamp. The Edge Processor renames the `_time` field to `time` when sending events out to the destination. If none of the above occur, then the Splunk platform will use the current time as the timestamp.	If your HEC request includes the `time` query string parameter, then let the services/collector/raw endpoint determine the event timestamp. Otherwise, to avoid falling back on the current time as the event timestamp, you must use an Edge Processor pipeline to assign event timestamps by using the SPL2 command `eval _time = strptime(<field>, <format>)` to create the `_time` field.	Using the services/collector/raw endpoint in Edge Processors Configure a pipeline to assign event timestamps
Other data transformations	First, the Edge Processor transforms data based on the configurations in the applied pipelines. Then, if the Edge Processor sends the data to a Splunk platform HEC destination, the Splunk platform transforms data based on the configurations in the props.conf and transforms.conf files, including ingest actions.	Depending on your particular business requirements and use cases, you might need to configure data transformations at different points of the data ingestion pathway. Consider consolidating your data transformations into the Edge Processor pipelines where possible. This approach lets you use the broader range of transformations supported by SPL2, and allows you to manage your configurations from one centralized location.	The Working with pipelines chapter The Process data using pipelines chapter
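
The timestamp rules in the table above can be read as an order of precedence. The following Python sketch models that order for a request to the services/collector/raw endpoint. It is an interpretation of the table, not the actual implementation. The base URL, the function names, and the `pipeline_time` and `now` parameters are hypothetical.

```python
import time
from typing import Optional
from urllib.parse import parse_qs, urlencode, urlsplit


def build_raw_url(base_url: str, **params) -> str:
    """Build a services/collector/raw URL with metadata as query string parameters."""
    return f"{base_url}/services/collector/raw?{urlencode(params)}"


def effective_timestamp(url: str,
                        pipeline_time: Optional[float] = None,
                        now: Optional[float] = None) -> float:
    """Model which timestamp the event ends up with."""
    query = parse_qs(urlsplit(url).query)
    # The endpoint uses the `time` parameter, or 0 when it is not specified.
    received = float(query["time"][0]) if "time" in query else 0.0
    # A pipeline that modifies `_time` takes precedence.
    if pipeline_time is not None:
        return pipeline_time
    if received != 0.0:
        return received
    # Otherwise the Splunk platform falls back on the current time.
    return time.time() if now is None else now


if __name__ == "__main__":
    # Hypothetical host name and port for illustration only.
    url = build_raw_url("https://edge.example.com:8088",
                        host="web-01", sourcetype="access_combined",
                        time=1700000000)
    print(url)
    print(effective_timestamp(url))
```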

The following diagram summarizes how data is transformed when it moves through the services/collector/raw HEC endpoint and an Edge Processor before being sent to a Splunk platform HEC or Amazon S3 destination:

Syslog devices

If your Edge Processor receives data from a syslog device, then the exact transformations that your data goes through depend on the kind of destination that the Edge Processor ultimately sends the data to.

Refer to the section corresponding to your data destination:

Splunk platform S2S destination

If your Edge Processor receives data from a syslog device and sends that data to a Splunk platform S2S destination, then the data is transformed as follows:

Data transformation	How the transformation happens	Configuration tips	Documentation
Break data into events	First, the Edge Processor treats each line of data as one distinct event. Then, if the data's source type matches a source type defined in the Edge Processor service, the Edge Processor attempts to break the data again based on the configuration settings in the source type. Finally, the Splunk platform attempts to break the data again based on the configurations in the props.conf file.	Each syslog event is a single line of data, so the initial line breaking behavior by the Edge Processor suffices and you don't need to configure any additional line breaking settings. Leave both the Line breaking option in the Edge Processor source type and the `LINE_BREAKER` property in the props.conf file of the Splunk platform set to the default value of `([\r\n]+)`. Doing so prevents the Edge Processor and the Splunk platform from making any unnecessary changes to how the data is divided into events.	Edit, clone, or delete source types for Edge Processors props.conf in the Splunk Enterprise Admin Manual Configure event line breaking in the Splunk Cloud Platform Getting Data In manual
Extract data values into event fields	When you configure a port for the Edge Processor to start receiving syslog data, you select an RFC protocol. For all syslog data, regardless of RFC compliance, the Edge Processor extracts the following metadata fields: `net.host.ip`, `net.host.name`, `net.host.port`, `net.peer.ip`, `net.peer.name`, `net.peer.port`, and `net.transport`. If your syslog data is compliant with the selected RFC protocol, then the Edge Processor extracts all of the fields supported by that RFC protocol. For example, if you use RFC 5424, these fields are extracted: `appname`, `facility`, `hostname`, `msg_id`, `priority`, and `structured_data`. If your data is not compliant, then the Edge Processor extracts the `host` field with `<nil>` as the value, and the `source` field with `edge-source` as the value. Then, the Edge Processor completes additional field extractions based on the configurations in the applied pipelines. Finally, the Splunk platform extracts additional fields based on the configurations in the props.conf and transforms.conf files.	Make sure to select an appropriate RFC protocol. Otherwise, depending on your particular business requirements and use cases, you might need to configure field extractions at different points of the data ingestion pathway. Consider consolidating your custom field extractions into the Edge Processor pipelines where possible. This approach lets you use the broader range of transformations supported by SPL2, and allows you to manage your configurations from one centralized location.	Syslog data behavior based on selected RFC protocol Configure a port for receiving syslog data The Working with pipelines chapter Extract fields from event data using an Edge Processor
Assign event timestamps	The Splunk platform assigns event timestamps based on the configuration settings in the props.conf file.	Configure timestamp assignment in the props.conf file of the Splunk platform.	props.conf in the Splunk Enterprise Admin Manual The Configure timestamps chapter in the Splunk Cloud Platform Getting Data In manual
Other data transformations	First, the Edge Processor transforms data based on the configurations in the applied pipelines. Then, the Splunk platform transforms data based on the configurations in the props.conf and transforms.conf files, including ingest actions.	Depending on your particular business requirements and use cases, you might need to configure data transformations at different points of the data ingestion pathway. Consider consolidating your data transformations into the Edge Processor pipelines where possible. This approach lets you use the broader range of transformations supported by SPL2, and allows you to manage your configurations from one centralized location.	The Working with pipelines chapter The Process data using pipelines chapter
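
To make the RFC-based field extraction in the table above more concrete, here is a simplified Python sketch that splits an RFC 5424 message into the fields named on this page. It is an approximation for illustration only, not the Edge Processor's actual parser: the regular expression does not handle escaped `]` characters inside structured data, and storing the raw PRI value in `priority` is an assumption, because this page does not say how that field is populated. The fallback values for non-compliant data are the ones stated above.

```python
import re

# Simplified RFC 5424 layout:
# <PRI>VERSION TIMESTAMP HOSTNAME APP-NAME PROCID MSGID STRUCTURED-DATA [MSG]
RFC5424_PATTERN = re.compile(
    r"^<(?P<pri>\d{1,3})>(?P<version>\d{1,2}) "
    r"(?P<timestamp>\S+) (?P<hostname>\S+) (?P<appname>\S+) "
    r"(?P<procid>\S+) (?P<msg_id>\S+) "
    r"(?P<structured_data>-|(?:\[.*?\])+)"
    r"(?: (?P<msg>.*))?$",
    re.DOTALL,
)

def extract_syslog_fields(line):
    """Return RFC 5424 fields for a compliant line, or the fallback fields otherwise."""
    match = RFC5424_PATTERN.match(line)
    if match is None:
        # Fallback values that this page describes for non-compliant data.
        return {"host": "<nil>", "source": "edge-source"}
    fields = match.groupdict()
    pri = int(fields.pop("pri"))
    fields["priority"] = pri       # assumption: the raw PRI value
    fields["facility"] = pri // 8  # RFC 5424: PRI = facility * 8 + severity
    return fields

sample = (
    "<34>1 2003-10-11T22:14:15.003Z mymachine.example.com su - ID47 - "
    "'su root' failed for lonvick on /dev/pts/8"
)
fields = extract_syslog_fields(sample)
print(fields["hostname"], fields["appname"], fields["msg_id"], fields["facility"])
```

A line that does not match the selected format, such as plain free text, falls through to the `host` and `source` fallback values, which is why selecting the appropriate RFC protocol matters.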

The following diagram summarizes how data is transformed when it moves through a syslog device and an Edge Processor before being sent to a Splunk platform S2S destination:

Splunk platform HEC or Amazon S3 destination

If your Edge Processor receives data from a syslog device and sends that data to a Splunk platform HEC or Amazon S3 destination, then the data is transformed as follows:

Data transformation	How the transformation happens	Configuration tips	Documentation
Break data into events	First, the Edge Processor treats each line of data as one distinct event. Then, if the data's source type matches a source type defined in the Edge Processor service, the Edge Processor attempts to break the data again based on the configuration settings in the source type.	Each syslog event is a single line of data, so the initial line breaking behavior by the Edge Processor suffices and you don't need to configure any additional line breaking settings. Leave the Line breaking option in the Edge Processor source type set to the default value of `([\r\n]+)`. Doing so prevents the Edge Processor from making any unnecessary changes to how the data is divided into events.	Edit, clone, or delete source types for Edge Processors
Extract data values into event fields	When you configure a port for the Edge Processor to start receiving syslog data, you select an RFC protocol. For all syslog data, regardless of RFC compliance, the Edge Processor extracts the following metadata fields: `net.host.ip`, `net.host.name`, `net.host.port`, `net.peer.ip`, `net.peer.name`, `net.peer.port`, and `net.transport`. If your syslog data is compliant with the selected RFC protocol, then the Edge Processor extracts all of the fields supported by that RFC protocol. For example, if you use RFC 5424, these fields are extracted: `appname`, `facility`, `hostname`, `msg_id`, `priority`, and `structured_data`. If your data is not compliant, then the Edge Processor extracts the `host` field with `<nil>` as the value, and the `source` field with `edge-source` as the value. Then, the Edge Processor completes additional field extractions based on the configurations in the applied pipelines. Finally, if the Edge Processor sends the data to a Splunk platform HEC destination, the Splunk platform extracts additional fields based on the configurations in the props.conf and transforms.conf files.	Make sure to select an appropriate RFC protocol. Otherwise, depending on your particular business requirements and use cases, you might need to configure field extractions at different points of the data ingestion pathway. Consider consolidating your custom field extractions into the Edge Processor pipelines where possible. This approach lets you use the broader range of transformations supported by SPL2, and allows you to manage your configurations from one centralized location.	Syslog data behavior based on selected RFC protocol Configure a port for receiving syslog data The Working with pipelines chapter Extract fields from event data using an Edge Processor
Assign event timestamps	When you configure a port for the Edge Processor to start receiving syslog data, you select an RFC protocol. If your syslog data is compliant with the selected RFC protocol, the Edge Processor assigns event timestamps based on the date and time information found in the data. If your data is not compliant, then the Edge Processor sets the timestamp to "0". If the data passes through an Edge Processor pipeline that modifies the `_time` field, then the modified `_time` value is used as the event timestamp. The Edge Processor renames the `_time` field to `time` when sending events out to the destination. If none of the above occur, then the Splunk platform will use the current time as the timestamp.	If your data is compliant with the selected RFC protocol, then let the Edge Processor determine the event timestamp. Otherwise, to avoid falling back on the current time as the event timestamp, you must use an Edge Processor pipeline to assign event timestamps by using the SPL2 command `eval _time = strptime(<field>, <format>)` to create the `_time` field.	Configure a port for receiving syslog data Configure a pipeline to assign event timestamps Configure the time zone of your syslog data in the Edge Processor
Other data transformations	First, the Edge Processor transforms data based on the configurations in the applied pipelines. Then, if the Edge Processor sends the data to a Splunk platform HEC destination, the Splunk platform transforms data based on the configurations in the props.conf and transforms.conf files, including ingest actions.	Depending on your particular business requirements and use cases, you might need to configure data transformations at different points of the data ingestion pathway. Consider consolidating your data transformations into the Edge Processor pipelines where possible. This approach lets you use the broader range of transformations supported by SPL2, and allows you to manage your configurations from one centralized location.	The Working with pipelines chapter The Process data using pipelines chapter
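
The timestamp guidance in the table above relies on converting a date and time string into an epoch value. The following Python sketch shows an analogous conversion, to illustrate what an expression such as `eval _time = strptime(<field>, <format>)` is meant to achieve. It is not SPL2 and does not reproduce Edge Processor behavior exactly: the event dictionary, the `ts` field name, and the choice to treat timestamps without a time zone as UTC are all assumptions made for this example.

```python
from datetime import datetime, timezone

def assign_time(event, field, fmt):
    """Return a copy of the event with _time set to Unix epoch seconds parsed from a field."""
    parsed = datetime.strptime(event[field], fmt)
    if parsed.tzinfo is None:
        # Assumption: timestamps without an explicit zone are treated as UTC.
        parsed = parsed.replace(tzinfo=timezone.utc)
    return {**event, "_time": parsed.timestamp()}

# Hypothetical non-compliant event whose timestamp was set to 0 on receipt.
event = {"_raw": "2024-03-01 12:30:00 disk usage at 91%", "ts": "2024-03-01 12:30:00", "_time": 0}
updated = assign_time(event, "ts", "%Y-%m-%d %H:%M:%S")
print(updated["_time"])
```

In an actual pipeline, the field holding the date and time string would first need to be extracted from the raw data, and the time zone handling should follow the time zone configuration referenced in the Documentation column.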

The following diagram summarizes how data is transformed when it moves through a syslog device and an Edge Processor before being sent to a Splunk platform HEC or Amazon S3 destination:
