Splunk Cloud Platform

Search Manual

Wildcards

When you need to cast a wide net in your searches, you can use wildcards to match characters in string values. The wildcard that you use depends on the command that you are using the wildcard with:

eval and where commands
Use the LIKE function with the percent ( % ) symbol as a wildcard for matching multiple characters. Use the underscore ( _ ) character to match a single character. See the like (TEXT, PATTERN) function in the list of Comparison and Conditional eval functions.
All other commands
Use the asterisk ( * ) character as a wildcard to match an unlimited number of characters in a string. For example,
"my*" matches myhost1 and myhost.ny.mydomain.com.

Be efficient and specific

If you specify an asterisk with no other criteria, you are asking to match everything. Yes, everything. All events are retrieved, up to the maximum limit. A search to match everything is both inefficient and time consuming. You'll use a lot of system resources, which can prevent others from running their searches. Additionally, you might wait a long time for your search results.

To avoid these problems, be as specific as you can in your search criteria.

The more specific your search terms are, the more efficient your search is. Sometimes that means not using a wildcard. Searching for a specific word or phrase is more efficient than a search that uses a wildcard. For example, searching for access denied is always better than searching for access*.

Best practices for using wildcards

The best way to use a wildcard is at the end of a term, such as fail*.

Specify a field-value pair whenever possible to avoid searching the raw field, which is the entire event. For example: status=fail*.

When using wildcards in searches, carefully consider whether you're getting the results you expect. For example, the following searches containing wildcards do not return any events, as expected:

  • NOT *
  • sourcetype=_json NOT *
  • sourcetype=* NOT *

The following searches containing wildcards do return events, as expected:

  • sourcetype=*
  • index=*

Using wildcards with index= and NOT

If you want the inverse of a wildcard when using index= and NOT *, you must use (NOT *) to produce the most accurate search results. For example, the following searches do not return events, as expected.

  • index=<index name> (NOT *)
  • index=* (NOT *)

The following searches incorrectly return events, because NOT * is not enclosed in parentheses.

  • index=<index name> NOT *
  • index=* NOT *

When to avoid wildcard characters

There are several situations in which you should avoid using wildcard characters.

Avoid using wildcards in the middle of a string

Wildcard characters in the middle of a word or string might cause inconsistent results. This is especially true if the string contains punctuation, such as an underscore _ or dash - character.

For example, suppose you have the following list of product IDs.

DB-SG-G01
DC-SG-G02
MB-AG-G07
MB-AG-T01
SC-MG-G01
SF-BVS-G01
SG-SH-G05
WC-SH-A02
WC-SH-G04

You create a search that looks for all of the product IDs that begin with the letter S and end in G01.

productID=S*G01

That search will fail.

When the events with the product IDs are indexed, the product IDs are broken up into segments. For example, the product ID SC-MG-G01 has these segments: SC, MG, G01. There is no segment that starts with an S and ends with G01 which is what the search productID=S*G01 specifies. Because there are no segments that match your search, no results are found.

A search that uses a wildcard in the middle of the term returns inconsistent results because of the way in which data that contains punctuation is indexed and searched.

To learn more about how punctuation can impact using wildcards, see Event segmentation and searching.

The solution to this problem?

  • If the number of product IDs is small, specify the exact product IDs in your search rather than using a wildcard. For example:

    productID=SC-MG-G01 OR productID=SF-BVS-G01

  • If the number of product IDs is large, use a lookup instead of a wildcard.

Avoid using wildcards to match punctuation

Punctuation are characters that are not numbers or letters. If you want to match part of a string that includes punctuation, specify each string with the punctuation that you are searching for.

For example, you have the following values in the uri_path field in your events.

/cart.do	
/cart/error.do	
/cart/success.do
/category.screen		
/oldlink
/product.screen	
/productscreen.html
/show.do	
/stuff/logo.ico	


You want to match every uri_path that starts with /cart. The problem is that the paths contain a forward slash ( / ) character and period ( . ) character. Instead of specifying a wildcard character for the punctuation such as /cart*, specify the punctuation directly in your search criteria. For example,

...uri_path=/cart.do OR uri_path=/cart/error.do OR uri_path=/cart/success.do

Avoid using wildcards as prefixes

When you use a wildcard character at the beginning of a string, the search must look at every string to determine if the end of the string matches what you specify after the asterisk. Using a prefix wildcard is almost like using a wildcard by itself. Prefix wildcards might cause performance issues.

Avoid using wildcards at the beginning of search terms.

Searching for the asterisk character

You can't search for the asterisk ( * ) character directly because the character is reserved as a wildcard.

However, you can search for a term without the asterisk and then use either the where or regex command to filter the results.

For example, to search for a term that contains an asterisk such as *78, use these steps:

  1. First search for 78 without the asterisk, which returns all events that contain the number.
  2. Follow that with | regex _raw=\*78 to return only those events that contain *78.

The backslash ( \ ) is used in the regular expression to not interpret, or escape, the asterisk character. See the regex command.

Other supported wildcards

The LIKE function supports using other wildcards for pattern matching. The percent ( % ) symbol is used as a wildcard for matching multiple characters. The underscore ( _ ) character is used to match a single character.

These wildcards are only applicable to the LIKE function. See like (TEXT, PATTERN) in the list of Comparison and Conditional functions.

You can use the percent ( % ) wildcard anywhere in the PATTERN.

Last modified on 05 April, 2023
Search command primer   Backslashes

This documentation applies to the following versions of Splunk Cloud Platform: 8.2.2112, 8.2.2201, 8.2.2202, 8.2.2203, 9.0.2205, 9.0.2208, 9.0.2209, 9.0.2303, 9.0.2305, 9.1.2308, 9.1.2312, 9.2.2403 (latest FedRAMP release), 9.2.2406


Was this topic useful?







You must be logged into splunk.com in order to post comments. Log in now.

Please try to keep this discussion focused on the content covered in this documentation topic. If you have a more general question about Splunk functionality or are experiencing a difficulty with Splunk, consider posting a question to Splunkbase Answers.

0 out of 1000 Characters