Install the Machine Learning Toolkit
The Splunk Machine Learning Toolkit (MLTK) enables users to create, validate, manage, and operationalize machine learning models through a guided user interface. Use the following directions to install the MLTK on to your system(s).
Requirements
In order to successfully run the Machine Learning Toolkit, the following is required:
- Splunk Enterprise 8.0 and above or Splunk Cloud Platform
- Installation of the Python for Scientific Computing (PSC) add-on version 2.0.0, 2.0.1, or 2.0.2 from Splunkbase
- Installation of the Splunk Machine Learning Toolkit app from Splunkbase.
You can choose the appropriate version of the Python for Scientific Computing (PSC) add-on for your environment:
In order to save models, users need the upload_lookup_files
capability included in their role.
Specific version dependencies
For version information that includes MLTK, the PSC add-on, Python, and Splunk Enterprise, see Machine Learning Toolkit version dependencies matrix.
MLTK Version PSC Version 5.2.2 2.0.0, 2.0.1, or 2.0.2 5.2.1 2.0.0, 2.0.1, or 2.0.2 5.2.0 2.0.0, 2.0.1, or 2.0.2 5.1.0 2.0.0, 2.0.1, or 2.0.2 5.0.0 2.0.0, 2.0.1, or 2.0.2 4.5.0 1.4 4.4.2 1.3 or 1.4 4.4.1 1.3 or 1.4 4.4.0 1.3 or 1.4 4.3.0 1.3 or 1.4 4.2.0 1.3 or 1.4 4.1.0 1.3 4.0.0 1.3 3.4.0 1.3 3.3.0 1.2 or 1.3 3.2.0 1.2 or 1.3 3.1.0 1.2
Versions 2.0.2 and 2.0.1 of the PSC add-on are limited to minor library upgrades from version 2.0.0. There are no differences in functionality to version 2.0.0 of the PSC add-on.
If you have written any custom algorithms that rely on the PSC libraries, upgrading to an updated version PSC library add-on will impact those algorithms. You will need to re-train any models (re-run the search that used the fit
command) using those algorithms after you upgrade the PSC add-on.
Splunk Cloud Platform deployments
Follow the appropriate directions for your instance of self-service or managed Splunk Cloud Platform.
Splunk Cloud Platform trial and self-service Splunk Cloud Platform
Install the Python for Scientific Computing add-on and the Splunk Machine Learning Toolkit app to your self-service instance of Splunk Cloud Platform using the app browser in Splunk Cloud Platform.
- Log in to your Splunk Cloud Platform instance.
- From the Splunk Web home screen, click on the gear icon next to Apps in the left navigation bar.
- Click Browse more apps.
- Search for the Python for Scientific Computing add-on and install it.
- Search for the Splunk Machine Learning Toolkit app and install it.
Managed Splunk Cloud Platform
Open a ticket with support and request the Python for Scientific Computing add-on and Splunk Machine Learning Toolkit app to be installed for you.
Splunk Enterprise single instance deployments
Follow these directions for single instance deployments.
Install the Python for Scientific Computing add-on and Splunk Machine Learning Toolkit app onto your single instance Splunk Enterprise
- Install the Python for Scientific Computing add-on first (required).
- Install the Splunk Machine Learning Toolkit app.
Install an app or add-on in Splunk Web
- In Splunk Web, click on the gear icon next to Apps in the left navigation bar.
- On the Apps page, click Install app from file.
- Click Choose File, navigate to and select the package file for the app or add-on, then click Open.
- Click Upload.
Install an app or add-on from the command line
At the command line, enter the following content, depending on your operating system.
- Unix/Linux:
./splunk install app <path/packagename>
- Windows:
splunk install app <path\packagename>
Alternatively, unpack/unzip the file then copy the app directory to $SPLUNK_HOME/etc/apps
on Unix based systems or %SPLUNK_HOME%\etc\apps
on Windows systems.
Splunk Enterprise distributed deployments
Follow these directions for distributed deployments.
Use the following tables to determine where and how to install the Splunk Machine Learning Toolkit and Python for Scientific Computing add-on in a distributed deployment of Splunk Enterprise. Depending on your environment, you may need to install the Splunk Machine Learning Toolkit and Python for Scientific Computing add-on in multiple places.
Where to install Splunk Machine Learning Toolkit and Python for Scientific Computing add-on
This table provides a reference for installing the Splunk Machine Learning Toolkit (MLTK) and Python for Scientific Computing add-on (PSC) to a distributed deployment of Splunk Enterprise.
Splunk instance type Supported MLTK required PSC required Actions required / Comments Search Heads Yes Yes Yes Install the MLTK and PSC add-on to all search heads where the Machine Learning Toolkit is used. Search heads must be running Splunk Enterprise 6.6 or greater. Indexers No No No Do not install on Indexers. Heavy Forwarders No No No These apps do not contain a data collection component. Universal Forwarders No No No These apps do not contain a data collection component. Light Forwarders No No No These apps do not contain a data collection component.
Distributed deployment feature compatibility
This table describes the compatibility of the Splunk Machine Learning Toolkit and Python for Scientific Computing add-on with Splunk distributed deployment features.
Distributed deployment feature Supported Actions required Search Head Clusters Yes Search heads must be running Splunk Enterprise 6.6 or greater. Indexer Clusters No Do not install on Indexer Clusters.
Machine Learning Toolkit files
You can view the source code for the Machine Learning Toolkit app in Unix and Windows environments:
- For Unix-based systems, see
$SPLUNK_HOME/etc/apps/Splunk_ML_Toolkit
. - For Windows systems, see
%SPLUNK_HOME%\etc\apps\Splunk_ML_Toolkit
.
The MLTK is not open source. MLTK source code is provided as an example and for educational purposes only.
Refer to the following table for sub-directory names and descriptions:
Subdirectory | Description |
---|---|
appserver/static and /bin
|
Contains the underlying code files for Python, JavaScript, CSS, and images. |
/default
|
Contains configuration and dashboard files. |
/lookups
|
Contains the sample datasets used in the Showcase examples, along with more information about the datasets and their licenses. |
Bundle replication
Permanent model files, sometimes referred to as learned models or encoded lookups, are saved on disk. These files follow Splunk knowledge object rules, including permissions and bundle replication. Bundle replication is the process by which knowledge objects on the search head are distributed to the indexers.
The Machine Learning Toolkit includes a number of example model files that support the Showcase page. These examples are powered by .csv lookup files. To prevent performance issues, these .csv lookup files are not included in the MLTK bundle replication process.
Scoring metrics in the Machine Learning Toolkit | Install the GitHub for Machine Learning App |
This documentation applies to the following versions of Splunk® Machine Learning Toolkit: 5.2.2
Feedback submitted, thanks!