Docs » Monitor services and hosts in Splunk Infrastructure Monitoring » Monitor Google Cloud Platform » Monitor Google Cloud Platform services

Monitor Google Cloud Platform services 🔗

You can monitor the following Google Cloud Platform (GCP) services in Splunk Observability Cloud, which provides infrastructure monitoring features using Google Cloud Operations. See the official Google Cloud documentation for more information.

About GCP data 🔗

Uniquely identifying Google Cloud Platform resources 🔗

All of the metrics that the StackDriver integration sends contain a dimension called gcp_id. The value of this dimension starts with the project ID that contains the resource followed by _ (underscore) and then other properties specific to that resource. If you install collectd on a Compute Engine instance using the standard install script this dimension is automatically added.

The simplest way to manually send metrics with this dimension to discover the unique ID value is to find a time series that contains this dimension using the Metadata Catalog. The time series should contain other dimensions that give a more friendly identification to the underlying GCP resource.

Dimensions 🔗

The metric time series (MTS) associated with GCP metrics have the following generic dimensions, common to all services:

Dimension name

Description

gcp_id

unique identifier for GCP objects

project_id

project ID of the monitored resource

monitored_resource

name of the monitored resource

service

service to which the metric belongs

Moreover, each service also has a dimension that identifies the resource to which the metric belongs to. For example, the instance_id dimension identifies compute instances, and the bucket_name dimension identifies storage buckets.

Monitor GCP services and identify problems 🔗

View the health of GCP services at a glance from the Infrastructure page. This page provides a key metric for each service. You can also drill down into specific instances of a GCP service. For example, view key metrics for the GCE service, and filter for a specific ID to analyze a particular GCE instance.

Follow these steps to analyze problems with GCP services from the Infrastructure page:

  1. Select Navigation menu > Infrastructure and view the Google Cloud Platform category.

  2. Select the specific service you want to analyze. For example, select Cloud Storage to view metrics for a specific bucket. If you see “No Data Found,” you need to first configure an integration.

  3. Compare instances of the service along the following metrics with the Color by drop-down list. In the heat map, colors represent the health of instances based on the metrics you select. For example, a heat map that shows green and red, uses green to denote healthy and red to denote unhealthy instances. If your heat map has multiple colors, then the lighter gradient represents less activity, and the darker gradient represents more activity.

  4. Optionally, you might apply visually accessible color palettes on custom dashboards and charts and throughout Infrastructure Monitoring, select Account Settings > Color Accessibility. You can color by metrics like CPU utilization and filter by dimensions like geographic region.

  5. Group instances based on metadata about each instance with the Group by drop-down list. You can group instances according to the region or resource group they are running in or the environment tag. Use this to see correlations between different parts of your infrastructure and its performance.

  6. Find outliers for your metrics with the Find Outliers setting. Specify the Scope and Strategy.

  • Set the Scope to analyze outliers from across the entire visible population of instances, or only within groups defined by the dimension or property you grouped instances by.

  • Select one of two Strategies to find outliers:

    • Deviation from Mean: Instances appear as red that exceed the mean value of the metric by at least three standard deviations. Use this setting to find the most extreme outliers.

    • Deviation from Median: Instances appear as red that exceed the median absolute deviation value by at least three absolute deviations. This setting does not weigh extreme outliers as heavily as the standard deviation.

  1. Select a specific instance you want to investigate further to view all the metadata and key metrics for the instance. For every instance, Observability Cloud provides a default dashboard. Analyze all the available metadata about the cloud service the instance is running in, the instance itself, and any custom tags associated with the instance. The default dashboard provides metric time series (MTS) for key metrics.

Monitor data from other sources 🔗

You can also export and monitor data from these sources running in your GCP environment, as described in the table.

Get data in

Monitor

Description

Collect Kubernetes data

Monitor Kubernetes (classic version)

Collect metrics and logs from Kubernetes clusters running in Google Compute Engine (GCE) or Google Kubernetes Engine (GKE) instances.

Monitor hosts

Collect metrics and logs from Linux and Windows hosts running in GCE instances.

Instrument back-end applications to send spans to Splunk APM

Introduction to Splunk APM

Collect application metrics and spans running in hosts or Kubernetes clusters.

Use default dashboards to monitor GCP services 🔗

Splunk Observability Cloud provides default dashboards for supported GCP services, available in dashboard groups based on the GCP service the dashboard represents data for.

To find default dashboards for GCP services, select Navigation menu > Dashboards and search for the GCP service you want to view dashboards for.

Explore built-in content 🔗

Observability Cloud collects data from many cloud services:

  • To see all of the navigators provided for data collected in your organization, go to the Infrastructure page.

  • To see all the pre-built dashboards for data collected in your organization, select Dashboards > Built-in.

Keep in mind the constraints for GCP Compute Engine and GCP Kubernetes Engine content described in the next sections.

See GCP Compute Engine content 🔗

GCP Compute Engine instances are powered by their respective public cloud service as well as the Splunk Distribution of OpenTelemetry Collector. You need both for all the charts to display data in the built-in dashboards.

  • If you have only the public cloud service configured, you can see all the cards representing the services where data come from, but some charts in the built-in dashboards for GCP Compute Engine instances display no data.

  • If you have only the public cloud service and the Smart Agent (deprecated) configured, some charts in the built-in dashboards for GCP Compute Engine instances display no data.

See GCP Kubernetes Engine content 🔗

In order to populate GCP Kubernetes Engine navigators and dashboards, Splunk Observability Cloud needs access to the control plain endpoint of your Kubernetes cluster to be able to display data.

If you have a private GKE cluster, refer to the official Google documentation on setting up limited access to the public control plane endpoint, and reach out to Support to obtain the IP ranges that you need to authorize.

Alternatively, you can collect Kubernetes data using the OpenTelemetry Collector for Kubernetes and use the provided dashboards and navigators. Learn more at Collect Kubernetes data and Monitor Kubernetes (classic version).