[Metricbeat] Add Google Cloud Platform module #14829

sayden · 2019-11-27T20:43:18Z

ONGOING work on docs, but most of the code is ready to go.

Seed PR for the Google Cloud Platform module for Metricbeat.

It includes the following:

Stackdriver metricset
Compute metricset, based on Stackdriver, as a config-based module

Ignore the following metricsets. They are included in this PR for testing purposes but are not going to be merged yet (they'll be removed before merging):

Storage
Firebase
Firestore
Loadbalancing
PubSub

Some vocabulary for people new to Google Cloud

You can find some translations for GCP services in AWS:

Stackdriver -> Cloudwatch
Compute -> EC2
PubSub -> SQS
Storage (GCS) -> S3
Firebase / Firestore -> ~DynamoDB
Bigquery -> ~Redshift+Athena

Labels / Metadata

You'll see many mentions of metadata in the code. The term covers two different entities within GCP: labels and metadata. For Elasticsearch purposes both can be considered metadata, so whenever you read "label" or "metadata", both are treated as the same thing at the end of the pipeline.
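As a rough illustration of treating both as one thing, here is a minimal Go sketch that flattens labels and metadata into a single map. The function name `mergeMetadata` and the rule that metadata wins on a key collision are assumptions made for this example, not taken from the module's code.

```go
package main

import "fmt"

// mergeMetadata flattens GCP labels and metadata into one map.
// Hypothetical helper: on a key collision the metadata value wins,
// which is an assumption of this sketch and not specified by the PR.
func mergeMetadata(labels, metadata map[string]string) map[string]string {
	merged := make(map[string]string, len(labels)+len(metadata))
	for k, v := range labels {
		merged[k] = v
	}
	for k, v := range metadata {
		merged[k] = v
	}
	return merged
}

func main() {
	// Example values are made up for illustration.
	labels := map[string]string{"env": "prod"}
	metadata := map[string]string{"startup-script": "init.sh"}
	fmt.Println(mergeMetadata(labels, metadata))
}
```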

Grouping of events

The way GCP labels metrics makes it somewhat complex to generate "service-based events". Metrics are exported individually, so you don't request "compute metrics" or "metrics of this compute instance". Instead you have to request "all cpu_utilization values of compute instances", and a single response brings one or more values per instance, for the specified timeframe, for all your instances.

For example, a request for CPU utilization can return (in pseudocode):

{
    "metadata": {
        "zone": "eu-central-1",
        "project": "project1"
    },
    "metric": "cpu_utilization",
    "points": [
        {
            "time": 1,
            "value": 2,
            "metadata": {
                "instance": "instance-1"
            }
        },
        {
            "time": 2,
            "value": 2,
            "metadata": {
                "instance": "instance-1"
            }
        }
    ]
}
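To make the shape of that response concrete, the following Go sketch decodes the pseudocode payload into structs with `encoding/json`. The type names (`metricResponse`, `point`) and the helper `parseResponse` are hypothetical; the real Stackdriver client returns protobuf types, not this JSON.

```go
package main

import (
	"encoding/json"
	"fmt"
)

// point mirrors one entry of the "points" array in the pseudocode above.
type point struct {
	Time     int64             `json:"time"`
	Value    float64           `json:"value"`
	Metadata map[string]string `json:"metadata"`
}

// metricResponse mirrors the top-level pseudocode object (hypothetical type).
type metricResponse struct {
	Metadata map[string]string `json:"metadata"`
	Metric   string            `json:"metric"`
	Points   []point           `json:"points"`
}

// parseResponse decodes a raw payload shaped like the pseudocode response.
func parseResponse(raw []byte) (metricResponse, error) {
	var r metricResponse
	err := json.Unmarshal(raw, &r)
	return r, err
}

// sample reproduces the pseudocode response from the description.
const sample = `{
  "metadata": {"zone": "eu-central-1", "project": "project1"},
  "metric": "cpu_utilization",
  "points": [
    {"time": 1, "value": 2, "metadata": {"instance": "instance-1"}},
    {"time": 2, "value": 2, "metadata": {"instance": "instance-1"}}
  ]
}`

func main() {
	r, err := parseResponse([]byte(sample))
	if err != nil {
		fmt.Println("parse error:", err)
		return
	}
	// One response carries several points, each tagged with its instance.
	for _, p := range r.Points {
		fmt.Printf("%s %s t=%d v=%v\n", r.Metric, p.Metadata["instance"], p.Time, p.Value)
	}
}
```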

Then a new call must be made (in this example, to the Compute API) to request instance metadata, such as working group, network group, user labels or user metadata, which is associated only with the instance and not with a particular metric like CPU. You then get data like this (again, in pseudocode):

{
    "instance":"instance-1",
    "metadata":{
        "user":{
            "key":"value"
        },
        "system":{
            "key":"value"
        }
    },
    ...
}

In the end, both responses for that particular metric must be grouped into a single event that shares some common metadata. For Compute this includes instance_id and availability zone, apart from the timestamp. Each service requires a specific implementation to get non-Stackdriver metadata. The service metadata implementation is only developed for Compute at the moment and can be seen in googlecloud/stackdriver/compute; the rest of the services use only the metadata provided by Stackdriver.
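The grouping step can be sketched roughly as follows. This is not the module's implementation: the types `Point` and `Event`, the function `groupEvents`, and the metric name `disk_read_bytes` are all hypothetical, and the choice to key events on instance plus timestamp is an assumption based on the description above.

```go
package main

import (
	"fmt"
	"sort"
)

// Point is one metric value for one instance at one timestamp.
type Point struct {
	Time     int64
	Value    float64
	Instance string
}

// eventKey identifies one output event: same instance, same timestamp.
type eventKey struct {
	Instance string
	Time     int64
}

// Event is a hypothetical grouped event carrying metrics plus instance metadata.
type Event struct {
	Instance string
	Time     int64
	Metrics  map[string]float64
	Metadata map[string]string
}

// groupEvents merges per-metric points into one event per (instance, time)
// and attaches the instance metadata fetched separately (e.g. from the Compute API).
// Events of the same instance share the same metadata map.
func groupEvents(series map[string][]Point, instanceMeta map[string]map[string]string) []Event {
	byKey := map[eventKey]*Event{}
	for metric, points := range series {
		for _, p := range points {
			k := eventKey{Instance: p.Instance, Time: p.Time}
			ev, ok := byKey[k]
			if !ok {
				ev = &Event{
					Instance: p.Instance,
					Time:     p.Time,
					Metrics:  map[string]float64{},
					Metadata: instanceMeta[p.Instance],
				}
				byKey[k] = ev
			}
			ev.Metrics[metric] = p.Value
		}
	}
	events := make([]Event, 0, len(byKey))
	for _, ev := range byKey {
		events = append(events, *ev)
	}
	// Sort for deterministic output: by instance, then by time.
	sort.Slice(events, func(i, j int) bool {
		if events[i].Instance != events[j].Instance {
			return events[i].Instance < events[j].Instance
		}
		return events[i].Time < events[j].Time
	})
	return events
}

func main() {
	series := map[string][]Point{
		"cpu_utilization": {{Time: 1, Value: 2, Instance: "instance-1"}, {Time: 2, Value: 2, Instance: "instance-1"}},
		"disk_read_bytes": {{Time: 1, Value: 512, Instance: "instance-1"}}, // made-up metric name
	}
	meta := map[string]map[string]string{"instance-1": {"zone": "eu-central-1"}}
	for _, ev := range groupEvents(series, meta) {
		fmt.Printf("%s t=%d metrics=%v metadata=%v\n", ev.Instance, ev.Time, ev.Metrics, ev.Metadata)
	}
}
```

Keying on a small comparable struct keeps the lookup a single map access per point; a service with different shared metadata would only need a different key type.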

ECS

Metadata returned from Stackdriver is ECS compliant for Compute metadata (mainly availability zone, account id, cloud provider, instance id and instance name). Some of the metadata might be written outside the ECS fields. More deployment configurations plus testing are needed to find them all.

Modules

All services from https://cloud.google.com/monitoring/api/metrics_gcp can be added through additional configuration. Tests so far show no problems, but the specific metadata must be developed separately for each of them.

Limitations

You cannot set a period under 300s (you can right now, but it won't return any metrics). I think it's some kind of limitation of Stackdriver, because its metrics are sampled every 60 to 300 seconds.

Happy reviewing :)

Sorry for the big PR, it was impossible to make it smaller

jsoriano

First pass through the code. I haven't found anything serious, but I think some things need to be polished. Thanks!

jsoriano · 2019-11-29T19:00:42Z

x-pack/metricbeat/module/googlecloud/constants.go

+	SERVICE_COMPUTE   = "compute"
+	SERVICE_PUBSUB    = "pubsub"
+	SERVICE_FIRESTORE = "firestore"
+	SERVICE_STORAGE   = "storage"


+1, please use camel case for these constants
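For reference, the four constants quoted above could look like this in camel case. The exact names are an assumption for illustration; the names chosen in the merged code may differ.

```go
package main

import "fmt"

// Service names written in camel case (Go's MixedCaps convention)
// instead of SCREAMING_SNAKE_CASE. The identifiers are illustrative only.
const (
	ServiceCompute   = "compute"
	ServicePubsub    = "pubsub"
	ServiceFirestore = "firestore"
	ServiceStorage   = "storage"
)

func main() {
	fmt.Println(ServiceCompute, ServicePubsub, ServiceFirestore, ServiceStorage)
}
```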

kaiyan-sheng · 2019-12-04T00:02:36Z

metricbeat/docs/modules/googlecloud.asciidoc

+    - firebase
+    - storage
+    - loadbalancing
+  zone: "your zone"


can we specify more than one zone here?

Good question. We can't specify it right now. The idea was to keep the first version as simple as possible. It's actually possible to request all metrics for a project without a zone filter, or even to request several zones, but we are moving slowly for now to see how it goes, because the code to request metrics and convert them using lightweight modules is already pretty complex.

perhaps it would be interesting to put a real zone here so things won't fail if they start the module out of the box?

I prefer that things fail explicitly, so that a user with machines in Europe who runs Metricbeat gets a specific error saying zone "your zone" not found, instead of a silent failure where no events are sent because there are no machines in that zone/region. That silence may lead them to think that Metricbeat is not working properly (it's your fault because you didn't set the correct zone, but that's implicit).

kaiyan-sheng · 2019-12-04T01:20:43Z

x-pack/metricbeat/module/googlecloud/compute/_meta/data.json

+                "id": "elastic-metricbeat"
+            },
+            "provider": "googlecloud",
+            "instance": {


Is there a separate API to get more info for each compute instance? For example the machine type, status, etc.

Oh yes! And I'm actually using it already but I completely forgot to attach machine type too! Thanks for the heads up!

jsoriano · 2020-01-09T10:48:58Z

@sayden this will need to be backported to 7.x.

exekias · 2020-01-15T12:40:37Z

I just saw this doesn't have a changelog, could you add it in a different PR?

exekias · 2020-01-15T12:41:28Z

as 7.6 branch was already created, could you also add another backport to that one?

kaiyan-sheng · 2020-01-15T22:22:44Z

@sayden I just started testing this PR with the compute metricset. Curious, why do we have metrics from the same instance in different events? I also see that you have metrics separated into cpu, disk, firewall, etc. in https://github.com/elastic/beats/tree/master/x-pack/metricbeat/module/googlecloud/compute/_meta. Why are they not in the same event/metric for the same instance?

kaiyan-sheng · 2020-01-16T15:32:46Z

Bug found during testing: #15613

kaiyan-sheng · 2020-01-23T15:10:56Z

Missing exported field in documentation for compute metricset: #15776

kaiyan-sheng · 2020-01-23T15:27:48Z

Enhancement request (no need for 7.6) for adding regions as a config parameter: #15780

kaiyan-sheng · 2020-01-23T17:16:50Z

Potential sensitive data in labels.metadata: #15782

sayden added enhancement Metricbeat Metricbeat labels Nov 27, 2019

sayden self-assigned this Nov 27, 2019

houndci-bot reviewed Nov 27, 2019

sayden marked this pull request as ready for review November 29, 2019 12:09

sayden requested a review from a team as a code owner November 29, 2019 12:09

jsoriano requested changes Dec 3, 2019

ChrsMark reviewed Dec 3, 2019

kaiyan-sheng reviewed Dec 4, 2019

kaiyan-sheng reviewed Dec 4, 2019

kaiyan-sheng reviewed Dec 4, 2019

sayden added 16 commits December 11, 2019 17:25

Atomic commit

8d9ed54

add vendor

9a5e4e3

Atomic commit

a4969cd

Add google.golang.org/api/googleapi to vendoring

1c787b1

Added google.golang.org/api/internal/third_party/uritemplates library

410b901

Added field.yml for compute and update docs

b562f27

Add google.golang.org/genproto/googleapis/api to vendor

57f956a

Add google.golang.org/genproto/googleapis/api/distribution to vendor

e01a700

Add google.golang.org/genproto/googleapis/api/label to vendor

39f9ed9

Add google.golang.org/genproto/googleapis/type/calendarperiod to vendor

af4dd18

Run go vet

f17f4c6

Fix unnamed variable in test

3f33340

Update notice

13a00ed

Add context to the Metrics Requester

94c2b95

Separate groups of ECS fields

87435f5

Add machine type to the event output

c84bc76

sayden merged commit 8be7745 into elastic:master Jan 8, 2020

zube bot added [zube]: Done and removed [zube]: In Review labels Jan 8, 2020

jsoriano added needs_backport, test-plan and v7.6.0 labels Jan 9, 2020

andresrc removed the [zube]: Done label Jan 10, 2020

andresrc unassigned sayden Jan 14, 2020

andresrc added the needs testing notes label Jan 14, 2020

sayden mentioned this pull request Jan 15, 2020

Cherry-pick #14829 to 7.x: [Metricbeat] Add Google Cloud Platform module #15571

Closed

sayden removed the needs_backport label Jan 15, 2020

sayden added a commit to sayden/beats that referenced this pull request Jan 15, 2020

[Metricbeat] Add Google Cloud Platform module (elastic#14829)

57ee987

Includes Stackdriver and Compute Metricset # Conflicts: # NOTICE.txt # vendor/vendor.json

sayden mentioned this pull request Jan 15, 2020

Cherry-pick #14829 to 7.x: [Metricbeat] Add Google Cloud Platform module #15572

Merged

exekias added the needs_backport label Jan 15, 2020

sayden mentioned this pull request Jan 15, 2020

Cherry-pick #14829 to 7.6: [Metricbeat] Add Google Cloud Platform module #15575

Merged

sayden added a commit to sayden/beats that referenced this pull request Jan 15, 2020

[Metricbeat] Add Google Cloud Platform module (elastic#14829)

81ac67c

Includes Stackdriver and Compute Metricset (cherry picked from commit 8be7745) # Conflicts: # NOTICE.txt # vendor/vendor.json

sayden removed the needs_backport label Jan 15, 2020

sayden added a commit that referenced this pull request Jan 15, 2020

Cherry-pick #14829 to 7.x: [Metricbeat] Add Google Cloud Platform mod…

0cf12f5

…ule (#15572)

kaiyan-sheng self-assigned this Jan 15, 2020

kaiyan-sheng added the test-plan-regression label Jan 16, 2020

urso mentioned this pull request Feb 8, 2020

[docs] Add 7.6 breaking changes and release highlights #16202

Merged
