add_cloud_metadata detecting wrong cloud provider (aws as openstack) #13816

aidan- · 2019-09-26T23:46:30Z

I am running a large number of instances in AWS which have multiple beat agents running on them (metricbeat, winlogbeat and filebeat). A small percentage of instances are starting up and detecting that they are running on 'openstack' instead of 'ec2' (ie, meta.cloud.provider=openstack).

Looking through the code and the way the cloud platform is detected, it's not very surprising that this is occurring as it looks like the endpoints/paths used by EC2 and Openstack both collide with each other. This appears to be have been briefly discussed as a potential issue in the original pull request that added Openstack as a cloud provider: #7663 (comment) but it doesn't look like the concern was addressed.

Perhaps using the non-ec2 compatible Openstack metadata endpoint would be a simple solution to avoid this?

https://docs.openstack.org/nova/latest/user/metadata.html#metadata-openstack-format

Version:
libbeat v6.8.2

Operating System:
Experienced on Windows but would affect all.

Discuss Forum URL:
Not created by me but a pre-existing one:
https://discuss.elastic.co/t/add-cloud-metadata-wrong-provider/189780

Steps to reproduce:
Starting beat on AWS EC2 instances can sometimes result in openstack being identified as the provider:
INFO add_cloud_metadata/add_cloud_metadata.go:323 add_cloud_metadata: hosting provider type detected as openstack, metadata={"availability_zone":"ap-southeast-2a","instance_id":"i-xxxxxxxxxxxxxxx","instance_name":"ip-10-xx-xx-xx","machine_type":"r5d.2xlarge","provider":"openstack"}

The text was updated successfully, but these errors were encountered:

urso · 2019-10-02T14:57:40Z

Beats 7.4 introduced a new setting to select the providers to query. Original PR #13812

If all your instances run on AWS, you can configure the processor as follows:

processors:
- add_cloud_metadata:
    providers: ["aws"]

aidan- · 2019-10-10T03:02:31Z

Thanks for the information. We are currently running beats v6.x and it's unlikely we will upgrade to v7 in the short term, so we may have to live with this one.

botelastic · 2020-09-09T03:19:09Z

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.

inqueue · 2020-09-11T17:42:00Z

Is there any way to fix? Another user has reported the issue.

elasticmachine · 2021-05-10T13:54:31Z

Pinging @elastic/integrations (Team:Integrations)

botelastic · 2022-05-10T14:28:35Z

Hi!
We just realized that we haven't looked into this issue in a while. We're sorry!

We're labeling this issue as Stale to make it hit our filters and make sure we get back to it as soon as possible. In the meantime, it'd be extremely helpful if you could take a look at it as well and confirm its relevance. A simple comment with a nice emoji will be enough :+1.
Thank you for your contribution!

VimCommando · 2022-07-27T02:09:42Z

👍

botelastic · 2023-07-27T02:28:44Z

Hi!
We just realized that we haven't looked into this issue in a while. We're sorry!

We're labeling this issue as Stale to make it hit our filters and make sure we get back to it as soon as possible. In the meantime, it'd be extremely helpful if you could take a look at it as well and confirm its relevance. A simple comment with a nice emoji will be enough :+1.
Thank you for your contribution!

andrewkroh · 2023-08-02T19:42:30Z

The suggested workaround is not available to Elastic Agent users who cannot modify the configuration for the add_cloud_metadata processor. We need a code fix and I think @aidan- 's suggestion is promising:

Perhaps using the non-ec2 compatible Openstack metadata endpoint would be a simple solution to avoid this?

bplies-ATX · 2023-08-25T20:46:42Z

The suggested workaround is not available to Elastic Agent users who cannot modify the configuration for the add_cloud_metadata processor. We need a code fix and I think @aidan- 's suggestion is promising:

Perhaps using the non-ec2 compatible Openstack metadata endpoint would be a simple solution to avoid this?

We just upgraded Elastic Agent from 8.7.1 to 8.9.1 and started to notice misidentifications as well. Notice cloud.provider and cloud.service.name are now wrong.

    "cloud": {
      "availability_zone": "us-east-1b",
      "instance": {
        "name": "ip-10-102-2-203.ec2.internal",
        "id": "i-0639f8a4c790252e4"
      },
      "provider": "openstack",
      "machine": {
        "type": "t3.2xlarge"
      },
      "service": {
        "name": "Nova"
      }
    }

renzedj · 2023-11-21T20:12:51Z

I'm encountering this with 8.11.x. AWS is being misidentified as Openstack.

it-ops-liron · 2023-12-13T18:25:43Z

Same here. Just tested migrating to 8.11 from 8.5 and found out some data was mislabeled as "openstack"

BenB196 · 2023-12-28T23:56:45Z

Also hitting this issue on 8.11 agent.

Edit: One thing that I found was a bit more helpful in generally fixing this issue was to ensure the IMDSv2 was set to required, not entirely sure why that makes a difference, but at least in my case it did.

udayshingwekar · 2024-01-19T01:44:14Z

I am hitting the same issue in 8.11 and resolved it by adding processor in each of system integration outputs (very painful as I could not find a global way to do so) in the fleet managed elastic agents.

add_cloud_metadata:
providers: ["aws"]

toddferg · 2024-02-02T15:13:03Z

I think I found a workaround that can use the @Custom ingest pipeline that will require less specific workarounds.

PUT _ingest/pipeline/metrics-aws.ec2_metrics@custom
{
  "description": "Custom pipeline for AWS EC2 metrics with failure handling",
  "processors": [
    {
      "script": {
        "source": """
        if (ctx.cloud?.provider != null && ctx.cloud.provider == 'openstack') {
          ctx.cloud.provider = 'aws';
        }
        """,
        "on_failure": [
          {
            "set": {
              "field": "_ingest._failure_message",
              "value": "{{ _ingest.on_failure_message }}"
            }
          }
        ]
      }
    }
  ]
}

PUT _ingest/pipeline/logs-aws.ec2_logs@custom
{
  "description": "Custom pipeline for AWS EC2 logs with failure handling",
  "processors": [
    {
      "script": {
        "source": """
        if (ctx.cloud?.provider != null && ctx.cloud.provider == 'openstack') {
          ctx.cloud.provider = 'aws';
        }
        """,
        "on_failure": [
          {
            "set": {
              "field": "_ingest._failure_message",
              "value": "{{ _ingest.on_failure_message }}"
            }
          }
        ]
      }
    }
  ]
}

axw · 2024-02-22T05:43:38Z

Perhaps using the non-ec2 compatible Openstack metadata endpoint would be a simple solution to avoid this?

I think this makes sense, but may require a substantial amount of testing. Another option that involves fewer changes would be to check if the OpenStack-specific endpoint exists, and then continue using the EC2-compatible endpoint for returning the values.

Ideally we should have some automated integration testing for this. Probably not running all the time, possibly just on-demand. I was looking for an easy way to test against OpenStack and found https://microstack.run/docs/single-node; I tried it in an EC2 instance and it's timing out, so not sure if that's a viable option.

EDIT: managed to get it ~~working~~ installed, I was using the wrong instance type earlier.
EDIT2: even after it's installed, it's still not working... trying to create an OpenStack VM fails

george-viaud · 2024-07-30T19:56:33Z

Experiencing this using fleet, agent v8.14.3

Our infra is AWS, EC2

Still seeing:

cloud.provider: openstack

We have tried to find a way to force configuration via fleet to:

processors:
  - add_cloud_metadata:
      providers: ["aws"]

but so far no luck.

If it makes a difference, we are registering our ephemeral ec2 instances via cron on startup:

./elastic-agent install --url=https://[OBFUSCATED]:8220 --insecure --force --enrollment-token=[OBFUSCATED]

Any advice would be greatly appreciated

george-viaud · 2024-08-02T19:00:38Z

Some additional information (for my case, at least) - I noticed that our fleet server instance is getting the correct cloud.provider and service - it appears that the instances using the fleet-configured Apache HTTP Server as well as the system integration access log entries (and perhaps others) that are getting the wrong integration info. Not sure how to debug this, wish I could help myself and others further.

Kavindu-Dodan · 2024-10-31T22:08:39Z

I had a look into this and the following are my observations.

Background

The root cause have few aspects, first the openstack implementation ¹ relies on the EC2-compatible metadata ² endpoints. Then the both Openstack and EC2/AWS implementations are enabled by default ³ ⁴. (note - Local is a misleading name)

For AWS EC2 instances that enforce IMDSv2, openstack metadata fetch fails as IMDSv2 require a session token ⁵ to access endpoints (as observed here - #13816 (comment)). This makes Openstack implementation to fail where EC2 metadata fetch wins the race condition.

Action

I am looking into migrating openstack implementation to use Nova metadata service ⁶ as proposed by many. Further, while this is being investigated, a workaround here is to use the providers selector in the processor. For example, in metricbeat.yaml,

processors:
  - add_cloud_metadata:
      providers:
        aws

Kavindu-Dodan · 2024-11-14T22:57:46Z

#41636 attempts to fix this by adding priority to AWS/EC2 & Azure metadata fetch mechanisms. I had to do this as I was unable to get a stable Openstack instance to validate their dedicated metadata endpoints.

andrewkroh added :Processors bug libbeat labels Sep 27, 2019

botelastic bot added Stalled needs_team Indicates that the issue/PR needs a Team:* label labels Sep 9, 2020

botelastic bot removed the Stalled label Sep 11, 2020

jsoriano added the Team:Integrations Label for the Integrations team label May 10, 2021

botelastic bot removed the needs_team Indicates that the issue/PR needs a Team:* label label May 10, 2021

botelastic bot added the Stalled label May 10, 2022

botelastic bot removed the Stalled label Jul 27, 2022

tetianakravchenko mentioned this issue May 2, 2023

[Processor: add_cloud_metadata] Use AWS client to get instance metadata and EKS cluster name #35182

Merged

6 tasks

kaiyan-sheng mentioned this issue Jul 19, 2023

Set providers parameter in config to aws for provider_aws_ec2_test.go #36106

Merged

6 tasks

botelastic bot added the Stalled label Jul 27, 2023

botelastic bot removed the Stalled label Aug 2, 2023

andrewkroh mentioned this issue Aug 8, 2023

Wrong detection of cloud.provider since Beats 8.0 (openstack as huawei) #31022

Closed

andrewkroh changed the title ~~add_cloud_metadata detecting wrong cloud provider~~ add_cloud_metadata detecting wrong cloud provider (aws as openstack) Aug 8, 2023

axw mentioned this issue Feb 12, 2024

Fix hetzner and openstack tests by adding AWS_EC2_METADATA_DISABLED=true in ec2 #37907

Merged

6 tasks

axw mentioned this issue Mar 28, 2024

add_cloud_metadata: env var override for providers #38669

Merged

5 tasks

mergify bot mentioned this issue Apr 16, 2024

[8.13](backport #38669) add_cloud_metadata: env var override for providers #38965

Merged

5 tasks

nimarezainia mentioned this issue Oct 23, 2024

[Elastic Agent] Provide additional Cloud metadata in the agent local_metadata elastic/elastic-agent#3213

Open

Kavindu-Dodan self-assigned this Oct 31, 2024

Kavindu-Dodan mentioned this issue Nov 13, 2024

[libbeat] fix: aws & openstack metadata conflict in add_cloud_metadata processor #41636

Merged

6 tasks

Kavindu-Dodan closed this as completed in #41636 Nov 27, 2024

This was referenced Nov 27, 2024

[8.16](backport #41636) [libbeat] fix: aws & openstack metadata conflict in add_cloud_metadata processor #41814

Merged

[8.x](backport #41636) [libbeat] fix: aws & openstack metadata conflict in add_cloud_metadata processor #41815

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

add_cloud_metadata detecting wrong cloud provider (aws as openstack) #13816

add_cloud_metadata detecting wrong cloud provider (aws as openstack) #13816

aidan- commented Sep 26, 2019

urso commented Oct 2, 2019

aidan- commented Oct 10, 2019

botelastic bot commented Sep 9, 2020

inqueue commented Sep 11, 2020

elasticmachine commented May 10, 2021

botelastic bot commented May 10, 2022

VimCommando commented Jul 27, 2022

botelastic bot commented Jul 27, 2023

andrewkroh commented Aug 2, 2023

bplies-ATX commented Aug 25, 2023

renzedj commented Nov 21, 2023

it-ops-liron commented Dec 13, 2023

BenB196 commented Dec 28, 2023 •

edited

Loading

udayshingwekar commented Jan 19, 2024

toddferg commented Feb 2, 2024

axw commented Feb 22, 2024 •

edited

Loading

george-viaud commented Jul 30, 2024 •

edited

Loading

george-viaud commented Aug 2, 2024

Kavindu-Dodan commented Oct 31, 2024 •

edited

Loading

Kavindu-Dodan commented Nov 14, 2024

add_cloud_metadata detecting wrong cloud provider (aws as openstack) #13816

add_cloud_metadata detecting wrong cloud provider (aws as openstack) #13816

Comments

aidan- commented Sep 26, 2019

urso commented Oct 2, 2019

aidan- commented Oct 10, 2019

botelastic bot commented Sep 9, 2020

inqueue commented Sep 11, 2020

elasticmachine commented May 10, 2021

botelastic bot commented May 10, 2022

VimCommando commented Jul 27, 2022

botelastic bot commented Jul 27, 2023

andrewkroh commented Aug 2, 2023

bplies-ATX commented Aug 25, 2023

renzedj commented Nov 21, 2023

it-ops-liron commented Dec 13, 2023

BenB196 commented Dec 28, 2023 • edited Loading

udayshingwekar commented Jan 19, 2024

toddferg commented Feb 2, 2024

axw commented Feb 22, 2024 • edited Loading

george-viaud commented Jul 30, 2024 • edited Loading

george-viaud commented Aug 2, 2024

Kavindu-Dodan commented Oct 31, 2024 • edited Loading

Footnotes

Kavindu-Dodan commented Nov 14, 2024

BenB196 commented Dec 28, 2023 •

edited

Loading

axw commented Feb 22, 2024 •

edited

Loading

george-viaud commented Jul 30, 2024 •

edited

Loading

Kavindu-Dodan commented Oct 31, 2024 •

edited

Loading