Admin endpoint security #2763

htuch · 2018-03-08T17:10:18Z

The admin endpoint today is unsecured (no authentication or TLS), with the assumption that it is only available to localhost or accessible on a trusted network. Ideally:

We want to be able to restrict access to only trusted IPs, client certificates and ensure we have transport security.
We want to have some ability to distinguish roles and access to the admin console, i.e. distinct identities might be allowed to operate /quitquitquit vs. stats monitoring.

Beyond just security, there's also the question of what the admin console is. Is it just a curlable utility, an interactive web console or is it a first-class API intended for programatic use? Should it offer gRPC endpoints (in particular as we are moving towards a proto definition of its contents in places such as #2172). Answers to this affect the framing of security considerations.

Opening this issue to start the design discussion here.

The text was updated successfully, but these errors were encountered:

mattklein123 · 2018-03-08T17:13:29Z

A few initial points are in the comment here: envoyproxy/data-plane-api#523 (comment)

htuch · 2018-03-09T15:25:26Z

One thing I wonder about is whether we should be offering a traditional web interface at all. Here's one alternative design; implement only gRPC or REST endpoints, have folks build out Javascript client side interfaces which can be served from a listener via Envoy's direct response (https://www.envoyproxy.io/docs/envoy/latest/api-v2/api/v2/route/route.proto#envoy-api-field-route-route-direct-response) mechanism. This removes Envoy from being in the business of worrying about XSS/CSRF/other web security concerns, while providing the convenience of browser admin capability.

mattklein123 · 2018-03-09T23:57:18Z

For reference the web stuff was only recently added by @jmarantz. I objected to this slightly at the time because realistically I think the only production use for the admin endpoint is either curl or other automated tools, but it didn't seem like a big deal to me to serve the landing page in HTML so I didn't worry about it that much.

My overall view of things right now is that the admin endpoint is not secure it all and must be accessed only over a trusted network. In the future I think we should do the following things:

Generally provide only gRPC/REST endpoints codified in the data-plane-api repo (already tracked by various issues).
Promote the admin endpoint to a fuller listener config allowing for both TLS/mTLS and inline RBAC security on a per-endpoint basis (once we have the inline RBAC filter). This will allow operators to configure things as they like it.
Per @htuch we should consider having a "secure by default" admin config which locks admin down to localhost only and then requires operators to open up individual endpoints (.e.g., /stats) as they see fit.

In general I think that worrying about things like XSS/CSRF/etc. is kind of a waste of time for this. I will defer to @jmarantz who knows substantially more about this on what we should do on that front security-wise (hopefully optimizing for realistic usage scenarios).

mattklein123 · 2018-03-10T00:01:56Z

P.S., it would be great if someone in the community who is passionate about admin security might want to own this. There will be a non-trivial amount of work here to get to where we want to ultimately be.

jmarantz · 2018-03-10T04:38:52Z

One clarification: the http handlers for mutating operations were pre-existing. The change added a web home-page with proper escaping, and reduced XSS through proper http content types, and AFAIK added no additional exposure.

I agree that more restricted access by default would help.

mattklein123 · 2018-03-10T22:57:05Z

One clarification: the http handlers for mutating operations were pre-existing. The change added a web home-page with proper escaping, and reduced XSS through proper http content types, and AFAIK added no additional exposure.

Sorry I spoke incorrectly. Before your change we had no HTML. I know basically nothing about web security. My real point was that if the existence of HTML is going to cause security consternation, I don't think it's worth maintaining because IMHO the use of the HTML endpoints is not going to happen in production. Assuming there is no additional exposure, than it's fine by me. I just wanted to point out that we should be careful about the HTML stuff if that is going to cause us additional maintenance burden.

ofek · 2018-03-12T02:20:24Z

Additionally, I would strongly recommend Envoy have a separate listener/endpoint that only serves /stats with optional basic auth.

ofek · 2018-03-13T03:06:58Z

@DataDog is recommending https://gist.github.com/ofek/6051508cd0dfa98fc6c13153b647c6f8 until this is solved.

Idea courtesy of @ggreenway
Config courtesy of @bndw (with this modification from @htuch)

Fixes envoyproxy/envoy#2769 References envoyproxy/envoy#2763 Signed-off-by: Matt Klein <[email protected]>

taiki45 · 2018-03-14T13:18:56Z

I vote to disabling web admin interface. Alternatively:

Move admin operations like /cpuprofiler or /logging to runtime configuration flags or simple Unix domain socket commands like haproxy one. This allows us to manage permissions via traditional file system permissions.
Promote pull-based endpoints like /stats to gRPC/REST API and let them have fuller listener config.

In additon, these features might be disabled by default. All of programatic use goes gRPC/REST API to supoprt user extendability, and rest of admin interface gets on FS permissions.

Personally, I like Envoy's curl-able interface so I prefer Unix domain socket commands which is similar to the current web admin interface. I have a little passion to move admin operations from web interface to Unix domain socket one.

ofek · 2018-03-14T17:11:16Z

What's the difference between a "pull-based endpoint" like /stats and a REST API version?

jmarantz · 2018-03-14T17:19:43Z

I think it makes sense to control access via configuration, including disabling the http admin interface completely or restricting it to an IP, with separate controls for read-access vs POSTed mutations.

taiki45 · 2018-03-14T18:24:43Z

What's the difference between a "pull-based endpoint" like /stats and a REST API version?

The main difference that I thought is the API one will have a dedicated listener and will be properly schema controlled. For example, /stats already has a json format for programmatic access.

taiki45 · 2018-03-14T18:36:28Z

It's not a strong objection but, to say source IP base restriction or local loopback binding, we want more detailed permission control in some deployment cases: developers can login a host in which Envoy runs but do not want to allow them to take admin operations of the Envoy instance, but want to allow only administrators to do that operations for easy debbuging.

envoyproxy/envoy#2971 adds warning-checks that mutations should be POSTed. This documents that status. In a future PR, mutations will fail if they are not POSTs. See envoyproxy/envoy#2763 for more detail. Also adds a doc stub for @jsedgewick to fill in for /runtime_modify.

htuch · 2019-12-31T00:26:35Z

@justincely I think if we had the admin port as a listener, it would be reasonable to add a simple HTTP filter to block this; it's probably entirely doable with something like the RBAC filter today (although probably more complicated than you'd want from a UX perspective).

I don't think anyone is working on this right now, so go ahead. I would recommend moving to admin listener as the first step here.

mattklein123 · 2019-12-31T15:53:06Z

+1 let's start by just making the admin listener a real listener.

cstrahan · 2020-01-02T17:08:35Z

I think if we had the admin port as a listener, it would be reasonable to add a simple HTTP filter to block this

@justincely and I were talking about this earlier today. I'll be working on this shortly -- just now getting a sense of what all needs to be done to get us there. As I work on this, feel free to assign this issue to me whenever you feel confident in my ability to deliver.

ofek · 2020-01-02T17:26:54Z

Hello all! For those of us providing monitoring solutions based on /stats, could someone please briefly explain what would need to be changed config-wise to retain access to that endpoint once the proposed feature lands?

mattklein123 · 2020-01-03T15:58:47Z

Hello all! For those of us providing monitoring solutions based on /stats, could someone please briefly explain what would need to be changed config-wise to retain access to that endpoint once the proposed feature lands?

I think the default behavior is likely to be the same as it is today (fully open), but we will allow for real listener configuration including the RBAC filter, etc. so that certain endpoints can be blocked. It's possible that eventually we would change the default posture but I'm not sure this would happen in the initial version.

ofek · 2020-01-03T16:48:39Z

Excellent, thanks!

Signed-off-by: gargnupur <[email protected]>

cstrahan · 2020-05-07T14:31:49Z

A proposal to secure the admin endpoint

I was chatting with @mattklein123 a couple weeks ago about securing the admin endpoint, under the assumption that we'd want some way to allow users to specify arbitrary filters (e.g. RBAC).

I would like to propose that we allow specifying a Listener config in the Admin message, and deprecate the Admin fields that can be taken directly from the Listener (e.g. address details).

The AdminFilter would be made a first class filter (registered with just like the other http filters), but we would validate that the AdminFilter is only used within the Admin config, and that the filter is specified last in the filter chain.

I would appreciate feedback from both Envoy users and developers; would this approach work for you?

/cc @justincely

mattklein123 · 2020-05-07T18:40:41Z

I would appreciate feedback from both Envoy users and developers; would this approach work for you?

Yes I think this SGTM. Thank you for working on this! cc @envoyproxy/security-team

htuch · 2021-01-11T15:34:56Z

For posterity, I'd like to note some outcome of a recent discussion around why admin endpoint is sometimes opened more widely than it should. Some users are making use of the Prometheus stats endpoint (https://www.envoyproxy.io/docs/envoy/latest/operations/admin.html?highlight=prometheus#get--stats-prometheus) for exposing out Envoy stats. Unfortunately, due to the lack of fine-grained access control (or any for that matter), we end up exposing the entire endpoint out to the network.

We would probably have some reasonable wins for security by having a separate stats and admin endpoint, but ultimately, making this a first class listener would provide RBAC and per-route granularity.

geo-y · 2021-02-14T23:56:05Z

I can use one light-weight web server backend to access the admin interface back with basic authentication.
Here is my example:

docker-compose.yml(key part):

services:
  h2o:
    image: fukata/h2o-php:latest
    volumes:
      - <path>/h2o/:/etc/h2o/ext/
      - <path>/html/:/var/www/
    command: ["h2o", "-m", "master", "-c", "/etc/h2o/ext/h2o.conf"]
    restart: on-failure
  envoy:
    image: envoyproxy/envoy-alpine-dev:latest
    volumes:
      - <path>/envoy/:/etc/envoy/ext/:ro
      - <path>/cert/:/etc/crts/:ro
    ports:
      - "80:8000"
      - "443:8443"
    depends_on:
      - h2o
    command: /usr/local/bin/envoy -c /etc/envoy/ext/front_v3.yaml
    restart: on-failure

envoy(https://github.com/envoyproxy/examples/blob/main/front-proxy/envoy.yaml):

###
            virtual_hosts:
            - name: envoy_admin1
              domains:
              - "<admin-host1>"
              - "<admin-host2>"
              routes:
              - match:
                  prefix: "/"
                route:
                  cluster: h2o1
###
  clusters:
  - name: h2o1
    connect_timeout: 2s
    type: strict_dns
    lb_policy: round_robin
    load_assignment:
      cluster_name: h2o1
      endpoints:
      - lb_endpoints:
        - endpoint:
            address:
              socket_address:
                address: h2o
                port_value: 80
###

h2o.conf:

hosts:
  "<admin-host>":
    listen:
      port: 80
    paths:
      "/":
        mruby.handler: |
          require "htpasswd.rb"
          Htpasswd.new("/etc/h2o/ext/htpass", "realm-name")
        proxy.reverse.url: http://envoy:<admin-port>
        proxy.preserve-host: ON

The "htpasswd" file manages the admin user and password:

htpasswd ./htpass admin-username

tpetkov-VMW · 2022-03-01T18:55:18Z

In order to have secure settings by default, we have a local patch that adds a list of explicitly allowed endpoints directly into the bootstrap configuration. There are a few things to consider around the design, but the implementation is pretty straight forward and would give some guarantees that nobody can, for example, do /quitquitquit .
Would this be helpful/desired?

maxres-ch · 2023-02-08T18:24:39Z

I was wondering if there's any movement on this issue? We'd really like to see it as a configurable thing.

jmarantz · 2024-03-18T19:20:04Z

#11367 is stale and needs to be re-started with a dev ready to push it forward.

See also #32346 which just merged, and is somewhat related.

ira-gordin-sap · 2024-09-09T13:26:51Z

Hi, what happens with this issue?

htuch added enhancement Feature requests. Not bugs or questions. tech debt area/security help wanted Needs help! labels Mar 8, 2018

htuch mentioned this issue Mar 8, 2018

Documentation for hystrix dashboard support feature (issue #1244) envoyproxy/data-plane-api#523

Closed

htuch mentioned this issue Mar 9, 2018

EP-01-001 HTTP: Lacking Admin-Interface Security allows CSRF and DOS (Cure53) #2769

Closed

mattklein123 mentioned this issue Mar 11, 2018

admin: add security warning envoyproxy/data-plane-api#534

Merged

ofek mentioned this issue Mar 12, 2018

[envoy] new integration DataDog/integrations-core#1156

Merged

htuch pushed a commit to envoyproxy/data-plane-api that referenced this issue Mar 13, 2018

admin: add security warning (#534)

a6378e5

Fixes envoyproxy/envoy#2769 References envoyproxy/envoy#2763 Signed-off-by: Matt Klein <[email protected]>

jsedgwick mentioned this issue Mar 14, 2018

runtime: add admin endpoint to view/modify loaded values #514

Closed

This was referenced Mar 23, 2018

admin: add /runtime_modify endpoint and make runtime work without fs backing #2837

Merged

server: warn on admin mutations, GET requests, add mocking of messages #2912

Closed

jmarantz mentioned this issue Apr 3, 2018

server: Warn in logs on admin mutations via GET. Mutations should be POSTed. #2971

Merged

This was referenced Apr 3, 2018

Document that admin mutations should be POSTs envoyproxy/data-plane-api#599

Closed

Document that admin mutations should be POSTs envoyproxy/data-plane-api#600

Closed

Document that admin mutations should be POSTs envoyproxy/data-plane-api#601

Closed

Shikugawa pushed a commit to Shikugawa/envoy that referenced this issue Mar 28, 2020

Use onLog for TCP stats login (envoyproxy#2763)

cf701d9

Signed-off-by: gargnupur <[email protected]>

htuch mentioned this issue May 6, 2020

proposal: securing admin endpoint #11082

Closed

cstrahan mentioned this issue May 29, 2020

configure admin endpoint as a listener/http-filter (e.g. to secure via RBAC) #11367

Closed

6 tasks

sfudeus mentioned this issue Aug 21, 2020

Move Envoy admin interface from TCP:15000 to a Unix Domain Socket istio/istio#19684

Closed

9 tasks

kevincantu mentioned this issue Dec 14, 2020

Is there a more secure way to allow ALBs to health-check Envoy? projectcontour/contour#3201

Closed

duderino mentioned this issue Feb 23, 2021

Add initial security best practices documentation istio/istio.io#8952

Merged

mattklein123 added area/admin and removed enhancement Feature requests. Not bugs or questions. labels Mar 9, 2021

markmandel mentioned this issue Mar 15, 2021

Make metric port configurable googleforgames/quilkin#101

Closed

sunjayBhatia mentioned this issue Jul 27, 2021

Allow /healthcheck/fail from not admin webpage #17508

Closed

htuch mentioned this issue Oct 19, 2021

Global Rate Limit Listener Opt Out #18678

Closed

nak3 mentioned this issue Mar 1, 2022

disable ext_authz filter for http01 challenge knative-extensions/net-kourier#778

Merged

ggreenway mentioned this issue Apr 22, 2022

admin: Richer HTML home page with forms for params #19546

Merged

EltonzHu mentioned this issue Oct 23, 2023

Support FQDN Address Type in EndpointSlice envoyproxy/gateway#1922

Closed

nezdolik mentioned this issue Jun 7, 2024

Make it possible to configure secure access to Envoy admin endpoint envoyproxy/gateway#3565

Open

GabrielAlacchi mentioned this issue Nov 29, 2024

Allow Locking Down of Admin Interface in Sidecars istio/istio#54109

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Admin endpoint security #2763

Admin endpoint security #2763

htuch commented Mar 8, 2018

mattklein123 commented Mar 8, 2018

htuch commented Mar 9, 2018

mattklein123 commented Mar 9, 2018

mattklein123 commented Mar 10, 2018

jmarantz commented Mar 10, 2018 •

edited

Loading

mattklein123 commented Mar 10, 2018

ofek commented Mar 12, 2018 •

edited

Loading

ofek commented Mar 13, 2018

taiki45 commented Mar 14, 2018

ofek commented Mar 14, 2018

jmarantz commented Mar 14, 2018 •

edited

Loading

taiki45 commented Mar 14, 2018

taiki45 commented Mar 14, 2018

htuch commented Dec 31, 2019

mattklein123 commented Dec 31, 2019

cstrahan commented Jan 2, 2020

ofek commented Jan 2, 2020

mattklein123 commented Jan 3, 2020

ofek commented Jan 3, 2020

cstrahan commented May 7, 2020 •

edited

Loading

mattklein123 commented May 7, 2020

htuch commented Jan 11, 2021

geo-y commented Feb 14, 2021 •

edited

Loading

tpetkov-VMW commented Mar 1, 2022

maxres-ch commented Feb 8, 2023

jmarantz commented Mar 18, 2024

ira-gordin-sap commented Sep 9, 2024

Admin endpoint security #2763

Admin endpoint security #2763

Comments

htuch commented Mar 8, 2018

mattklein123 commented Mar 8, 2018

htuch commented Mar 9, 2018

mattklein123 commented Mar 9, 2018

mattklein123 commented Mar 10, 2018

jmarantz commented Mar 10, 2018 • edited Loading

mattklein123 commented Mar 10, 2018

ofek commented Mar 12, 2018 • edited Loading

ofek commented Mar 13, 2018

taiki45 commented Mar 14, 2018

ofek commented Mar 14, 2018

jmarantz commented Mar 14, 2018 • edited Loading

taiki45 commented Mar 14, 2018

taiki45 commented Mar 14, 2018

htuch commented Dec 31, 2019

mattklein123 commented Dec 31, 2019

cstrahan commented Jan 2, 2020

ofek commented Jan 2, 2020

mattklein123 commented Jan 3, 2020

ofek commented Jan 3, 2020

cstrahan commented May 7, 2020 • edited Loading

A proposal to secure the admin endpoint

mattklein123 commented May 7, 2020

htuch commented Jan 11, 2021

geo-y commented Feb 14, 2021 • edited Loading

tpetkov-VMW commented Mar 1, 2022

maxres-ch commented Feb 8, 2023

jmarantz commented Mar 18, 2024

ira-gordin-sap commented Sep 9, 2024

jmarantz commented Mar 10, 2018 •

edited

Loading

ofek commented Mar 12, 2018 •

edited

Loading

jmarantz commented Mar 14, 2018 •

edited

Loading

cstrahan commented May 7, 2020 •

edited

Loading

geo-y commented Feb 14, 2021 •

edited

Loading