unable to retrieve IAM credentials in sagemaker serverless inference #737
I am unfortunately having an issue with SageMaker inference, but only with serverless inference. I am deploying a model using the standard vetiver functions for doing so (ref: https://juliasilge.com/blog/vetiver-sagemaker/), along with some slight changes to the endpoint config to make it serverless. The vetiver deployment works perfectly with real-time inference, but when I change to serverless it fails because paws can't find any credentials.

I wonder if there is anything special that should be done when building a Docker image for serverless inference, or if this is just a paws issue. Thanks! Apologies for double posting.
Hi @ncullen93, sorry to hear that. Is it possible for you to include an example of what you did? I would like to reproduce it :)
Of course, thanks! Note that to create the serverless endpoint, I had to alter the endpoint config vetiver creates. So you can either install my fork with `devtools::install_github('ncullen93/vetiver-r')` or skip the last step below and create the serverless endpoint in AWS yourself (that fails too):

```r
library(tidymodels)
library(pins)
library(vetiver)

## Fit a basic model
data(ames)
set.seed(123)
ames_split <-
  ames %>%
  mutate(Sale_Price = log10(Sale_Price)) %>%
  mutate_if(is.integer, as.numeric) %>%
  initial_split(prop = 0.80, strata = Sale_Price)
ames_train <- training(ames_split)
ames_test <- testing(ames_split)

rf_spec <-
  rand_forest(trees = 1000) %>%
  set_engine("ranger") %>%
  set_mode("regression")
rf_wflow <-
  workflow(
    Sale_Price ~ Neighborhood + Gr_Liv_Area + Year_Built + Bldg_Type +
      Latitude + Longitude,
    rf_spec
  )
rf_fit <- rf_wflow %>% fit(data = ames_train)

# Turn it into a vetiver model
v <- vetiver_model(rf_fit, "ames-pricing")

# Write the model to an S3 board -> needs a bucket and credentials here
board <- pins::board_s3(
  bucket = 'sagemaker-vetiver',
  access_key = Sys.getenv("AWS_ACCESS_KEY_ID"),
  secret_access_key = Sys.getenv("AWS_SECRET_ACCESS_KEY"),
  region = 'us-east-2'
)
vetiver_pin_write(board, v)

## START SAGEMAKER-SPECIFIC ACTIONS FROM VETIVER ##

# Build the Docker image and upload it to ECR -> works fine
new_image_uri <- vetiver_sm_build(board, "ames-pricing")

# Create the SageMaker model -> works fine
model_name <- vetiver_sm_model(new_image_uri)

# Create the endpoint -> my fork alters only the config in this function
# to be serverless, but it fails when plumber is run because paws can't
# find credentials. Install my fork with
# devtools::install_github('ncullen93/vetiver-r') to see for yourself.
# You can also skip this step and create the serverless endpoint in the
# AWS console; that fails too. (The 'ml.t2.medium' instance type is
# ignored for a serverless endpoint.)
new_endpoint <- vetiver_sm_endpoint(model_name, 'ml.t2.medium')
```
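For context, the only functional change in the fork is the endpoint config: a serverless production variant carries a `ServerlessConfig` block instead of an instance type. A minimal sketch of that idea using paws directly (the config name and the memory/concurrency values here are illustrative, not what vetiver actually generates):

```r
library(paws.machine.learning)

sm <- sagemaker()

# A serverless production variant replaces InstanceType/InitialInstanceCount
# with a ServerlessConfig block; the values below are just examples.
sm$create_endpoint_config(
  EndpointConfigName = "ames-pricing-serverless",  # illustrative name
  ProductionVariants = list(
    list(
      ModelName        = model_name,
      VariantName      = "AllTraffic",
      ServerlessConfig = list(
        MemorySizeInMB = 2048,  # 1024 to 6144, in 1 GB increments
        MaxConcurrency = 5
      )
    )
  )
)
```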
Thanks for the example code :) I will have a little look at it to see why it is failing :)
@ncullen93 I nearly forgot: do you have any logs or errors?
Yes, nothing too informative, I'm afraid.
@ncullen93 no worries, I will have a look now :)
@ncullen93 do you get a successful endpoint build, and does this error only happen when attempting predict?
Interesting... it seems related to the credentials. I wonder if there is something that must be changed in the Dockerfile. It's hard to find any documentation on the difference between real-time and serverless inference from a model perspective.
No, the endpoint build fails at the very end, just like yours.
Hmm, I wonder if it is down to paws only looking at the IPv4 address for IAM credentials, and we need to include support for IPv6 🤔
Found it: we need to support the `AWS_CONTAINER_CREDENTIALS_FULL_URI` environment variable.
This will then get the credentials :D I will have a look at implementing this shortly :D
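Roughly speaking, the container credential provider needs to do something like this (a simplified sketch of the lookup logic, not the actual paws.common implementation):

```r
# Simplified sketch: SageMaker serverless exposes the credentials URI via
# AWS_CONTAINER_CREDENTIALS_FULL_URI, whereas ECS-style containers use the
# relative variant against the fixed metadata IP.
get_container_credentials <- function() {
  full_uri     <- Sys.getenv("AWS_CONTAINER_CREDENTIALS_FULL_URI")
  relative_uri <- Sys.getenv("AWS_CONTAINER_CREDENTIALS_RELATIVE_URI")
  url <- if (nzchar(full_uri)) {
    full_uri
  } else if (nzchar(relative_uri)) {
    paste0("http://169.254.170.2", relative_uri)
  } else {
    return(NULL)  # fall through to the other credential providers
  }
  resp <- httr::GET(url)
  httr::stop_for_status(resp)
  # Response carries AccessKeyId, SecretAccessKey, Token, Expiration
  httr::content(resp, as = "parsed", type = "application/json")
}
```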
That sounds promising! Happy to test it whenever; I appreciate the help immensely.
@ncullen93 I believe I have a solution. Please try out:

```r
remotes::install_github("dyfanjones/paws/paws.common", ref = "env_container_cred_full_uri")
```

And let me know how you get on :)
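A quick way to verify that the container can now resolve credentials is a bare STS call from inside it (any paws client would do):

```r
# If credentials now resolve, this returns the role's account and ARN
# instead of a "no credentials" error.
sts <- paws.security.identity::sts()
sts$get_caller_identity()
```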
It works! You are an absolute mad lad. I'm having a little trouble making predictions from the endpoint using this `predict.vetiver_endpoint_sagemaker` method, though:

```r
predict.vetiver_endpoint_sagemaker <- function(object, new_data, ...) {
  check_installed(c("jsonlite", "smdocker", "paws.machine.learning"))
  data_json <- jsonlite::toJSON(new_data, na = "string")
  config <- smdocker::smdocker_config()
  sm_runtime <- paws.machine.learning::sagemakerruntime(config)
  tryCatch(
    {
      resp <- sm_runtime$invoke_endpoint(object$model_endpoint, data_json, ...)
      resp <- resp$Body
    },
    error = function(error) {
      error_code <- error$error_response$ErrorCode
      if (!is.null(error_code) && error_code == "NO_SUCH_ENDPOINT") {
        cli::cli_abort("Model endpoint {.val {object$model_endpoint}} not found.")
      }
      stop(error)
    }
  )
  con <- rawConnection(resp)
  on.exit(close(con))
  resp <- jsonlite::fromJSON(con)
  return(tibble::as_tibble(resp))
}
```

Still, I know the endpoint works because invoking it from Python returns predictions. You can see there is a slight addition (the `ContentType` argument) to the recommended way of invoking a serverless SageMaker endpoint compared with real-time. Python example from AWS:

```python
response = runtime.invoke_endpoint(
    EndpointName=endpoint_name,
    ContentType=content_type,  # this is added for serverless: e.g., "application/json"
    Body=payload               # e.g., bytes('__some json__', 'utf-8')
)
```

In any case, I will invoke the endpoints from Python anyway, so not a big deal I think. Really appreciate the help.
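For reference, the direct R analogue of that Python call via paws would be something along these lines (the endpoint name and payload are illustrative):

```r
library(paws.machine.learning)

runtime <- sagemakerruntime()

# Same shape as the Python example above: ContentType is passed explicitly
# alongside the JSON payload when invoking a serverless endpoint.
resp <- runtime$invoke_endpoint(
  EndpointName = "ames-pricing",                     # illustrative name
  ContentType  = "application/json",
  Body         = charToRaw('{"Gr_Liv_Area": 1500}')  # illustrative payload
)
rawToChar(resp$Body)
```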
That is great news :) I can get this into the latest paws.common 0.7.0 release (#720). Does adding ContentType work from paws when invoking the endpoint as well? It is possibly worth raising a PR on vetiver to get the serverless method enabled. @juliasilge would vetiver be interested in extending its SageMaker support with serverless stuff? I am more than happy to contribute again :)
@ncullen93 you could also try sending the data across as a raw vector, so:

```r
data_json <- charToRaw(jsonlite::toJSON(new_data, na = "string"))
```

Let me know how you get on :)
I will try it. I think that should work.
@ncullen93 Just had a little play and the following worked for me:

```r
predict(new_endpoint, ames_test, ContentType = "application/json")
```

This is using the standard vetiver predict method.
Note: paws.common 0.7.0 has been released to CRAN.
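With that release, the fix should no longer need the GitHub branch:

```r
# paws.common >= 0.7.0 includes the AWS_CONTAINER_CREDENTIALS_FULL_URI support
install.packages("paws.common")
packageVersion("paws.common")  # expect >= 0.7.0
```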
Thank you so much for your continued support on this @DyfanJones! Do you think I should set `ContentType = "application/json"` by default in vetiver's predict method?
Yeah, I agree. If anything we could have it in the parameters for the predict method:

```r
predict.vetiver_endpoint_sagemaker <- function(object, new_data, content_type = "application/json", ...) { }
```
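Filling in that signature with the pieces from this thread, the body would pass `content_type` straight through to `invoke_endpoint()`. A sketch based on the function posted earlier (error handling omitted; not merged vetiver code):

```r
predict.vetiver_endpoint_sagemaker <- function(object, new_data,
                                               content_type = "application/json",
                                               ...) {
  # Raw JSON payload, as suggested above
  data_json  <- charToRaw(jsonlite::toJSON(new_data, na = "string"))
  config     <- smdocker::smdocker_config()
  sm_runtime <- paws.machine.learning::sagemakerruntime(config)
  resp <- sm_runtime$invoke_endpoint(
    EndpointName = object$model_endpoint,
    Body         = data_json,
    ContentType  = content_type,
    ...
  )
  con <- rawConnection(resp$Body)
  on.exit(close(con))
  tibble::as_tibble(jsonlite::fromJSON(con))
}
```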