Task Run Command Design #702

kohidave · 2020-02-27T21:21:28Z

Some parts of customers workflows don't fall neatly into services, cron jobs, or triggers. Occasionally, customers need to run one-off tasks (think creating a database in a newly minted RDS instance, running a query against their instance, or just for debugging). This design talks about how we can enable this for customers.

What category is this command?

I believe the tasks domain should be under a new category called Operate. These commands are focused around one-off executions that are a normal part of operations, but may not be part of a customers day to day workflow.

How will it work?

copilot task run

By default, this will build the local image. Then, if the customer is working within a workspace associated with a project, we can prompt them to choose an environment.

copilot task run 
Which environment would you like to run this task?
> Test
> Prod
> None (runs in your default VPC)

If they select an existing environment, we can set up an ECR repo for on-off tasks real quick (or
bake the one-off repo into our environment template), push the image, and launch the task on Fargate with a sensible default configuration.

If the customer doesn't have any projects set up (or they are not in a workspace) - we can just deploy the task to the Default ECS cluster and the Default VPC. Since this is a lower level command, we can expose flags like subnet or security group.

For this command, you don't have to have set up any ecscli projects.

Optional Flags

--num (the number of tasks to set up)
--cpu (the cpu-units)
--memory (the amount of memory)
--image (an optional image to run, instead of building a dockerfile)
--task-role (optional role for the task to use when running. If running in an environment, the default can be one of our roles like the env task role)
--env, --project (for selecting the environment and project to run this in)
--subnet-id, security-group-id (for choosing a vpc)
--env-vars (key/value list of environment variables)
--command (list of commands which are passed to docker run)

Some examples might be

# Builds, and pushes the local docker file to an ECR repo in environment test. This is run within
# a workspace, so we don't need to provide the --project flag. The VPC will be the env's VPC
copilot task run --env test
✔️ Great, we started your task _i3df4cg43_
$ run copilot task logs --task _i3df4cg43_ to view your task's logs.

# If not run in a workspace, builds and executes the image in the default VPC/Cluster.
copilot task run

# Runs in the default VPC (if not in a WS). Spins up 4 tasks with 256vcpu, 2GB mem. Runs a 
# particular image (kohidave/ddb-parallel-scan-backup), passes in environment roles, and uses
# a particular role for the task to executed under. 
copilot task run --num 4 
                     --cpu 256 --memory 2048
                     --image kohidave/ddb-parallel-scan-backup 
                     --env-var bucket,davids-ddb-output-s3-bucket 
                     --task-role ddb-backup-role

# Same as above, but overwrites the default commands in the Dockerfile
copilot task run --num 4 
                     --cpu 256 --memory 2048
                     --image kohidave/my-service 
                     --env-var bucket,davids-ddb-output-s3-bucket 
                     --task-role ddb-backup-role
                     --commands rake db migrate

Isn't this like the Fargate CLI?

Yeap! Almost exactly like that. But we can also include integrations into our view of environments so folks can run their tasks in a particular environment (ECS cluster / VPC). This is super useful if you have resources within a particular VPC that can only be accessed from within the VPC.

What next?

We'll have to follow up with visibility features for tasks. Logs, list, PS, etc.

Questions

Should we do anything to limit folks from running commands in production environments?

The text was updated successfully, but these errors were encountered:

kohidave · 2020-02-27T21:24:11Z

Other Example Usecases

Calling a private service API (not exposed through the LB, but through service discovery)
Querying a memcached/redis instance within an environment's VPC
Running a task on Fargate quickly
Running a one-off script which requires secrets, but you don't want to copy secrets to your dev machine

JoeAlamo · 2020-02-28T16:35:39Z

Being able to override the default command would be great.

For example, I have an image / task definition for a Django container that I use for my API. By default it will serve HTTP requests. But if I can pass an override, I can run admin tasks using manage.py, on the same codebase etc. just as if I was running it locally or on a server.

kohidave · 2020-02-29T00:03:22Z

Being able to override the default command would be great.

For example, I have an image / task definition for a Django container that I use for my API. By default it will serve HTTP requests. But if I can pass an override, I can run admin tasks using manage.py, on the same codebase etc. just as if I was running it locally or on a server.

All right - added a "commands" flag to address this. Good call. Thanks a bunch.

aidansteele · 2020-03-10T09:27:57Z

This would be amazing.

First thought: It would be amazing to have a "synchronous" mode. I'm thinking something like ecs task run --wait .... It would do the following:

Execute run as described by @kohidave.
Stream logs from the task.
Waits until the task terminates. Because this is for tasks that terminate, right?
The CLI returns the task's container's exit code itself. So a failure ECS-side causes failure on CLI-side. Great for running within a pipeline.

Second thought: It would be 🤯 if this was combined with ECS Cloud Debug's support for shelling into a running task. Then I could run a handful of commands, type exit and have the container be terminated.

Louise15926 · 2020-06-04T20:33:59Z

Some thoughts on validating flags such as subnet-id, security-group-ids, and task-role.

It'd be easier to wait until Execute stage to "validate" these flags - the API call that deploys the task using the task definition will return an error if any of these arguments are invalid and prevent deployment.
Validate the arguments at Validate stage: several API calls need to be made in order to validate these arguments. For example, to validate subnet-id, we'd probably need to make API call to DescribeSubnets filtered by VPC id (which is either found via environment or is the default VPC).
Other options that I don't know 💭 💭 ...

Any thought or comment?

efekarakus · 2020-06-04T21:40:55Z

It'd be easier to wait until Execute stage to "validate" these flags - the API call that deploys the task using the task definition will return an error if any of these arguments are invalid and prevent deployment.

Your recommendation sounds good to me 👍 let's go for it

Louise15926 · 2020-06-11T17:32:15Z

Current behavior of the --app and --env flags:

If --app is specified:
--- If currently under a workspace
--- Inconsistent with current workspace (workspace’s app is a, but specified b) => is a valid input; use b instead of a as app
--- consistent => nothing special to do
--- If not in a workspace
--- the specified app exists => valid input
--- if doesn't exist => invalid input; throw error

If --env is specified:
--- If --app is specified and is valid
--- check if env exists under the app; error if it doesn’t
--- If --app is not specified
--- if currently under a workspace => Check if env exist; error if not
--- If not under a workspace => invalid input; throw error

This PR adds validate to task run command Related #702 By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.

efekarakus · 2020-06-22T18:36:45Z

One thing that would be neat after we deliver the proposed version is adding a new flag like --spot-count. That leverages the FARGATE_SPOT capacity provider: https://aws.amazon.com/blogs/compute/deep-dive-into-fargate-spot-to-run-your-ecs-tasks-for-up-to-70-less/

I'm thinking:

copilot task run --count 5 runs 5 regular fargate tasks.
copilot task run --spot-count 5 tries to run up to 5 fargate spot tasks.
copilot task run --count 5 --spot-count 5 runs 5 regular fargate tasks and up to 5 fargate spot tasks.

(Related #1044)

This PR has two major changes: 1. add ask to task run; 2. add a required flag for task-family name, update validate and ask correspondingly  Related #702 By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.

…or task (#1101) This PR adds a CloudFormation template for a task stack, and implement stack configuration methods for task. Related #702 By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.

…1109) This PR includes the following changes to aws package: 1. adds a ec2 package to get subnet and security groups 2. add runtask and defaultCluster method to ecs package Related #702 By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.

#1134) This PR includes changes to deploy a task to CloudFormation. Related #702 By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.

toricls · 2020-07-14T04:47:34Z

I'm not sure it's related to this issue or not, but as an example use-case of standalone ECS task, sometime I use Step Functions' state machine to wrap a task to ensure the task to be retried when it fails.

This PR adds a task package that runs one-off tasks in different configurations, including running in the default cluster and subnets, running in a Copilot environment, and running in specified subnets and security groups. Related #702 By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.

This PR implements a task runner that runs tasks in user provided subnets and security groups. Related #702 By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.

This PR contains the following changes to aws/ec2 package: 1. adds a new method that gets public subnets with optional filters; this is to support `task run` in copilot environment 2. rename methods to conform to [Go Best Practice](https://golang.org/doc/effective_go.html#Getters) 3. refactor redundant code Related #1109, #702 By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.

This PR implements a task runner that runs tasks in a copilot environment. Related #702 By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.

This PR builds and pushes image and updates image to task definition for `task run`. This is a part of the series of PRs that implement `Execute` for `task run`. Related #702 By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.

This PR starts the task(s) for `task run`. This is a part of the series of PRs that implement Execute for task run. Related #702 By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.

This PR adds a `--default` flag for `task run`. Should be merged after #1216. Related #702 By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.

This PR contains the following changes: 1. implement `--follow` flag for `task run` 2. implement `EarliestStartTime` method in `task` package in support of `--follow` 3. wrap `aws-sdk-go/ecs` constant `DesiredStatusStopped` in `ecs` package in support of `--follow` Should be merged only after #1212 Related #702 By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.

efekarakus · 2020-08-12T21:57:17Z

This feature will be released in v0.3.0, we'll have a separate design to incorporate step functions to a task :).

kohidave added WIP Pull requests that are being modified now. type/design Issues that are design proposals. labels Feb 27, 2020

This was referenced Jun 11, 2020

Add validate to task run Louise15926/amazon-ecs-cli-v2#1

Merged

feat(cli): add validate for task run #1015

Merged

Louise15926 mentioned this issue Jun 19, 2020

feat(cli): implement ask for task run #1036

Merged

piradeepk assigned Louise15926 and piradeepk Jun 29, 2020

Louise15926 mentioned this issue Jun 30, 2020

feat(cli): implement exec for task run #1077

Closed

This was referenced Jul 9, 2020

chore(stack): add task to deploy/cfn/stack package and add template for task #1101

Merged

chore(aws): modify ecs and ec2 client #1102

Closed

Louise15926 mentioned this issue Jul 10, 2020

chore(aws): update ecs and ec2 packages to support task run command #1109

Merged

paulxuca mentioned this issue Jul 13, 2020

Ability to start an interactive session with deployed containers #1133

Closed

Louise15926 mentioned this issue Jul 13, 2020

chore(deploy): update deploy/cfn package to support task run command #1134

Merged

Louise15926 mentioned this issue Jul 14, 2020

chore(task): implement task package that runs one-off tasks #1139

Merged

Louise15926 mentioned this issue Jul 15, 2020

chore(task): run task in provided subnets and security groups #1145

Merged

Louise15926 mentioned this issue Jul 15, 2020

chore(ec2): get public subnets to support task run #1146

Merged

Louise15926 mentioned this issue Jul 16, 2020

chore(task): run task in environment #1155

Merged

Louise15926 mentioned this issue Jul 22, 2020

chore(cli): update image to task for task run #1183

Merged

Louise15926 mentioned this issue Jul 24, 2020

chore(cli): start tasks for task run #1190

Merged

This was referenced Jul 28, 2020

chore(cli): implement follow flag #1200

Merged

chore(cli): add --default flag for task run #1214

Merged

efekarakus closed this as completed Aug 12, 2020

malkovro mentioned this issue Sep 29, 2020

Feature request: easy way to use a service’s task role when running a one-off task #1432

Open

nbarrera mentioned this issue Jun 25, 2021

access task run command's exit code #2525

Closed

efekarakus moved this to Complete in Copilot Roadmap 🗺️ Nov 13, 2022

efekarakus added this to Copilot Roadmap 🗺️ Nov 13, 2022

fortran01 mentioned this issue Dec 20, 2022

add --spot-count for task run on FARGATE_SPOT capacity provider #4293

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Task Run Command Design #702

Task Run Command Design #702

kohidave commented Feb 27, 2020 •

edited

Loading

kohidave commented Feb 27, 2020

JoeAlamo commented Feb 28, 2020

kohidave commented Feb 29, 2020

aidansteele commented Mar 10, 2020

Louise15926 commented Jun 4, 2020

efekarakus commented Jun 4, 2020

Louise15926 commented Jun 11, 2020

efekarakus commented Jun 22, 2020 •

edited

Loading

toricls commented Jul 14, 2020

efekarakus commented Aug 12, 2020

Task Run Command Design #702

Task Run Command Design #702

Comments

kohidave commented Feb 27, 2020 • edited Loading

What category is this command?

How will it work?

Optional Flags

Isn't this like the Fargate CLI?

What next?

Questions

kohidave commented Feb 27, 2020

Other Example Usecases

JoeAlamo commented Feb 28, 2020

kohidave commented Feb 29, 2020

aidansteele commented Mar 10, 2020

Louise15926 commented Jun 4, 2020

efekarakus commented Jun 4, 2020

Louise15926 commented Jun 11, 2020

efekarakus commented Jun 22, 2020 • edited Loading

toricls commented Jul 14, 2020

efekarakus commented Aug 12, 2020

kohidave commented Feb 27, 2020 •

edited

Loading

efekarakus commented Jun 22, 2020 •

edited

Loading