Batch System Jobs #2527
Hey, I updated the title, and yes, this is on the roadmap, but timeline-wise it may be a bit further out.
Consul has an exec command that may help (just in case you are using Consul).
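(For reference, `consul exec` broadcasts a shell command to agents across the cluster, e.g. `consul exec -service web uptime`; the service name here is illustrative. It is a rough stand-in for a cluster-wide one-off task, though without Nomad's scheduling, logging, and status tracking.)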
@dadgar thanks!
My use case is to update DNS entries when a certain service starts.
We would love to use Nomad to replace our use of salt-master and other CM tools, but it's difficult to do this without a batch-system job, or some way of running a one-off job. To be clear, using Nomad is so great that I'd like to have it handle all of my "run X on Y" problems. @dadgar / @schmichael: does this require a new job type, or does it fit within the existing schedulers? Is this relatively easy to implement, possibly a good task for someone wanting to learn about Go, or too difficult?
@ketzacoatl It would have to be a new scheduler type, so this is a pretty significant amount of work. It is also unclear what the expectation is if a node doesn't have enough resources to run the given job: would you want it to keep trying to place forever? What if the node is fully filled with service jobs that don't migrate often (once a month)? Does the system batch job try to keep placing it for that entire duration?
@dadgar Ah, ok, I did not realize this would be more like a new scheduler. I guess that makes the design process a little easier. I like the questions you've brought up, and I think there are probably others to ask, but to start, here are a few answers and details on the use-cases I am targeting.

In Nomad's world, a "batch system job" could be reworded as "run this thing once, on all hosts". A close variant is "run this thing periodically, on all hosts" (a "periodic batch system job"; I think there's another ticket for that, and I'll update this comment if I find it again). I'm not sure whether the periodic and run-once batch-system jobs should be one scheduler type, but I think it's worth considering.

In my world, I want to "run X across the fleet(s) when Y happens". X is sometimes "apply a patch" or "update all users on the host", or similar. These actions are boxed up as CM formula, and otherwise applied with a single shell command, script, etc. Y might be something an operator is involved with (e.g., a security event or a new user joining the team), Y might be a completely automated reaction to some other happening in the deployment (e.g., a Consul event, a change in the catalog, a lost instance), or it could just be a periodic task to run ("apply all formula, just make sure you are good and clean", or some scanning task).

At the moment, in these scenarios, I am using a combination of Consul and CM tooling. While it is functional and "works", there are a lot of administrative annoyances that would be addressed by Nomad's features (for example, checking job status, logs, stats, resource reservation and allocation, the UI, etc). For example, in the past I've configured Consul with a watch on some stuff in the catalog, triggering CM tooling to "apply some formula" (e.g., if a "users" config changed, apply the "users" formula). It works well, and lets me manage a lot of systems through git with very little overhead, but using Nomad for this would provide numerous advantages and assurances. It's also worth noting that some of these tasks are "run on all hosts", while others target specific hosts, but I would expect to use constraints for that targeting. Integrating with Consul, and servicing automated reactions, a batch system job could be submitted which "runs whenever X changes in the Consul catalog": Nomad monitors Consul and runs a new instance of the batch job when the watch triggers.

Here are a few more specific answers to your questions. First, I think those details are worked out in the job spec. For example, I could say "run once, optimistically, and fail fast, and tell me which nodes/tasks failed where" via parameters such as "fail-fast" (don't keep retrying; if a node isn't able to run the task, just tell me) or "retry based on X policy". If a node does not have enough resources, the job operator could configure the job to fail the first time, to continue retrying the task until placement is possible, or to retry for some amount of time X. Another option would be to allow "draining a node" first if there are not enough resources (which looks a bit like a rolling deploy). Overall, I think the goal here is to provide operators with a means of running one-off (and periodic) tasks across all hosts, and to make it reasonably easy for an operator to see what happened (what failed, what do I need to poke at some more).
👍 Hope this feature is implemented soon.
Chiming in with our desired use-case: running …
Hope this feature is implemented soon. My use-cases: host backups and warming up nodes with recent Docker images.
I'm also looking forward to this feature; cluster-wide cleanup jobs would benefit greatly from a "system batch" job type.
Adding a use case here. I realized recently that I needed to clean up /var/lib/docker/overlay2 more aggressively than I had previously thought. In my ideal world, I would do this as part of a system batch task that runs once a week and GCs that directory. The root cause is actually that my images all include a particular data file, which I could remove if I had a way of ensuring that it was always present on the host and within 2 days of its release timestamp, again something I could do if I had a batch job available. For both of these, I believe batch-periodic is what I would want, but I could also just have another batch task that deploys the system batch job on a schedule. I would consider this a defect in Nomad, but given that it already has a periodic scheduler, it would be a decent workaround.
Another use case would be a patching activity we want to perform on Nomad nodes in a large cluster.
My use case for this is to do a "yum update". I achieved this by having a simple system job. The shell script is in a |
My use case for this is to do a "brew install". |
Would like to have this for package maintenance on the Nomad nodes.
This would be very useful for little cleanup jobs and file transfers.
+1 on this feature. Updating packages would be much easier.
adding a "+1" to the first post would be beneficial ... that's how HashiCorp tracks interest in a ticket/feature .... (AFAIK) |
Our use case would be launching periodic backups on all servers. We use Borg backup, which is strictly run from the client, and being able to create a periodic batch-system backup job would give us a better overview of this process.
My use case is to register nodes in some external monitoring app. Since there is no lifecycle that can be executed with hook="poststart", I need a batch job that can be executed cluster-wide.
Coming soon - on our roadmap!
Amazing! This will clean up so much stuff in the Hashistack use cases.
@yishan-lin As this is more batch-oriented, would "wait for" semantics also be part of the feature?
That's a good question. Ideally, the lifecycle hooks in my mind (e.g., the PostStart and PostStop hooks coming in the next month with our 0.12.x patches) would help address the "wait for" semantics across all schedulers, not just system/batch. Would love to continue hearing thoughts and use-cases from all on this in detail, as that would greatly help our design for this feature and ensure we build for everyone's immediate success (which should be coming quite soon). We'd be looking to address this and #4740, #4072, and #4267 in one fell swoop - not in the same feature, of course, but in the same timeframe.
My overarching thoughts/ideas basically come from a Jenkins Pipeline thought process with the Git plugin. A full-featured git plugin (feature-equivalent to the Jenkins plugin) would be a super excellent addition. For now, I am (sort of) making do with the git clean, fetch, pull, whatever commands using a …
This PR implements a new "System Batch" scheduler type. Jobs can make use of this new scheduler by setting their type to 'sysbatch'. As the name implies, sysbatch can be thought of as a hybrid between system and batch jobs: it is for running short-lived jobs intended to run on every compatible node in the cluster. As with batch jobs, sysbatch jobs can also be periodic and/or parameterized dispatch jobs. A sysbatch job is considered complete when it has run on all compatible nodes until reaching a terminal state (success, or failed on retries). Feasibility and preemption are governed the same as with system jobs. In this PR, the update stanza is not yet supported; the update stanza is still limited in functionality for the underlying system scheduler, and is not yet useful for sysbatch jobs. Further work in #4740 will improve support for the update stanza and deployments. Closes #2527
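Going by the description above, a minimal periodic sysbatch job might look like the following sketch; the weekly Docker cleanup command and the cron expression are illustrative assumptions, not taken from the PR.

```hcl
# Sketch of a periodic sysbatch job, per the PR description above.
# The cleanup command and schedule are illustrative assumptions.
job "docker-gc" {
  datacenters = ["dc1"]
  type        = "sysbatch" # run to completion once on every compatible node

  periodic {
    cron             = "0 3 * * 0" # weekly, Sundays at 03:00
    prohibit_overlap = true
  }

  group "gc" {
    task "prune" {
      driver = "raw_exec" # the docker CLI needs host access

      config {
        command = "/usr/bin/docker"
        args    = ["system", "prune", "-f"]
      }
    }
  }
}
```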
Hi @yishan-lin, I see …
Possibly related: #1944.
Yahoo! This is awesome, thank you!
Currently, for running short-lived jobs there is the batch scheduler, and for cluster-wide jobs (one per node) there is the system scheduler.
But sometimes you may want a short-lived job (one that exits fast, not running any daemon) that is propagated across all the nodes. One example would be a job that changes some kind of config on the nodes.
Would it be possible to either support this kind of job in the system scheduler, or allow the batch scheduler to run on all nodes?
Thanks