[BUG] - CPU usage 100% in WaitMode #333

aibotsoft · 2022-05-25T08:19:07Z

With this test code:

package main

import (
	"github.com/go-co-op/gocron"
	"time"
)

func job() {
	time.Sleep(time.Second * 3)
}

func main() {
	s := gocron.NewScheduler(time.UTC)
	s.SetMaxConcurrentJobs(1, gocron.WaitMode)
	s.Every(time.Second * 5).Do(job)
	s.Every(time.Second * 5).Do(job)
	s.StartBlocking()
}

I get 100% CPU utilization (at least one core) which is equivalent to an infinite loop:

package main

func main() {
	for {
		
	}
}

Version
v1.13.0

Expected behavior
While waiting for a free executor, the processor should not be loaded at 100%

The text was updated successfully, but these errors were encountered:

AlexanderSutul · 2022-05-27T20:06:04Z

@JohnRoesler Is that a good approach to add a delay in 1 * nanosecond? It's just a fastest solution I can suggest for now 😊

AlexanderSutul · 2022-05-28T22:40:27Z

@aibotsoft Hey! Are you sure it's a problem? I just checked that 100% here doesn't effect any I/O operations. So, yeah it's 100% but it's like fake 100% of usage. So, it's just flooded OS scheduler.
Correct me, if I'm wrong, please. (links for articles about that are appreciated)

aibotsoft · 2022-05-29T10:00:55Z

Maybe you are right, but I am sure that the system should not be overloaded with such a simple task as cron, not with the jobs themselves, but with the cron scheduler.

I think there is an inaccuracy here, in the executor.go file:

if !e.maxRunningJobs.TryAcquire(1) {
  switch e.limitMode {
  case RescheduleMode:
	  return
  case WaitMode:
	  for {
		  select {
		  case <-stopCtx.Done():
			  return
		  case <-f.ctx.Done():
			  return
		  default:
		  }
  
		  if e.maxRunningJobs.TryAcquire(1) {
			  break
		  }
	  }
  }
}

Scheduler trying to get a free executor, if there is no free one, in RescheduleMode we just leave, but in WaitMode we enter an endless loop that constantly tries to get the executor using the TryAcquire method, which simply returns bool without blocking.

Perhaps instead of the TryAcquire method, we should use the Acquire method, which acquires the semaphore blocking until resources are available or ctx is done. In this case, you don't need an infinite loop.

AlexanderSutul · 2022-05-29T13:57:43Z

@aibotsoft Thanks for your thoughts. Let's see what I can do. I'll try to fix it till the end of next week.

JohnRoesler · 2022-06-14T11:08:18Z

@aibotsoft this is interesting! The reason it is done the way currently is to allow checking the state of both the scheduler and individual job context states. The Acquire method allows waiting for a single context to be done, but I notice even in that case, it can still run the job even after the context done occurs:

// Acquired the semaphore after we were canceled. Rather than trying to
// fix up the queue, just pretend we didn't notice the cancelation.

Now maybe that's a cost worth paying to lower CPU. I haven't tried it yet with this method, but it looks to be doing something similar to the executor logic under the hood.

aibotsoft added the bug Something isn't working label May 25, 2022

mhxw mentioned this issue Jun 15, 2022

[BUG] - Program level exit #341

Closed

mistu4u mentioned this issue Oct 14, 2022

fix for high cpu usage #386

Merged

2 tasks

JohnRoesler closed this as completed Oct 17, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[BUG] - CPU usage 100% in WaitMode #333

[BUG] - CPU usage 100% in WaitMode #333

aibotsoft commented May 25, 2022

AlexanderSutul commented May 27, 2022 •

edited

Loading

AlexanderSutul commented May 28, 2022

aibotsoft commented May 29, 2022

AlexanderSutul commented May 29, 2022 •

edited

Loading

JohnRoesler commented Jun 14, 2022

[BUG] - CPU usage 100% in WaitMode #333

[BUG] - CPU usage 100% in WaitMode #333

Comments

aibotsoft commented May 25, 2022

AlexanderSutul commented May 27, 2022 • edited Loading

AlexanderSutul commented May 28, 2022

aibotsoft commented May 29, 2022

AlexanderSutul commented May 29, 2022 • edited Loading

JohnRoesler commented Jun 14, 2022

AlexanderSutul commented May 27, 2022 •

edited

Loading

AlexanderSutul commented May 29, 2022 •

edited

Loading