Only register queue workers using drb for dequeue #19829

carbonin · 2020-02-11T22:31:45Z

Workers should only actually attempt to add themselves to the @workers hash if they are queue workers, if they are using drb to fetch queue messages, and the drb server is available. In the past, this process was tied into heartbeating over drb and was thus skipped when using files for heartbeating. After the switch was made to heartbeat to files exclusively, this registration started running even when using the run_single_worker script from the command line.

This lead to issue #19793. This PR fixes the issue by ensuring that workers are only registered when they are going to dequeue messages over drb and only allowing workers to dequeue over drb when the server is actually available.

Additionally because heartbeating is now completely separate from worker registration I moved the #register_worker method out of the heartbeat concern and into the dequeue one.

Now that heartbeating is done through a file uncondidionally this method to add a worker to the drb workers list is only applicable to dequeue

jrafanie · 2020-02-11T22:36:09Z

app/models/miq_queue_worker_base/runner.rb

+    @drb_dequeue_available ||=
+      begin
+        server.drb_uri.present? && worker_monitor_drb.respond_to?(:register_worker)
+      rescue DRb::DRbError


Was there something else in this block before? How would DRb::DRbError be raised? We're not doing anything with drb, right?

ah, I see, we used to call worker_monitor_drb.register_worker which could raise that exception...

Right, so in this case I just need some call to the drb object to raise to tell if it's available or not. I used a respond_to? here because that proxies over to the server side without actually doing anything.

Exactly. I don't think the begin/rescue is needed here.

(it's needed in the method above that calls register_worker on the drb object)

This is very much needed I think. respond_to? is proxied to the remote drb object so it will raise rather than return false if the drb server is inaccessible.

I would love a way to tell if the drb server is up that didn't rely on exception handling, do you know of something like that?

@agrare do you know if there's anything like that?

My bad. I forgot the drb object was proxied and could raise here. Oh DRb.

Not that I'm aware of, in miq_fault_tolerant_vim which wraps MiqVim to deal with DRb "issues" we just catch DRb::DRbConnError and retry if the broker is on a new port

jrafanie · 2020-02-11T22:37:32Z

app/models/miq_worker/runner.rb

@@ -288,8 +288,6 @@ def heartbeat
      _log.info("#{log_prefix} Synchronizing configuration complete...")
    end

-    register_worker_with_worker_monitor unless MiqEnvironment::Command.is_podified?


yay, that is_podified? removal must feel good...

jrafanie

Other than the comment about the unneeded exception handling, it looks really good.

jrafanie · 2020-02-17T15:09:00Z

spec/models/miq_queue_worker_base/runner_spec.rb

+        end
+      end
+    end
+


…able The worker monitor only needs drb for queue message prefetch now so only queue workers should be registered. Additionally, make run_single_worker work from the command line (without a server process) by ensuring that we can connect to the DRb server before trying to communicate with it. Fixes ManageIQ#19793

miq-bot · 2020-02-17T15:30:44Z

Checked commits carbonin/manageiq@ef2d304~...3928c43 with ruby 2.5.7, rubocop 0.69.0, haml-lint 0.20.0, and yamllint
5 files checked, 2 offenses detected

app/models/miq_server/worker_management/dequeue.rb

❗ - Line 103, Col 5 - Style/MultilineIfModifier - Favor a normal unless-statement over a modifier clause in a multiline statement.
❗ - Line 103, Col 5 - Style/SafeNavigation - Use safe navigation (&.) instead of checking if an object exists before calling the method.

Move register_worker from heartbeat concern to dequeue

ef2d304

Now that heartbeating is done through a file uncondidionally this method to add a worker to the drb workers list is only applicable to dequeue

carbonin added bug core labels Feb 11, 2020

carbonin assigned jrafanie Feb 11, 2020

carbonin requested a review from agrare February 11, 2020 22:31

jrafanie reviewed Feb 11, 2020

View reviewed changes

jrafanie requested changes Feb 11, 2020

View reviewed changes

jrafanie reviewed Feb 17, 2020

View reviewed changes

spec/models/miq_queue_worker_base/runner_spec.rb Outdated

end

end

end

Copy link

Member

jrafanie Feb 17, 2020

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

👾 🔫

carbonin force-pushed the fix_run_single_worker branch from 32bbc51 to 3928c43 Compare February 17, 2020 15:30

jrafanie approved these changes Feb 17, 2020

View reviewed changes

jrafanie merged commit d8d9555 into ManageIQ:master Feb 17, 2020

jrafanie added this to the Sprint 130 Ending Feb 17, 2020 milestone Feb 17, 2020

carbonin deleted the fix_run_single_worker branch April 23, 2020 15:36

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Only register queue workers using drb for dequeue #19829

Only register queue workers using drb for dequeue #19829

carbonin commented Feb 11, 2020

jrafanie Feb 11, 2020

jrafanie Feb 11, 2020

carbonin Feb 12, 2020

jrafanie Feb 12, 2020

jrafanie Feb 12, 2020

carbonin Feb 17, 2020

carbonin Feb 17, 2020

jrafanie Feb 17, 2020

agrare Feb 17, 2020

jrafanie Feb 11, 2020

jrafanie left a comment

jrafanie Feb 17, 2020

miq-bot commented Feb 17, 2020

Only register queue workers using drb for dequeue #19829

Only register queue workers using drb for dequeue #19829

Conversation

carbonin commented Feb 11, 2020

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

jrafanie left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

miq-bot commented Feb 17, 2020