This repository has been archived by the owner on Apr 26, 2024. It is now read-only.
events get soft-failed when the federation_inbound worker is busy #7744
Labels
A-Federation
A-Soft-Failure
O-Uncommon
Most users are unlikely to come across this or unexpected workflow
S-Major
Major functionality / product severely impaired, no satisfactory workaround.
T-Defect
Bugs, crashes, hangs, security vulnerabilities, or other reported issues.
Comments
Inspection of the state_groups around that event shows clearly that the sender of the event was in the room. It's not even a particularly complex DAG, so it kinda has to be a transient problem with "current_state". Possibly there was a delay in invalidating the current_state cache on the worker processing the inbound events? The next step might be to dig up the logs from when those events arrived over federation, to see if there is anything funky.
FWIW I think we keep track of the forward extremities at a given stream position in
so we have, on the master:
and then, on federation_inbound:
so indeed it looks like a cache isn't being correctly flushed.
see also: #7444
also see also: #7669
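To make the suspected failure mode concrete, here is a minimal toy model of it. This is not Synapse code: `Master`, `InboundWorker`, `process_replication` and `is_soft_failed` are all hypothetical names invented for illustration. The sketch assumes that the worker caches a copy of the room's current membership, that the cache is only dropped when the worker gets round to processing an invalidation from the master, and that an event is soft-failed when its sender is missing from the (cached) current state.

```python
from dataclasses import dataclass, field
from typing import List, Optional, Set


@dataclass
class Master:
    """Hypothetical stand-in for the process that persists events."""

    current_members: Set[str] = field(default_factory=set)
    stream_position: int = 0
    pending_invalidations: List[int] = field(default_factory=list)

    def persist_join(self, user_id: str) -> None:
        self.current_members.add(user_id)
        self.stream_position += 1
        # Queue an invalidation that workers must process to see the join.
        self.pending_invalidations.append(self.stream_position)


@dataclass
class InboundWorker:
    """Hypothetical stand-in for a busy worker with a current-state cache."""

    master: Master
    cached_members: Optional[Set[str]] = None
    replication_position: int = 0

    def get_current_members(self) -> Set[str]:
        # Only read from the master when nothing is cached.
        if self.cached_members is None:
            self.cached_members = set(self.master.current_members)
        return self.cached_members

    def process_replication(self) -> None:
        # A busy worker may run this late, leaving the cache stale meanwhile.
        for position in self.master.pending_invalidations:
            if position > self.replication_position:
                self.cached_members = None
                self.replication_position = position

    def is_soft_failed(self, sender: str) -> bool:
        # Simplified check: the sender must be in the current state.
        return sender not in self.get_current_members()


master = Master()
worker = InboundWorker(master)
worker.get_current_members()  # cache is populated before the join

master.persist_join("@alice:example.org")

# The worker has not processed the invalidation yet, so it uses stale state.
stale = worker.is_soft_failed("@alice:example.org")

worker.process_replication()
# After catching up, the same check uses fresh state.
fresh = worker.is_soft_failed("@alice:example.org")
```

In this model `stale` is `True` and `fresh` is `False`: the same event is soft-failed or accepted depending only on whether the worker had caught up with invalidations when it arrived, which would be consistent with the problem being transient and showing up when the worker is busy.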
richvdh
changed the title
events mysteriously soft-failed
events get soft-failed when the federation_inbound worker is busy
Jul 1, 2020
MadLittleMods
added
T-Defect
Bugs, crashes, hangs, security vulnerabilities, or other reported issues.
A-Federation
labels
Jul 8, 2021
DMRobertson
added
S-Major
Major functionality / product severely impaired, no satisfactory workaround.
O-Uncommon
Most users are unlikely to come across this or unexpected workflow
A-Soft-Failure
and removed
z-p1
z-bug
(Deprecated Label)
labels
Sep 6, 2022
Event IDs $44J0EFE_wD30pBA2EZ2hu_viEDWZ55CFZCJEliLoQOY, $d6_JvuU1AdlqnwBEK2shSB30_qFWIiZPlg526HJJuQQ and $FnodZIXcp1YSMx1RTvfp1RZIp51VgU32JCQaVmEv8Z8 were all mysteriously soft-failed on matrix.org, presumably because synapse thought that the sender wasn't in the room - but she had joined a clear 3 minutes earlier.