🔥🔥🔥 "Libraries Test Run checked coreclr Linux" timing out on all PRs #45061

jkotas · 2020-11-21T22:57:07Z

Dotnet-GitSync-Bot · 2020-11-21T22:57:09Z

I couldn't figure out the best area label to add to this issue. If you have write-permissions please help me learn by adding exactly one area label.

ghost · 2020-11-21T22:57:40Z

Tagging subscribers to this area: @ViktorHofer
See info in area-owners.md if you want to be subscribed.

Issue Details

Author:	jkotas
Assignees:	-
Labels:	`area-Infrastructure`, `untriaged`
Milestone:	-

ViktorHofer · 2020-11-22T21:54:07Z

@safern can you please take a look at that one?

ViktorHofer · 2020-11-23T16:33:09Z

Still happening in PRs, e.g. #45108.

ericstj · 2020-11-23T16:53:11Z

cc @aik-jahoda
Does anyone know if this is related to a code change, or do we think this is due to infrastructure (e.g., reduced agents in machine pools)?

wfurt · 2020-11-23T17:26:00Z

I was searching through Kusto and I don't even see a console link, so this may be infrastructure. All the cases I looked at are containers.

SteveMCarroll · 2020-11-23T18:05:04Z

@jkotas let me know about this and I'm catching up.
My amateur sleuthing suggests this is not likely a repo-level issue.
Has this been reported to First Responders? Please cc me on this one.

safern · 2020-11-23T18:08:37Z

> @safern can you please take a look at that one?

Just catching up on this. I can help to look at data and follow up with FR if there is no thread already.

stephentoub · 2020-11-23T18:09:12Z

> Has this been reported to First Responders? Please cc me on this one.

It was here:
#44980 (comment)

safern · 2020-11-23T18:10:03Z

Thanks @stephentoub. I'm taking over to drive closure on this one.

safern · 2020-11-23T18:26:45Z

I just looked at data for some jobs that were linked here, and it looks like that queue was either clogged or had a hiccup. The workitems are running fine, taking less than 1 minute once they get a machine, but the average wait time on the queue was 11 hours 😮
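
The distinction drawn here, short run time but long queue wait, can be illustrated with a small sketch. The records below are invented for illustration and are not Helix data; the tuple layout `(queued, started, finished)` is an assumption, not the shape of any real Kusto table or API response.

```python
from datetime import datetime, timedelta

# Hypothetical workitem records: (queued, started, finished).
# These timestamps are made up; they only mimic the pattern described above.
work_items = [
    (datetime(2020, 11, 22, 1, 0), datetime(2020, 11, 22, 12, 0), datetime(2020, 11, 22, 12, 0, 45)),
    (datetime(2020, 11, 22, 2, 0), datetime(2020, 11, 22, 13, 0), datetime(2020, 11, 22, 13, 0, 30)),
]

def average(deltas):
    """Mean of an iterable of timedelta values."""
    deltas = list(deltas)
    return sum(deltas, timedelta()) / len(deltas)

# Wait time is measured from queueing to start; run time from start to finish.
avg_wait = average(started - queued for queued, started, _ in work_items)
avg_run = average(finished - started for _, started, finished in work_items)

print(f"average wait: {avg_wait}, average run: {avg_run}")
```

With data shaped like this, a healthy run time alongside a wait time measured in hours points at queue capacity rather than at the tests themselves.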

safern · 2020-11-23T19:32:26Z

The core-eng issues are: https://github.com/dotnet/core-eng/issues/11485 and https://github.com/dotnet/core-eng/issues/11468. @dotnet/dnceng says it is fixed.

I'm going to leave this open to see if we get more instances of this before EOD; if not, I will close it.

jkotas · 2020-11-24T07:34:11Z

Still happening: #45137

safern · 2020-11-24T07:40:00Z

OK, I was just about to close this because the data suggested it was no longer happening, but I found out that the jobs where it happened are not showing up in Kusto. So I looked into Swagger, and the example just posted shows all workitems as "waiting": https://helix.dot.net/api/jobs/48940d46-78a5-4bab-be97-a0f38db8c27a/details?api-version=2019-06-17

I pinged the FR thread and the issue on: https://github.com/dotnet/core-eng/issues/11468#issuecomment-732715076

Thanks for reporting the new instance!

safern · 2020-11-24T16:29:03Z

Update: the queue should be back at capacity. We're killing all jobs that started more than 2 hours ago to ease the queue.

Current issue to investigate why machines are suddenly going offline is here: https://github.com/dotnet/core-eng/issues/11503
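
As a rough sketch of the "started more than 2 hours ago" rule mentioned above: the job names, timestamps, and the `stale_jobs` helper are all hypothetical, and this is not how the Helix queue is actually drained.

```python
from datetime import datetime, timedelta, timezone

# Cutoff taken from the comment above: jobs older than 2 hours are selected.
MAX_AGE = timedelta(hours=2)

def stale_jobs(jobs, now):
    """Return the names of jobs whose start time is more than MAX_AGE before now."""
    return [name for name, started in jobs.items() if now - started > MAX_AGE]

# Hypothetical jobs, keyed by an invented name, with invented start times.
now = datetime(2020, 11, 24, 16, 30, tzinfo=timezone.utc)
jobs = {
    "job-a": datetime(2020, 11, 24, 9, 0, tzinfo=timezone.utc),    # 7.5 hours old
    "job-b": datetime(2020, 11, 24, 15, 45, tzinfo=timezone.utc),  # 45 minutes old
}

print(stale_jobs(jobs, now))
```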

tarekgh · 2020-11-24T19:24:42Z

Still happening in #45079. I reran the timed-out test a couple of times without any luck.

safern · 2020-11-26T00:01:10Z

I haven't seen this anymore. I looked at the queue health and it is pretty healthy with average wait times of 15 mins since yesterday. Please re-open if you do see this happen again.

Dotnet-GitSync-Bot added the `untriaged` label (New issue has not been triaged by the area owner) Nov 21, 2020

jkotas added the area-Infrastructure label Nov 21, 2020

jkotas added the `blocking-clean-ci` label (Blocking PR or rolling runs of 'runtime' or 'runtime-extra-platforms') Nov 21, 2020

jkotas mentioned this issue Nov 21, 2020

Handle non-ASCII strings in GetNonRandomizedHashCodeOrdinalIgnoreCase #44688

Merged

runfoapp bot mentioned this issue Nov 23, 2020

Infrastructure - Status/Health #702

Closed

ericstj changed the title ~~"Libraries Test Run checked coreclr Linux" timing out on all PRs~~ 🔥🔥🔥 "Libraries Test Run checked coreclr Linux" timing out on all PRs Nov 23, 2020

safern self-assigned this Nov 23, 2020

wfurt mentioned this issue Nov 23, 2020

NetworkInterface.Linux: take into account physical link status for OperationalStatus and GetIsNetworkAvailable #44867

Merged

safern closed this as completed Nov 26, 2020

ghost locked as resolved and limited conversation to collaborators Dec 26, 2020
