Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Mark destructure gradient test as broken #1797

Merged
merged 1 commit into from
Dec 7, 2021
Merged

Conversation

ToucheSir
Copy link
Member

Temporary measure to unblock CI while we investigate the root cause. I think @test_broken is better than increasing the tolerance, as the latter makes it easier to forget about this one entirely.

Temporary measure to unblock CI while we investigate the root cause. I think `@test_broken` is better than increasing the tolerance, as the latter makes it easier to forget about this one entirely.
@github-actions
Copy link
Contributor

github-actions bot commented Dec 2, 2021

Once the build has completed, you can preview any updated documentation at this URL: https://fluxml.ai/Flux.jl/previews/PR1797/ in ~20 minutes

@DhairyaLGandhi
Copy link
Member

Happy to merge this for now. can we let go of the pr comment bot? It seems extraneous to most PRs and the links are always generated, so we aren't missing out.

@ToucheSir
Copy link
Member Author

@logankilpatrick is there a way to have the bot only trigger on markdown file, docstring changes and/or changes in docs/? That or have a manual trigger, though I think watching the docs folder should be conservative enough.

Copy link
Member

@DhairyaLGandhi DhairyaLGandhi left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

bors r+

bors bot added a commit that referenced this pull request Dec 2, 2021
1797: Mark destructure gradient test as broken r=DhairyaLGandhi a=ToucheSir

Temporary measure to unblock CI while we investigate the root cause. I think ``@test_broken`` is better than increasing the tolerance, as the latter makes it easier to forget about this one entirely.


Co-authored-by: Brian Chen <[email protected]>
@bors
Copy link
Contributor

bors bot commented Dec 2, 2021

Build failed:

@ToucheSir
Copy link
Member Author

Do we have a flaky test here? https://buildkite.com/julialang/flux-dot-jl/builds/1893#efc52f50-bab7-4968-a07c-e223aa23f13f/402-581 failed on the bors run but not on the commit itself.

@ToucheSir
Copy link
Member Author

bors r+

bors bot added a commit that referenced this pull request Dec 2, 2021
1797: Mark destructure gradient test as broken r=ToucheSir a=ToucheSir

Temporary measure to unblock CI while we investigate the root cause. I think ``@test_broken`` is better than increasing the tolerance, as the latter makes it easier to forget about this one entirely.


Co-authored-by: Brian Chen <[email protected]>
@bors
Copy link
Contributor

bors bot commented Dec 2, 2021

Build failed:

@ToucheSir
Copy link
Member Author

https://github.com/FluxML/Flux.jl/blob/master/test/cuda/layers.jl#L48 behaves differently on the PR branch vs staging, for some reason?! 😕

@DhairyaLGandhi
Copy link
Member

DhairyaLGandhi commented Dec 2, 2021

That test was added recently and overall seems to be reasonable. (Grouped)Conv layer tests weren't flaky earlier, although we need some cleanups both here and in nnlib. Although we shouldn't have a special case in there if we can avoid it.

@DhairyaLGandhi
Copy link
Member

Wondering if you could do a run with Julia 1.6 as a sanity check

@ToucheSir
Copy link
Member Author

I don't have a machine with 1.6 and a free GPU right now, but I can try over the weekend if someone doesn't get to this first.

@DhairyaLGandhi
Copy link
Member

DhairyaLGandhi commented Dec 2, 2021

I was thinking of upper bounding gha ci to 1.6 and doing a try with bors

@ToucheSir
Copy link
Member Author

ToucheSir commented Dec 2, 2021

Do we have GPU instances on GHA? AFAICT the non-CUDA grad tests have been rock stable.

@DhairyaLGandhi
Copy link
Member

Buildkite* sorry my bad.

@CarloLucibello
Copy link
Member

@ToucheSir it's ok to mark it as broken, but we should also add a test with a higher tolerance (if it passes) so that at least we are aware of further regressions

@CarloLucibello
Copy link
Member

let's merge this as quickly as possibly without bors if needed, having broken CI is pretty bad

@DhairyaLGandhi
Copy link
Member

The issue is not about a passing test with higher tolerance, since the code path should not be inducing any more error. I think it's a good idea to have a placeholder which can detect any other breakage.

@DhairyaLGandhi
Copy link
Member

bors try

bors bot added a commit that referenced this pull request Dec 6, 2021
@bors
Copy link
Contributor

bors bot commented Dec 6, 2021

try

Build succeeded:

@DhairyaLGandhi
Copy link
Member

bors r+

bors bot added a commit that referenced this pull request Dec 6, 2021
1797: Mark destructure gradient test as broken r=DhairyaLGandhi a=ToucheSir

Temporary measure to unblock CI while we investigate the root cause. I think ``@test_broken`` is better than increasing the tolerance, as the latter makes it easier to forget about this one entirely.


Co-authored-by: Brian Chen <[email protected]>
Copy link
Member

@DhairyaLGandhi DhairyaLGandhi left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

bors r+

bors bot added a commit that referenced this pull request Dec 6, 2021
1797: Mark destructure gradient test as broken r=DhairyaLGandhi a=ToucheSir

Temporary measure to unblock CI while we investigate the root cause. I think ``@test_broken`` is better than increasing the tolerance, as the latter makes it easier to forget about this one entirely.


Co-authored-by: Brian Chen <[email protected]>
@bors
Copy link
Contributor

bors bot commented Dec 6, 2021

Build failed:

@ToucheSir ToucheSir mentioned this pull request Dec 6, 2021
4 tasks
@DhairyaLGandhi
Copy link
Member

bors r+

@bors
Copy link
Contributor

bors bot commented Dec 7, 2021

Build succeeded:

@bors bors bot merged commit bffb541 into master Dec 7, 2021
@ToucheSir
Copy link
Member Author

ToucheSir commented Dec 7, 2021

@DhairyaLGandhi thanks! Do you know what (if anything) changed on CI? Edit: this is strange, master is still not happy :(

@ToucheSir ToucheSir deleted the bc/test-broken-destructure branch December 7, 2021 18:32
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants