Dict construction calls generator side effects twice on failure #33147

BrantCarlson · 2019-09-03T18:24:28Z

Greetings...

This is not a major issue, but potentially reflects some underlying bug and is certainly confusing: when creating a Dict with a generator expression and an exception occurs, for some reason the generator restarts until it hits the exception a second time. Once it hits the exception a second time, it stops, thankfully. For example (using @show to track execution) Dict(x => sqrt(@show x) for x=[1,2,3,4,-5,6]) does everything (but the 6) twice. I would have expected it to immediately error out, but...

julia> Dict(x => sqrt(@show x) for x=[1,2,3,4,-5,6])
x = 1
x = 2
x = 3
x = 4
x = -5
x = 1
x = 2
x = 3
x = 4
x = -5
ERROR: DomainError with -5.0:
sqrt will only return a complex result if called with a complex argument. Try sqrt(Complex(x)).
Stacktrace:
 [1] throw_complex_domainerror(::Symbol, ::Float64) at ./math.jl:31
 [2] sqrt at ./math.jl:493 [inlined]
 [3] sqrt at ./math.jl:519 [inlined]
 [4] #21 at ./show.jl:576 [inlined]
 [5] iterate at ./generator.jl:47 [inlined]
 [6] _all(::getfield(Base, Symbol("##238#240")), ::Base.Generator{Array{Int64,1},getfield(Main, Symbol("##21#22"))}, ::Colon) at ./reduce.jl:744
 [7] all at ./reduce.jl:732 [inlined]
 [8] Dict(::Base.Generator{Array{Int64,1},getfield(Main, Symbol("##21#22"))}) at ./dict.jl:130
 [9] top-level scope at REPL[7]:1
caused by [exception 1]
DomainError with -5.0:
sqrt will only return a complex result if called with a complex argument. Try sqrt(Complex(x)).
Stacktrace:
 [1] throw_complex_domainerror(::Symbol, ::Float64) at ./math.jl:31
 [2] sqrt at ./math.jl:493 [inlined]
 [3] sqrt at ./math.jl:519 [inlined]
 [4] #21 at ./show.jl:576 [inlined]
 [5] iterate at ./generator.jl:47 [inlined]
 [6] Dict{Int64,Float64}(::Base.Generator{Array{Int64,1},getfield(Main, Symbol("##21#22"))}) at ./dict.jl:103
 [7] dict_with_eltype at ./abstractdict.jl:538 [inlined]
 [8] dict_with_eltype at ./abstractdict.jl:545 [inlined]
 [9] Dict(::Base.Generator{Array{Int64,1},getfield(Main, Symbol("##21#22"))}) at ./dict.jl:128
 [10] top-level scope at REPL[7]:1

In my case an exception occurred near the end of a calculation that should have taken 30 minutes. I started getting suspicious when it was still running after 40 minutes, even more so when I noticed it was re-doing parts of the job that should have been finished earlier. Granted I of course should have written code that didn't crash, but still. :)

Many thanks for a great programming environment...

Version info, if that helps. (The same issue also occurs on 1.3.0-rc1.0)

julia> versioninfo()
Julia Version 1.2.0
Commit c6da87ff4b (2019-08-20 00:03 UTC)
Platform Info:
  OS: Linux (x86_64-pc-linux-gnu)
  CPU: Intel(R) Core(TM) i7-7600U CPU @ 2.80GHz
  WORD_SIZE: 64
  LIBM: libopenlibm
  LLVM: libLLVM-6.0.1 (ORCJIT, skylake)

The text was updated successfully, but these errors were encountered:

c42f · 2019-09-04T11:06:20Z

This seems to be an unintended consequence of the error handling introduced in #12453: namely that if the generator has side effects, those side effects occur twice (once in the initial construction, and once in the check all(x->isa(x,Union{Tuple,Pair}),kv) for generating the error message.

c42f · 2019-09-04T11:07:53Z

Also, thanks for the bug report!

vtjnash · 2019-09-04T16:41:11Z

Maybe just drop that code now for #12451? I think our default error reporting now may provide enough context from showing the argument types and bounds error information that this won't be needed anymore.

c42f · 2019-09-04T20:47:55Z

I tried that out, but the resulting errors were a lot less obvious. Dict is such a commonly used basic type that it seems worth going to a little effort to preserve a more friendly error.

Maybe a heuristic like

if !isiterable(typeof(kv)) || (eltype(kv) !== Any && !isa(eltype(kv), Union{Tuple,Pair}))

would serve fine — we don't need to be precise here, only offer a more helpful error "when possible"

omus · 2021-04-09T16:08:30Z

On Julia 1.6 the "caused by" statement appears on the same line as the redundant error message resulting in this misleading statement:

caused by: DomainError with -5.0:

Edit: This was just an error formatting change which I was unfamiliar with. I had only seen the caused by with this duplicate error issue

Fixes: #33147

lmshk · 2022-09-25T18:02:31Z

I just ran into this issue; it confused me for a while, especially since side-effects of the generator were caused twice before the double error was printed. I am not sure what guarantees Julia normally makes regarding iteration restarts, but this behavior is especially confusing with once-iterables like channels:

f(xs) = Channel() do c
    push!.(Ref(c), xs)
end

Dict(x => @show log(x) for x in f([1, 2]))  # ok
Dict(x => @show log(x) for x in f([1, -1, 2]))  # DomainError with -1.0
Dict(x => @show log(x) for x in f([1, -1, 2, -2]))  # DomainError with -2.0, caused by: DomainError with -1.0
Dict(x => @show log(x) for x in f([1, -1, 2, -2, 3]))  # same, but 3 is left in channel

The solution proposed in #40445 does not fix this.

I think I would prefer the BoundsError: attempt to access [...] at index [2] to the current behavior, especially since the actual error (if the generator does not itself throw, i.e. ArgumentError: Dict(kv): kv needs to be an iterator of tuples or pairs) is inaccurate too: kv apparently just needs to be indexable at 1 and 2, and trailing elements seem to be ignored:

julia> Dict([[1, 2, 3]])

Dict{Int64, Int64} with 1 entry:
  1 => 2

Alternatively, maybe the code that fails to index could wrap the BoundsError right there? (It looks like that would be here and here.)

Should I try a PR or am I missing something?

Fixes: #33147 Replaces/Closes: #40445 Co-authored-by: Jameson Nash <[email protected]>

Fixes: #33147 Replaces/Closes: #40445 The difference here, compared to past implementations, is that we use the zero-cost `isiterable` check on every intermediate step, instead of wrapping the call in a try/catch and then trying to re-approximate the `isiterable` afterwards. Some samples: ```julia julia> Dict(i for i in 1:3) ERROR: ArgumentError: AbstractDict(kv): kv needs to be an iterator of 2-tuples or pairs Stacktrace: [1] _throw_dict_kv_error() @ Base ./dict.jl:118 [2] grow_to! @ ./dict.jl:132 [inlined] [3] dict_with_eltype @ ./abstractdict.jl:592 [inlined] [4] Dict(kv::Base.Generator{UnitRange{Int64}, typeof(identity)}) @ Base ./dict.jl:120 [5] top-level scope @ REPL[1]:1 julia> Dict(i => error("$i") for i in 1:3) ERROR: 1 Stacktrace: [1] error(s::String) @ Base ./error.jl:35 [2] (::var"#3#4")(i::Int64) @ Main ./none:0 [3] iterate @ ./generator.jl:48 [inlined] [4] grow_to! @ ./dict.jl:124 [inlined] [5] dict_with_eltype @ ./abstractdict.jl:592 [inlined] [6] Dict(kv::Base.Generator{UnitRange{Int64}, var"#3#4"}) @ Base ./dict.jl:120 [7] top-level scope @ REPL[2]:1 ``` The other unrelated change here is that `dest = empty(dest, typeof(k), typeof(v))` is made conditional, so we do not unconditionally construct an empty Dict in order to discard it and allocate an exact duplicate of it, but only do so if inference wasn't precise originally. Co-authored-by: Curtis Vogt <[email protected]>

c42f added the bug Indicates an unexpected problem or unintended behavior label Sep 4, 2019

c42f changed the title ~~Dict creation via generator restarts if exception occurs~~ Dict construction calls generator side effects twice on failure Sep 4, 2019

omus added a commit that referenced this issue Apr 12, 2021

Fix duplicate error when using generator in Dict

60d1660

Fixes: #33147

omus mentioned this issue Apr 12, 2021

Fix duplicate error when using bad generator in Dict #40445

Closed

vtjnash added a commit that referenced this issue Feb 1, 2024

Fix duplicate error when using generator in Dict

49f08cd

Fixes: #33147 Replaces/Closes: #40445 Co-authored-by: Jameson Nash <[email protected]>

vtjnash added a commit that referenced this issue Feb 1, 2024

Fix duplicate error when using generator in Dict

b7b4d0a

Fixes: #33147 Replaces/Closes: #40445 Co-authored-by: Jameson Nash <[email protected]>

vtjnash mentioned this issue Feb 1, 2024

Fix duplicate error when using generator in Dict #53151

Merged

vtjnash closed this as completed in #53151 Feb 10, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Dict construction calls generator side effects twice on failure #33147

Dict construction calls generator side effects twice on failure #33147

BrantCarlson commented Sep 3, 2019

c42f commented Sep 4, 2019 •

edited

Loading

c42f commented Sep 4, 2019

vtjnash commented Sep 4, 2019

c42f commented Sep 4, 2019

omus commented Apr 9, 2021 •

edited

Loading

lmshk commented Sep 25, 2022

Dict construction calls generator side effects twice on failure #33147

Dict construction calls generator side effects twice on failure #33147

Comments

BrantCarlson commented Sep 3, 2019

c42f commented Sep 4, 2019 • edited Loading

c42f commented Sep 4, 2019

vtjnash commented Sep 4, 2019

c42f commented Sep 4, 2019

omus commented Apr 9, 2021 • edited Loading

lmshk commented Sep 25, 2022

c42f commented Sep 4, 2019 •

edited

Loading

omus commented Apr 9, 2021 •

edited

Loading