
Slow interaction with DataLoader #141

Closed
casper2002casper opened this issue Mar 2, 2022 · 5 comments · Fixed by #143

Comments

@casper2002casper

casper2002casper commented Mar 2, 2022

When using Flux.DataLoader, loading graph batches is slower than expected, which slows everything down considerably on large training sets.
Here is a comparison against vectors holding the same amount of data as each graph:

using Flux
using GraphNeuralNetworks

function test(g)
    loader = Flux.DataLoader(g, batchsize = 100, shuffle=true)
    for a in loader
        print("+")
    end
end
n = 5000
s = 10
x1 = Flux.batch([rand_graph(s, s, ndata = rand(1,s)) for i in 1:n]) 
x2 = Flux.batch([rand(s + s + s + s) for i in 1:n]) #source+target+data+extra
@time test(x1)
@time test(x2)
++++++++++++++++++++++++++++++++++++++++++++++++++  1.388830 seconds (19.59 k allocations: 7.065 MiB, 1.94% compilation time)
++++++++++++++++++++++++++++++++++++++++++++++++++  0.028044 seconds (17.17 k allocations: 2.501 MiB, 94.90% compilation time)
@CarloLucibello
Member

CarloLucibello commented Mar 5, 2022

I don't see such a large discrepancy. Maybe you are measuring compilation time as well?
Edit: sorry I confused microseconds and milliseconds, there is a large discrepancy actually.

using Flux
using GraphNeuralNetworks
using BenchmarkTools

f(x) = 1

function test(g)
    loader = Flux.DataLoader(g, batchsize=100, shuffle=true)
    s = 0 
    for d in loader
        s += f(d)
    end
    return s
end

n = 5000
s = 10
x1 = Flux.batch([rand_graph(s, s, ndata = rand(1, s)) for i in 1:n]) 
x2 = Flux.batch([rand(s + s + s + s) for i in 1:n]) #source+target+data+extra
@btime test(x1); #  1.296 s (2502 allocations: 6.17 MiB)
@btime test(x2); #  400.778 μs (152 allocations: 1.61 MiB)

Or maybe in your code the dataloader iterations are optimized away since they aren't used?

@CarloLucibello
Member

CarloLucibello commented Mar 5, 2022

@profview test(x1) shows that most time is spent on this line in getgraph. Unfortunately, I don't know of a better way than edge_mask = s .∈ Ref(nodes) to create the edge mask.

A way around this is to store in the graph another vector of length num_edges containing the graph membership of each edge.
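
A minimal sketch of that idea (hypothetical variable names, not the actual `getgraph` implementation): with a per-edge graph-membership vector stored in the batched graph, the edge mask becomes a vectorized table lookup instead of a set-membership test:

```julia
# Sketch of the proposed workaround (hypothetical names, not the library's
# code). Instead of testing every edge source against a node set with
# `s .∈ Ref(nodes)`, store a vector `edge_graph` of length num_edges that
# records which graph in the batch each edge belongs to.

edge_graph = [1, 1, 2, 2, 2, 3]   # graph id of each edge (length num_edges)
graph_ids  = [2]                  # graphs to extract from the batch

# Build the mask with one Boolean lookup table: O(num_edges), no `in` tests.
keep = falses(maximum(edge_graph))
keep[graph_ids] .= true
edge_mask = keep[edge_graph]      # Bool mask of length num_edges
```

With `edge_mask` in hand, the subgraph's source and target vectors can be selected with plain indexing (`s[edge_mask]`, `t[edge_mask]`), which is what would make this scheme cheap.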

@CarloLucibello
Member

CarloLucibello commented Mar 5, 2022

Thanks to #143, the recommended way to interact with the DataLoader is now

data = [rand_graph(10, 20, ndata=rand(Float32, 2, 10)) for _ in 1:1000]
train_loader = DataLoader(data, batchsize=10)

for g in train_loader
  # ...
end 

@casper2002casper is this fast enough for your usecase?

@casper2002casper
Author

Thank you! I will let you know as soon as I have access to my PC again in a couple of days.

@casper2002casper
Author

This has sped up my training by a factor of 20, very much appreciated!
