-
Notifications
You must be signed in to change notification settings - Fork 63
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Merge pull request #591 from LuxDL/ap/fused_ops
Used New Fused Ops from LuxLib
- Loading branch information
Showing
7 changed files
with
39 additions
and
63 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
51f2968
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@JuliaRegistrator register
51f2968
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Error while trying to register: Version 0.5.39 already exists
51f2968
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Benchmark Results
Dense(2 => 2)/cpu/reverse/ReverseDiff (compiled)/(2, 128)
3861
ns3674.375
ns1.05
Dense(2 => 2)/cpu/reverse/Zygote/(2, 128)
7130
ns5854.25
ns1.22
Dense(2 => 2)/cpu/reverse/Tracker/(2, 128)
20018
ns15508
ns1.29
Dense(2 => 2)/cpu/reverse/ReverseDiff/(2, 128)
9644
ns9975.333333333334
ns0.97
Dense(2 => 2)/cpu/reverse/Flux/(2, 128)
8847.5
ns8696
ns1.02
Dense(2 => 2)/cpu/reverse/SimpleChains/(2, 128)
4458.25
ns4494.625
ns0.99
Dense(2 => 2)/cpu/forward/NamedTuple/(2, 128)
1121.198717948718
ns2060.9
ns0.54
Dense(2 => 2)/cpu/forward/ComponentArray/(2, 128)
1186.6060606060605
ns1664.8521126760563
ns0.71
Dense(2 => 2)/cpu/forward/Flux/(2, 128)
1793.1509433962265
ns1815.6923076923076
ns0.99
Dense(2 => 2)/cpu/forward/SimpleChains/(2, 128)
179.78284923928078
ns179.37413073713492
ns1.00
Dense(20 => 20)/cpu/reverse/ReverseDiff (compiled)/(20, 128)
17293
ns17743
ns0.97
Dense(20 => 20)/cpu/reverse/Zygote/(20, 128)
17333
ns18735
ns0.93
Dense(20 => 20)/cpu/reverse/Tracker/(20, 128)
36698
ns35667
ns1.03
Dense(20 => 20)/cpu/reverse/ReverseDiff/(20, 128)
28303
ns28753
ns0.98
Dense(20 => 20)/cpu/reverse/Flux/(20, 128)
19607
ns19787
ns0.99
Dense(20 => 20)/cpu/reverse/SimpleChains/(20, 128)
16842
ns17562.5
ns0.96
Dense(20 => 20)/cpu/forward/NamedTuple/(20, 128)
3847.125
ns4920.571428571428
ns0.78
Dense(20 => 20)/cpu/forward/ComponentArray/(20, 128)
3942.375
ns5003.571428571428
ns0.79
Dense(20 => 20)/cpu/forward/Flux/(20, 128)
4867.714285714285
ns5028
ns0.97
Dense(20 => 20)/cpu/forward/SimpleChains/(20, 128)
1653.1
ns1651.1
ns1.00
Conv((3, 3), 3 => 3)/cpu/reverse/ReverseDiff (compiled)/(64, 64, 3, 128)
49892211
ns48926002
ns1.02
Conv((3, 3), 3 => 3)/cpu/reverse/Zygote/(64, 64, 3, 128)
57726460.5
ns108271301
ns0.53
Conv((3, 3), 3 => 3)/cpu/reverse/Tracker/(64, 64, 3, 128)
99509688.5
ns84036071.5
ns1.18
Conv((3, 3), 3 => 3)/cpu/reverse/ReverseDiff/(64, 64, 3, 128)
106130667
ns107192834
ns0.99
Conv((3, 3), 3 => 3)/cpu/reverse/Flux/(64, 64, 3, 128)
105713647.5
ns106869664
ns0.99
Conv((3, 3), 3 => 3)/cpu/reverse/SimpleChains/(64, 64, 3, 128)
11648881
ns11898560
ns0.98
Conv((3, 3), 3 => 3)/cpu/forward/NamedTuple/(64, 64, 3, 128)
7036951.5
ns18820810.5
ns0.37
Conv((3, 3), 3 => 3)/cpu/forward/ComponentArray/(64, 64, 3, 128)
7057975
ns18550564.5
ns0.38
Conv((3, 3), 3 => 3)/cpu/forward/Flux/(64, 64, 3, 128)
18159101
ns18693425
ns0.97
Conv((3, 3), 3 => 3)/cpu/forward/SimpleChains/(64, 64, 3, 128)
6374599.5
ns6446973
ns0.99
vgg16/cpu/reverse/Zygote/(32, 32, 3, 1)
117682544
ns106088743.5
ns1.11
vgg16/cpu/reverse/Zygote/(32, 32, 3, 16)
730992081
ns832416622
ns0.88
vgg16/cpu/reverse/Zygote/(32, 32, 3, 64)
2850285111
ns2984233767
ns0.96
vgg16/cpu/reverse/Tracker/(32, 32, 3, 1)
164703848
ns146290469
ns1.13
vgg16/cpu/reverse/Tracker/(32, 32, 3, 16)
892438806
ns1085323519.5
ns0.82
vgg16/cpu/reverse/Tracker/(32, 32, 3, 64)
2801258575
ns3036724601
ns0.92
vgg16/cpu/reverse/Flux/(32, 32, 3, 1)
81992561
ns90590491
ns0.91
vgg16/cpu/reverse/Flux/(32, 32, 3, 16)
851035463
ns733826110
ns1.16
vgg16/cpu/reverse/Flux/(32, 32, 3, 64)
3068020617
ns3075391905
ns1.00
vgg16/cpu/forward/NamedTuple/(32, 32, 3, 1)
29174554.5
ns29790496
ns0.98
vgg16/cpu/forward/NamedTuple/(32, 32, 3, 16)
180762225
ns212277593
ns0.85
vgg16/cpu/forward/NamedTuple/(32, 32, 3, 64)
682516468
ns781425925
ns0.87
vgg16/cpu/forward/ComponentArray/(32, 32, 3, 1)
28838898
ns30383532
ns0.95
vgg16/cpu/forward/ComponentArray/(32, 32, 3, 16)
173516198
ns197512172
ns0.88
vgg16/cpu/forward/ComponentArray/(32, 32, 3, 64)
673443329
ns778085435.5
ns0.87
vgg16/cpu/forward/Flux/(32, 32, 3, 1)
22791681
ns29280386
ns0.78
vgg16/cpu/forward/Flux/(32, 32, 3, 16)
185722808.5
ns188405608
ns0.99
vgg16/cpu/forward/Flux/(32, 32, 3, 64)
708367926.5
ns809432575
ns0.88
Conv((3, 3), 64 => 64)/cpu/reverse/ReverseDiff (compiled)/(64, 64, 64, 128)
1095069819
ns1147032763
ns0.95
Conv((3, 3), 64 => 64)/cpu/reverse/Zygote/(64, 64, 64, 128)
1853949272
ns1880482284
ns0.99
Conv((3, 3), 64 => 64)/cpu/reverse/Tracker/(64, 64, 64, 128)
2135692150
ns2148352018
ns0.99
Conv((3, 3), 64 => 64)/cpu/reverse/ReverseDiff/(64, 64, 64, 128)
2328333728.5
ns2539276579
ns0.92
Conv((3, 3), 64 => 64)/cpu/reverse/Flux/(64, 64, 64, 128)
1806353666.5
ns1864259381
ns0.97
Conv((3, 3), 64 => 64)/cpu/forward/NamedTuple/(64, 64, 64, 128)
316924856
ns358738282
ns0.88
Conv((3, 3), 64 => 64)/cpu/forward/ComponentArray/(64, 64, 64, 128)
319541589.5
ns405361001.5
ns0.79
Conv((3, 3), 64 => 64)/cpu/forward/Flux/(64, 64, 64, 128)
366675180
ns412981399.5
ns0.89
Conv((3, 3), 1 => 1)/cpu/reverse/ReverseDiff (compiled)/(64, 64, 1, 128)
11925685.5
ns12013680.5
ns0.99
Conv((3, 3), 1 => 1)/cpu/reverse/Zygote/(64, 64, 1, 128)
17932328
ns18334339
ns0.98
Conv((3, 3), 1 => 1)/cpu/reverse/Tracker/(64, 64, 1, 128)
19128765
ns19696096
ns0.97
Conv((3, 3), 1 => 1)/cpu/reverse/ReverseDiff/(64, 64, 1, 128)
23843499.5
ns24459171
ns0.97
Conv((3, 3), 1 => 1)/cpu/reverse/Flux/(64, 64, 1, 128)
17990776.5
ns18368123
ns0.98
Conv((3, 3), 1 => 1)/cpu/reverse/SimpleChains/(64, 64, 1, 128)
1152412.5
ns1168429
ns0.99
Conv((3, 3), 1 => 1)/cpu/forward/NamedTuple/(64, 64, 1, 128)
2037817
ns2120374.5
ns0.96
Conv((3, 3), 1 => 1)/cpu/forward/ComponentArray/(64, 64, 1, 128)
2036956
ns2133928
ns0.95
Conv((3, 3), 1 => 1)/cpu/forward/Flux/(64, 64, 1, 128)
2053085
ns2118672
ns0.97
Conv((3, 3), 1 => 1)/cpu/forward/SimpleChains/(64, 64, 1, 128)
199553
ns216398
ns0.92
Dense(200 => 200)/cpu/reverse/ReverseDiff (compiled)/(200, 128)
294264.5
ns309096.5
ns0.95
Dense(200 => 200)/cpu/reverse/Zygote/(200, 128)
265987
ns277497.5
ns0.96
Dense(200 => 200)/cpu/reverse/Tracker/(200, 128)
360333
ns374849
ns0.96
Dense(200 => 200)/cpu/reverse/ReverseDiff/(200, 128)
404816
ns418761
ns0.97
Dense(200 => 200)/cpu/reverse/Flux/(200, 128)
272669
ns279090
ns0.98
Dense(200 => 200)/cpu/reverse/SimpleChains/(200, 128)
403614
ns409864.5
ns0.98
Dense(200 => 200)/cpu/forward/NamedTuple/(200, 128)
80691
ns93404
ns0.86
Dense(200 => 200)/cpu/forward/ComponentArray/(200, 128)
81091
ns94686.5
ns0.86
Dense(200 => 200)/cpu/forward/Flux/(200, 128)
86171
ns89286
ns0.97
Dense(200 => 200)/cpu/forward/SimpleChains/(200, 128)
104385
ns104365
ns1.00
Conv((3, 3), 16 => 16)/cpu/reverse/ReverseDiff (compiled)/(64, 64, 16, 128)
193514136
ns193458437
ns1.00
Conv((3, 3), 16 => 16)/cpu/reverse/Zygote/(64, 64, 16, 128)
326751148.5
ns373472760.5
ns0.87
Conv((3, 3), 16 => 16)/cpu/reverse/Tracker/(64, 64, 16, 128)
401460640
ns404755227
ns0.99
Conv((3, 3), 16 => 16)/cpu/reverse/ReverseDiff/(64, 64, 16, 128)
431980614
ns454774364.5
ns0.95
Conv((3, 3), 16 => 16)/cpu/reverse/Flux/(64, 64, 16, 128)
370530564
ns372704909
ns0.99
Conv((3, 3), 16 => 16)/cpu/reverse/SimpleChains/(64, 64, 16, 128)
335121405
ns371496475.5
ns0.90
Conv((3, 3), 16 => 16)/cpu/forward/NamedTuple/(64, 64, 16, 128)
43967310
ns60350010
ns0.73
Conv((3, 3), 16 => 16)/cpu/forward/ComponentArray/(64, 64, 16, 128)
43943682.5
ns52074054
ns0.84
Conv((3, 3), 16 => 16)/cpu/forward/Flux/(64, 64, 16, 128)
54701594
ns51366141
ns1.06
Conv((3, 3), 16 => 16)/cpu/forward/SimpleChains/(64, 64, 16, 128)
28158813
ns28579992.5
ns0.99
Dense(2000 => 2000)/cpu/reverse/ReverseDiff (compiled)/(2000, 128)
19527366
ns20025770.5
ns0.98
Dense(2000 => 2000)/cpu/reverse/Zygote/(2000, 128)
19511676
ns19976154
ns0.98
Dense(2000 => 2000)/cpu/reverse/Tracker/(2000, 128)
23170806
ns24021852
ns0.96
Dense(2000 => 2000)/cpu/reverse/ReverseDiff/(2000, 128)
24000771
ns24619312
ns0.97
Dense(2000 => 2000)/cpu/reverse/Flux/(2000, 128)
19543440.5
ns19994222
ns0.98
Dense(2000 => 2000)/cpu/forward/NamedTuple/(2000, 128)
6503799
ns6681275.5
ns0.97
Dense(2000 => 2000)/cpu/forward/ComponentArray/(2000, 128)
6489076.5
ns6671001
ns0.97
Dense(2000 => 2000)/cpu/forward/Flux/(2000, 128)
6483030.5
ns6636669
ns0.98
This comment was automatically generated by workflow using github-action-benchmark.