Avalon

Avalon is a deep learning library in Julia with focus on high performance and interoperability with existing DL frameworks. Its main features include:

tracing autograd engine - models are just structs, transformations are just functions
optimizing code generator based on hackable computational graph
GPU support
layer API similar to PyTorch's to ease translation of existing Python code to Julia
high backward compatibility to allow accumulation of models

Usage

To get you a feeling of what Avalon is like, here's a definition of a small convolutional neural network:

using Avalon


mutable struct Net
    conv1::Conv2d
    conv2::Conv2d
    fc1::Linear
    fc2::Linear
end


Net() = Net(
    Conv2d(1, 20, 5),
    Conv2d(20, 50, 5),
    Linear(4 * 4 * 50, 500),
    Linear(500, 10)
)

function (m::Net)(x::AbstractArray)
    x = maxpool2d(relu.(m.conv1(x)), (2, 2))
    x = maxpool2d(relu.(m.conv2(x)), (2, 2))
    x = reshape(x, 4*4*50, :)
    x = relu.(m.fc1(x))
    x = logsoftmax(m.fc2(x))
    return x
end

For detailed explanation of this and other models see the tutorial. Some predefined models are also available in the zoo.

Performance

Performance comparison between different libraries is hard and benchmarks are rarely fair, but here's our best shot in this direction:

Convolutional neural network

Code available here

	training 1 epoch	training total time*	prediction
Avalon (CPU)	170 s	1742 s	39 ms
Flux (CPU)	250 s	2515 s	42 ms
-------------	----------------	--------------------	----------
Avalon (GPU)	10 s	164 s	5 ms
Flux (GPU)	12 s	150 s	5 ms
PyTorch (GPU)	12 s	120 s	2 ms

* - total time includes 10 epochs + compilation time

Note that in the test on GPU Avalon has longest compilation time and thus longest total training time after 10 epochs. However, time per epoch is the lowest, so Avalon is typically the fastest one in longer run.

Variational Autoencoder

Code available here

	training 1 epoch	training total time	prediction
Avalon (CPU)	50 s	535 s	395 μs
Flux (CPU)	948 s	158 min	81 ms
-------------	----------------	--------------------	----------
Avalon (GPU)	3 s	93 s	194 μs
Flux (GPU)**	---	---	---
PyTorch (GPU)	7 s	66 s	501 µs

** - VAE example from the Flux zoo doesn't work on GPU

API Stability

One of the central ideas behind Avalon is the ability to reuse existing code instead of writing everything from scratch. To facilitate it, Avalon is committed to high, although not absolute backward compatibility. The following table outlines stability level you should expect from various components of the library.

Component	API Stable?
Basic layers	Yes
CNN	Yes
RNN	No*
Losses	Mostly
Activations	Yes
Initializations	Mostly
Optimizers	Yes
Device API	Yes
Fitting API	No**

* - currently Avalon provides only basic implementations of vanilla RNN, LSTM and GRU; this implementation will be improved in future version and made more compatible with PyTorch version, but currently it cannot be considered stable

** - function fit!() provides a convenient shortcut for training supervised learning models, but in its current state it's too basic for most real use cases; for more durable code consider writing your own method for training using fit!() as a template

Please note that until version 1.0 "stable API" means that we will try our best to keep it unchanged, but we reserve the right to the break the rule in some rare and exceptional cases.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

Avalon

Usage

Performance

Convolutional neural network

Variational Autoencoder

API Stability

Files

README.md

Latest commit

History

README.md

File metadata and controls

Avalon

Usage

Performance

Convolutional neural network

Variational Autoencoder

API Stability