GitHub - abhi9716/d2l-pytorch: This is an attempt to modify Dive into Deep Learning, Berkeley STAT 157 (Spring 2019) textbook's code into PyTorch.

This project is inspired by the original Dive into Deep Learning book by Aston Zhang, Zachary C. Lipton, Mu Li, Alexander J. Smola, and all the community contributors. We have made an effort to adapt the book and convert its MXNet code snippets into PyTorch.

Note: Some .ipynb notebooks may not render correctly on GitHub. We suggest cloning the repo or using nbviewer to view the notebooks.
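
As a rough illustration of the nbviewer option, the sketch below builds an nbviewer link for a notebook hosted on GitHub. The helper name `nbviewer_url`, the default branch `master`, and the notebook path `Ch02_Installation/example.ipynb` are assumptions made for this example; substitute the branch and the actual notebook path from the repo.

```python
from urllib.parse import quote


def nbviewer_url(owner, repo, path, branch="master"):
    """Build an nbviewer link for a notebook hosted on GitHub.

    `branch` defaults to "master" as an assumption; check the repo's
    default branch before relying on it.
    """
    # quote() percent-encodes spaces and other unsafe characters
    # but leaves "/" intact, so directory separators survive.
    return "https://nbviewer.org/github/{}/{}/blob/{}/{}".format(
        owner, repo, quote(branch), quote(path)
    )


# Hypothetical notebook path, used only to show the URL shape.
print(nbviewer_url("abhi9716", "d2l-pytorch", "Ch02_Installation/example.ipynb"))
```

Opening the printed link in a browser should show the notebook rendered by nbviewer rather than by GitHub.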

Chapters

Ch02 Installation
- Installation
Ch03 Introduction
- Introduction
Ch04 The Preliminaries: A Crashcourse
- 4.1 Data Manipulation
- 4.2 Linear Algebra
- 4.3 Automatic Differentiation
- 4.4 Probability and Statistics
- 4.5 Naive Bayes Classification
- 4.6 Documentation
Ch05 Linear Neural Networks
Ch06 Multilayer Perceptrons
Ch07 Deep Learning Computation
- 7.1 Layers and Blocks
- 7.2 Parameter Management
- 7.3 Deferred Initialization
- 7.4 Custom Layers
- 7.5 File I/O
- 7.6 GPUs
Ch08 Convolutional Neural Networks
- 8.1 From Dense Layers to Convolutions
- 8.2 Convolutions for Images
- 8.3 Padding and Stride
- 8.4 Multiple Input and Output Channels
- 8.5 Pooling
- 8.6 Convolutional Neural Networks (LeNet)
Ch09 Modern Convolutional Networks
- 9.1 Deep Convolutional Neural Networks (AlexNet)
- 9.2 Networks Using Blocks (VGG)
- 9.3 Network in Network (NiN)
- 9.4 Networks with Parallel Concatenations (GoogLeNet)
- 9.5 Batch Normalization
- 9.6 Residual Networks (ResNet)
- 9.7 Densely Connected Networks (DenseNet)
Ch10 Recurrent Neural Networks
- 10.1 Sequence Models
- 10.2 Language Models
- 10.3 Recurrent Neural Networks
- 10.4 Text Preprocessing
- 10.5 Implementation of Recurrent Neural Networks from Scratch
- 10.6 Concise Implementation of Recurrent Neural Networks
- 10.7 Backpropagation Through Time
- 10.8 Gated Recurrent Units (GRU)
- 10.9 Long Short Term Memory (LSTM)
- 10.10 Deep Recurrent Neural Networks
- 10.11 Bidirectional Recurrent Neural Networks
- 10.12 Machine Translation and DataSets
- 10.13 Encoder-Decoder Architecture
- 10.14 Sequence to Sequence
- 10.15 Beam Search
Ch11 Attention Mechanism
- 11.1 Attention Mechanism
- 11.2 Sequence to Sequence with Attention Mechanism
- 11.3 Transformer
Ch12 Optimization Algorithms
- 12.1 Optimization and Deep Learning
- 12.2 Convexity
- 12.3 Gradient Descent
- 12.4 Stochastic Gradient Descent
- 12.5 Mini-batch Stochastic Gradient Descent
- 12.6 Momentum
- 12.7 Adagrad
- 12.8 RMSProp
- 12.9 Adadelta
- 12.10 Adam

Contributing

Feel free to open a pull request that contributes a PyTorch notebook for any of the remaining chapters. Before starting on a notebook, open an issue with the name of that notebook. We will assign the issue to you if no one else has already been assigned.
Strictly follow the existing naming conventions for the IPython notebooks and their subsections.
If you think a section needs more or better explanation, please open an issue on the issue tracker and let us know. We'll get back to you as soon as possible.
Find some code that needs improvement and submit a pull request.
Find a reference that we missed and submit a pull request.
Try not to submit huge pull requests, since they are hard to review and incorporate. Several smaller ones are better.

Support

If you like this repo and find it useful, please consider (★) starring it, so that it can reach a broader audience.

References

[1] Original book: Dive into Deep Learning (GitHub repo)

[2] Deep Learning - The Straight Dope

[3] PyTorch - MXNet Cheatsheet

Cite

If you use this work or code for your research, please cite the original book with the following BibTeX entry.

@book{zhang2019dive,
    title={Dive into Deep Learning},
    author={Aston Zhang and Zachary C. Lipton and Mu Li and Alexander J. Smola},
    note={\url{http://www.d2l.ai}},
    year={2019}
}
