[DL Edition] T034: GNN based molecular property prediction #287

PaulaKramer · 2022-12-06T11:55:32Z

Details

Talktorial ID: 034
Title: [DL Edition] T034: GNN based molecular property prediction
Original authors: Paula Kramer
Reviewer(s): XXX
Date of review: DD-MM-YYYY

Content

One line summary: Introduction to Graph Neural Networks for Property Prediction
Potential labels or categories (e.g. machine learning, small molecules, online APIs): Machine learning, small molecules, graph neural networks
Time it took to execute (approx.): 7 min
I have used the talktorial template and followed the content and formatting suggestions there
Packages must be open-sourced and should be installable from conda-forge. If you are adding new packages to the TeachOpenCADD environment, please check if already installed packages can perform the same functionality and if not leave a sentence explaining why the new addition is needed. If the new package is not on conda-forge, please list them and their intended usage here.
- numpy, matplotlib: Already in TeachOpenCADD
- pytorch 1.12.1, pytorch-cluster 1.6.0, pytorch-scatter 2.1.0, pytorch-sparse 0.6.15, pyg 2.2.0 (conda-forge): I use it for implementing graph neural networks
Data must be publicly available, preferably accessible via a webserver or downloadable via a URL. Please list the data resources that you use and how to access them:
- QM9 dataset: Access via (torch-geometric)

Content style

Talktorial includes cross-references to other talktorials if applicable
The table of contents reflects the talktorial story-line; order of #, ##, ### headers is correct
URLs are linked with meaningful words, instead of pasting the URL directly or linking words like here.
I have spell-checked the notebook
Images have enough resolution to be rendered with quality, without being too heavy.
All figures have a description
Markdown cell content is still in-line with code cell output (whenever results are discussed)
I have checked that cell outputs are not incredibly long (this applies also to DataFrames)
Formatting looks correctly on the Sphinx render (bold, italics, figure placing)

Code style

Website

We present our talktorials on our TeachOpenCADD website (https://projects.volkamerlab.org/teachopencadd/), so we have to check as well if the Jupyter notebook renders nicely there.

If this PR adds a new talktorial, please follow these steps:
- Add your talktorial to the complete list of talktorials here (at the end).
- Add your talktorial to one or multiple of the collections here. Or propose a new collection section in your PR.
- Add your talktorial's nblink file by running python generate_nblinks.py from within the directory teachopencadd/docs/talktorials.
- Please complile the website following the instructions here.
Check the rendering of the talktorial of this PR.
Is your talktorial listed in the talktorial list?
Is your talktorial listed in the talktorial collections?
- Add a picture for your talktorial in the collection view by following these instructions.

review-notebook-app · 2022-12-09T09:34:43Z

Check out this pull request on

See visual diffs & provide feedback on Jupyter Notebooks.

Powered by ReviewNB

gerritgr · 2023-01-30T11:02:33Z

GNNs should be defined first as differentiable and trainable, permutation equi(/in)variant functions. The architectures should be introduced as specific instances of such functions.
The relationship between massage passing as a general and powerful framework and GCN/GIN as (less powerful) instances could be clarified: Also the relationship between the aggregation/pooling function (input is a set) and the permutation invariance.
refer to T33 in the introduction.
d is overloaded with the degree and the feature dimension.
The advantages of a GNN library should be stated (sparse matrices, graph batching), also mention www.dgl.ai.
bessere property? -> ChatGPT suggests: Electronegativity, Ionization potential, Bond angles and distances, no idea if they make sense, though.
Say explicitly that the pooling layer is invariant to the order of the input (the same as the aggregation function).
True vs predicted value -> would say Ground truth vs prediction
Can you give an intuition on what makes GIN more powerful?

Start branch

9e21758

AndreaVolkamer changed the title ~~Start branch~~ [DL Edition] T034: GNN based molecular property prediction Dec 8, 2022

AndreaVolkamer added the new talktorial New talktorial label Dec 8, 2022

first version

20178cb

Paula Kramer and others added 4 commits December 16, 2022 19:10

added theory + new model

e25a28e

added test error and plots

cf82cd3

added model parameters, references, discussion and plots

ff12a52

added quiz, reformat, spell check

5f813d3

PaulaKramer requested a review from AndreaVolkamer December 23, 2022 12:55

dominiquesydow mentioned this pull request Dec 27, 2022

[2023.05.2-base] DL edition #285

Merged

9 tasks

PaulaKramer and others added 3 commits January 3, 2023 16:36

some improvements in theory part

d327a73

feedback t034

1cafe63

Delete .DS_Store

933e4d9

added feedback from Joschka and Gerrit

d3a3dfb

gerritgr merged commit f7542d6 into DL_edition Apr 11, 2023

mbackenkoehler deleted the pk-034-gnns branch January 29, 2024 10:02

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[DL Edition] T034: GNN based molecular property prediction #287

[DL Edition] T034: GNN based molecular property prediction #287

PaulaKramer commented Dec 6, 2022 •

edited

Loading

review-notebook-app bot commented Dec 9, 2022

gerritgr commented Jan 30, 2023 •

edited

Loading

[DL Edition] T034: GNN based molecular property prediction #287

[DL Edition] T034: GNN based molecular property prediction #287

Conversation

PaulaKramer commented Dec 6, 2022 • edited Loading

Details

Content

Content style

Code style

Website

review-notebook-app bot commented Dec 9, 2022

gerritgr commented Jan 30, 2023 • edited Loading

PaulaKramer commented Dec 6, 2022 •

edited

Loading

gerritgr commented Jan 30, 2023 •

edited

Loading