
Question about the GCNConv layer code #211

Closed
5tinzi opened this issue Oct 9, 2022 · 1 comment

Comments


5tinzi commented Oct 9, 2022

Thanks for such a wonderful project. I'm having some trouble following the idea behind the GCNConv layer code.

Allow me to begin by presenting my understanding of the GCN. From the GCN paper Semi-Supervised Classification with Graph Convolutional Networks, the computation performed by a GCN layer is simply:

$$\tilde{A} = \hat{D}^{-\frac{1}{2}} \hat{A} \hat{D}^{-\frac{1}{2}}, \qquad X^{l+1} = f(W, \tilde{A}) = \sigma(\tilde{A} X^{l} W)$$

where $\tilde{A}$ is the symmetrically normalized adjacency matrix (with self-loops added), which can be calculated once in a pre-processing step, and $X^{l+1}$ is the output of layer $l$.

Intuitively, a GCNConv layer could be built with the following (pseudo)code:

using LinearAlgebra

function pre_processing(g)
  A_hat = adj_Matrix(g) + I                               # Â = A + I, self-connection for every node
  d_hat = Diagonal(1 ./ sqrt.(vec(sum(A_hat, dims=2))))   # D̂^(-1/2) from the degrees of Â
  return d_hat * A_hat * d_hat                            # D̂^(-1/2) Â D̂^(-1/2)
end

struct GCNConv
  w
  A
end

# constructor: normalize the adjacency once and store it in the layer
function GCNConv(g, in, out)
  A = pre_processing(g)
  w = randn(out, in)
  GCNConv(w, A)
end

# layer forward pass: x holds node features in columns (in × num_nodes),
# sigma is an elementwise activation
function (l::GCNConv)(x)
  sigma.(l.w * x * l.A)
end

Then we can define and apply a GCN layer as follows:

# given a graph g and a node feature matrix x (in × num_nodes)
l = GCNConv(g, in, out)
l(x)     # one GCN layer
l(l(x))  # two GCN layers (requires in == out)

The above is my very personal understanding, and maybe a preconceived notion, but I find the GraphNeuralNetworks.jl GCNConv design a bit difficult to follow:

  1. In the GraphNeuralNetworks.jl GCNConv layer, there seems to be no pre-processing step for something like $\hat{D}^{-\frac{1}{2}} \hat{A} \hat{D}^{-\frac{1}{2}}$, so it seems that each time we call a GCNConv layer it repeats this calculation. During model training hundreds of thousands of calls are needed, so will this slow things down?
  2. Another question: GCNConv applies `x = x .* c'` twice (in lines 102 and 110), so it looks like it performs $X \hat{D}^{-\frac{1}{2}} \hat{D}^{-\frac{1}{2}}$ instead of $\hat{D}^{-\frac{1}{2}} \hat{A} \hat{D}^{-\frac{1}{2}} X$. It is hard to follow the idea here; any tips or suggestions?
     https://github.com/CarloLucibello/GraphNeuralNetworks.jl/blob/afd80e8024abb270db79cd8592376ba60bc63f60/src/layers/conv.jl#L100-L110

Any advice would be appreciated and thank you for your patience.

5tinzi closed this as completed Oct 10, 2022
CarloLucibello (Member) commented:

Hi @5tinzi, your implementation is actually correct and functionally equivalent to the one in this repo. The one in this repo is complicated by the fact that you don't want to materialize large dense matrices, but instead do gather/scatter operations or sparse matrix multiplications, and you also want to support edge weights when present.
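For concreteness, here is a minimal sketch (not the library's actual code; the small dense adjacency `A_hat` and feature matrix `x`, with node features in columns, are made-up illustrations) of why scaling by `c = 1 ./ sqrt.(degree)` before and after the neighbor aggregation gives the same result as multiplying by the pre-normalized matrix $\hat{D}^{-\frac{1}{2}} \hat{A} \hat{D}^{-\frac{1}{2}}$:

using LinearAlgebra

# toy symmetric adjacency with self-loops (Â = A + I) for 4 nodes
A_hat = [1.0 1 0 0;
         1 1 1 0;
         0 1 1 1;
         0 0 1 1]
x = randn(3, 4)                            # node features in columns

c = 1 ./ sqrt.(vec(sum(A_hat, dims=2)))    # c[i] = 1 / sqrt(d̂_i)
A_norm = Diagonal(c) * A_hat * Diagonal(c) # D̂^(-1/2) Â D̂^(-1/2), computed once

y_pre   = x * A_norm                       # "pre-processed" form
y_fused = ((x .* c') * A_hat) .* c'        # scale → aggregate → scale

@assert y_pre ≈ y_fused                    # both forms agree

In the library, the middle multiplication by $\hat{A}$ is carried out with gather/scatter message passing (or a sparse matmul) rather than a dense matrix product, which is why only the per-node scalings by `c` appear explicitly in the code.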
