9. Graph Neural Networks with PyTorch Geometric
PyTorch Geometric has really great documentation. It has helper functions for data loading, data transforms, and batching specific to graph data structures, and it also ships several graph neural network implementations. It also comes with easy loading of classic graph datasets like the Cora citation network and the Zachary Karate Club network.
In addition, it has a base `MessagePassing` class so that we can implement our own graph neural networks.
Data Handling
A graph in PyTorch Geometric is described by an instance of `torch_geometric.data.Data`, which has the following attributes:
- `data.x`: node feature tensor of shape `[num_nodes, num_node_features]`
- `data.edge_index`: graph connectivity in COO format, with shape `[2, num_edges]`. It represents all the edges and is an alternative to the adjacency matrix representation. Note that this is not a list of edges; if you have a list of edges you can get `edge_index` with `edge_index = edge_list.t().contiguous()`, as shown in the sketch below
- `data.edge_attr`: edge feature tensor of shape `[num_edges, num_edge_features]`
- `data.y`: target to train against; it can be anything from node labels of shape `[num_nodes, *]` to graph labels of shape `[1, *]`
- `data.pos`: node position matrix of shape `[num_nodes, num_dimensions]`
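As a minimal sketch (the features and edges here are made up), constructing a `Data` object for a three-node graph looks like this:

```python
import torch
from torch_geometric.data import Data

# Three nodes, each with a single scalar feature.
x = torch.tensor([[-1.0], [0.0], [1.0]])

# A list of (source, target) edge pairs; two undirected edges,
# each stored in both directions.
edge_list = torch.tensor([[0, 1], [1, 0], [1, 2], [2, 1]])

# Transpose to the [2, num_edges] COO layout that edge_index expects.
edge_index = edge_list.t().contiguous()

data = Data(x=x, edge_index=edge_index)
print(data)  # Data(x=[3, 1], edge_index=[2, 4])
```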
GNN Layers
A minor difference between report 8 and the PyTorch Geometric documentation is that the documentation uses a much more generic equation, with different variable names, for getting the embedding of a node at layer $k$ from its neighbours. It goes as follows:
$$x_i^{(k)} = \gamma^{(k)}\bigg(x_i^{(k-1)}, \square_{j \in \mathcal{N}(i)}\, \phi^{(k)}\big(x_i^{(k-1)}, x_j^{(k-1)}, e_{j,i}\big)\bigg)$$
Here $\square$ is our aggregation function (a differentiable, permutation-invariant function such as sum, mean, or max), $x_i^{(k)}$ is the embedding of node $i$ at layer $k$, $e_{j,i} \in \mathbb{R}^D$ denotes optional edge features from node $j$ to node $i$, and $\gamma$ and $\phi$ are differentiable functions such as MLPs. $\gamma$, $\phi$ and $\square$ vary according to the graph neural network we use.
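To make the correspondence concrete, here is a rough sketch of how $\gamma$, $\phi$ and $\square$ map onto the `MessagePassing` base class (the toy layer itself is our own illustration, not a layer from the library):

```python
import torch
from torch_geometric.nn import MessagePassing

class ToyLayer(MessagePassing):
    def __init__(self):
        super().__init__(aggr='add')  # □: the aggregation over neighbours

    def forward(self, x, edge_index):
        # Kicks off message passing: calls message(), aggregates, then update().
        return self.propagate(edge_index, x=x)

    def message(self, x_j):
        # φ: the message each neighbour j sends along edge (j, i).
        return x_j

    def update(self, aggr_out, x):
        # γ: combine a node's own embedding with its aggregated messages.
        return x + aggr_out
```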
GCN
For example, in the case of a Graph Convolutional Network (GCN) layer, we defined the embedding equation as
$$h_v^k = \sigma \bigg( W_k \sum_{u \in N(v)} \frac{h_u^{k-1}}{|N(v)|} + B_k h_v^{k-1} \bigg), \quad \forall k \in \{1, \cdots, K\}$$
We included the non-linearity $\sigma$ here, but in PyTorch Geometric it is applied as a separate ReLU layer after the GNN layer.
Now we can see how we get our GCN equation from the generic equation:
$$\square = \sum$$
$$\phi(x_i, x_j, e_{j,i}) = x_j$$
$$\gamma(x_i, \square_{\mathcal{N}}) = B x_i + W \square_{\mathcal{N}}$$
You can find how to implement a GCN layer from the `MessagePassing` base class in the documentation here. You can also find the `GCNConv` layer in the PyTorch Geometric documentation here.
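As a rough sketch (the hidden size is an arbitrary choice, and the module itself is ours, not the library's), a small two-layer model built from `GCNConv` looks like this, with the ReLU kept as a separate step as noted above:

```python
import torch
import torch.nn.functional as F
from torch_geometric.nn import GCNConv

class GCN(torch.nn.Module):
    def __init__(self, num_node_features, num_classes, hidden_channels=16):
        super().__init__()
        self.conv1 = GCNConv(num_node_features, hidden_channels)
        self.conv2 = GCNConv(hidden_channels, num_classes)

    def forward(self, x, edge_index):
        # The non-linearity lives outside the conv layer.
        x = F.relu(self.conv1(x, edge_index))
        return self.conv2(x, edge_index)
```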
GraphSAGE
Here the equation we had was
$$h_v^k = \sigma\Big(\Big[A_k \cdot \mathrm{AGG}\big(\lbrace h_u^{k-1}, \forall u \in N(v) \rbrace\big),\; B_k h_v^{k-1}\Big]\Big)$$
Where
$$\square = \mathrm{AGG}$$
$$\phi(x_i, x_j, e_{j,i}) = x_j$$
$$\gamma(x_i, \square_{\mathcal{N}}) = \big[A \cdot \mathrm{AGG}_{\mathcal{N}},\; B x_i\big]$$
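In PyG this family is implemented by the `SAGEConv` layer; a minimal usage sketch (channel sizes are arbitrary):

```python
from torch_geometric.nn import SAGEConv

# AGG defaults to a mean over the neighbourhood; other aggregations
# can be chosen via the aggr argument.
conv = SAGEConv(in_channels=16, out_channels=32)
```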
Other Conv Layers
You can find the documentation for all the convolutional layers here.
For each layer the documentation has the embedding equation and the link to the original paper.
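Because most of these conv layers share a similar call signature, trying a different architecture is often a small change. For example, a graph attention layer (the channel and head counts here are illustrative):

```python
from torch_geometric.nn import GATConv

# Multi-head attention over each node's neighbourhood; by default the
# per-head outputs are concatenated.
conv = GATConv(in_channels=16, out_channels=32, heads=8)
```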
GCN on Zachary Karate Club Network
Here we trained a GCN model on the Zachary Karate Club network to do a node classification task. We had 3 GCNConv layers, which implies every node aggregated messages from its 3-hop neighbourhood. You can see from the results that before training the node embeddings were all over the place, but after training there is a clear separation between the node classes. I got this training code from one of the example Colab notebooks in the PyTorch Geometric documentation, and forked the notebook to instrument it with wandb. You can find the notebook here.
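A rough sketch of this setup (the hidden size, learning rate, and epoch count are illustrative, not the notebook's exact values):

```python
import torch
import torch.nn.functional as F
from torch_geometric.datasets import KarateClub
from torch_geometric.nn import GCNConv

dataset = KarateClub()
data = dataset[0]  # one graph: x, edge_index, y, train_mask

class GCN3(torch.nn.Module):
    def __init__(self, in_channels, hidden_channels, num_classes):
        super().__init__()
        self.conv1 = GCNConv(in_channels, hidden_channels)
        self.conv2 = GCNConv(hidden_channels, hidden_channels)
        self.conv3 = GCNConv(hidden_channels, num_classes)  # 3 layers -> 3-hop neighbourhood

    def forward(self, x, edge_index):
        x = F.relu(self.conv1(x, edge_index))
        x = F.relu(self.conv2(x, edge_index))
        return self.conv3(x, edge_index)

model = GCN3(dataset.num_features, 16, dataset.num_classes)
optimizer = torch.optim.Adam(model.parameters(), lr=0.01)

model.train()
for epoch in range(200):
    optimizer.zero_grad()
    out = model(data.x, data.edge_index)
    # Only the few labelled nodes contribute to the loss.
    loss = F.cross_entropy(out[data.train_mask], data.y[data.train_mask])
    loss.backward()
    optimizer.step()
```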
Comparison of Various GNNs on the Cora Citation Network
We trained various GNN models on the Cora citation network to see how each performs.
You can see that by using a GNN we improved test accuracy from 0.5 to 0.8: the 0.5 was a plain MLP that used just the node features, while the GNN takes advantage of the network structure along with the node features.
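A minimal sketch of loading Cora and measuring test accuracy (reusing the `GCN` module sketched in the GCN section; the `root` path is arbitrary):

```python
from torch_geometric.datasets import Planetoid

dataset = Planetoid(root='data/Cora', name='Cora')
data = dataset[0]  # one graph with train/val/test masks

model = GCN(dataset.num_features, dataset.num_classes)
# ...train as in the Karate Club example, masking the loss with data.train_mask...

model.eval()
pred = model(data.x, data.edge_index).argmax(dim=1)
test_acc = (pred[data.test_mask] == data.y[data.test_mask]).float().mean().item()
print(f'Test accuracy: {test_acc:.2f}')
```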
Chapters
- Introduction, Structure of Graphs
- Properties of Networks and Random Graph Models
- Motifs and Structural Roles in Networks
- Community Structure in Networks
- Spectral Clustering
- Message Passing and Node Classification
- Graph Representation Learning
- Graph Neural Networks
- Graph Neural Networks - PyTorch Geometric
- Deep Generative Models for Graphs
- Link Analysis: PageRank
- Network Effects and Cascading Behaviour
- Probabilistic Contagion and Models of Influence
- Influence Maximization in Networks
- Outbreak Detection in Networks
- Network Evolution
- Reasoning over Knowledge Graphs
- Limitations of Graph Neural Networks
- Applications of Graph Neural Networks