Skip to main content
Reports
Created by
Created On
Last edited
Complexity
model size vs. results
0
2023-03-17
What's wrong with BPE?
Scaling the transformer and it trains BPE.
0
2023-03-02
Improving GNN architectures
A trail to despair
0
2023-03-02
MIDILike vs. REMI vs. CPWord
Some more straightforward tokenization compare.
0
2023-03-02
Maybe it's just all about model size.
I think the models I've been using is indeed a bit too small.
0
2023-02-28
Homogeneous vs. Heterogeneous
Homogeneous graphs also learns, around 15% final acc less than the heterogeneous ones. But in terms of the size it's way smaller, 0.074MB of parameter size while the hetero ones should be > 0.2MB (both with SAGE layers, and level 0 basic features)
0
2023-02-28
Graph comparison
Run set 1: Grey: Old GAT Purple: GAT with the augmentation (mean pool the attentions at each layer, mean aggregate edges) - but idk why they have more parameters! Green: GraphSAGE Run et 2: composer task, as the data is better we have better comparison SAGE learns faster and have less overfitting. Conclusion: maybe GAT is not making things better.
0
2023-02-24