Skip to main content
Reports
Created by
Created On
Last edited
An Introduction to HuggingFace's Accelerate Library
In this article, we dive into the internal workings of the Accelerate library from HuggingFace, to answer "could Accelerate really be this easy?"
8
2021-04-25
Is MLP-Mixer a CNN in Disguise?
As part of this blog post, we look at the MLP Mixer architecture in detail and also understand why it is not considered conv free.
7
2021-07-10
Are fully connected and convolution layers equivalent? If so, how?
As part of this post, we look at the Convolution and Linear layers in MS Excel and compare results from Excel with PyTorch implementations.
1
2021-07-13
ConViT: Improving Vision Transformers with Soft Convolutional Inductive Biases
As part of this blog post, we will look into the ConViT transformer architecture in detail and learn all about it and also the gated positional self-attention (GPSA) layer! We also see how the ConViT architecture gets the best of both worlds and obtains the benefits of both Transformers and CNNs.
9
2021-05-25
A faster way to get working and up-to-date conda environments using "fastchan"
As part of this blog post, I introduce conda, pip, docker & conda channels. I showcase an example of how recently I got stuck in a "messy" environment and some possible solutions. Finally, I introduce "fastchan" and show how it can make all of our lives easier.
4
2021-05-12
Explained: Characterizing Signal Propagation to Close the Performance Gap in Unnormalized ResNets
BatchNorm is a key component in almost all SOTA image classifiers, but it also introduces practical challenges. Is there a way to get similar performance for image classifiers without BatchNorm? Looks like the answer is "yes"!
8
2021-04-27
Revisiting ResNets: Improved Training and Scaling Strategies
Do training methods matter more than model architectural changes? In this post we take a dive into the ResNet-RS paper and try to answer this question.
22
2021-04-27
EfficientNetV2
As part of this blog, we are going to be looking into the EfficientNetV2 architecture in detail with code implementation in PyTorch.
4
2021-05-03