A Gentle Introduction To Weight Initialization for Neural Networks