Effects of Weight Initialization on Neural Networks