Understanding the Effectivity of Ensembles in Deep Learning