Self-Supervised Learning in Audio and Speech