Skip to main content

Wav2Vec2 - DistributedDataParallel (DDP) - 8 GPU V100

num_gpu: 8; batch_size per GPU: 4; no gradient checkpointing; no gradient accumulation; no group length; fp16 true
Created on September 20|Last edited on September 21

Important metrics


5001k1.5k2k2.5k3kStep3
5001k1.5k2k2.5k3kStep12
05001k1.5k2k2.5k3kStep5101520
Run: ./wav2vec2-large-xlsr-turkish-demo-dist
1