Skip to main content

Dataloader Experiments

Simple comparison of dataloaders on the GQA balanced training set for both questions and images.
Created on August 17|Last edited on August 17

Results

As expected, image loading slows the dataloader down. This is clearly seen when comparing the question-only loading times for GQA train-all from the previous report against the question-and-image loading times from this report.

Similarly to the previous report, using 1 worker process instead of 0 increases memory consumption and slows the dataloader, since these tests were run without any attached models (dataloader only). This may change when a model is present to consume the data depending on GPU utilisation.




Showing first 10 runs
1020304050Time (hours)020406080100
Showing first 10 runs
1020304050Time (hours)020406080100
Showing first 10 runs
1020304050Time (hours)020406080100
Run set
21