sebulba profile
Created on January 29|Last edited on January 30
Comment
Finding highlights:
- adding a timeout significantly reduces params_queue get time for actors, especially when paired with a dedicated GPU; however it does come with a side effect (e.g., such as the actor will not be able to pull the latest params from the learner and will use the old params instead)
count data_transfer
1
1
device_put_sharded only in learner
1
1
do data_transfer
2
do data_transfer
1
Add a comment