Skip to main content

Memory error cause failing processes

Memory error cause failing processes
Created on November 11|Last edited on November 11
Hi, I really liked ur amazing tool and just started to use it. I m using sweep config for hyperparameters. Sometimes my running is get broke due to some reason in the models parameters etc. and my gpu get stuck with full memory. Is there a way to kill the gpu's current job to free the gpu ram? it would be amazing to apply it to my code with a given PID number and kill it.
Lev Telyatnikov
Lev Telyatnikov •  
I would be also happy to find the answer
Reply