darkbird / psy-llama-pretraining
Kirp's workspace
Runs (4 total, 3 visualized)
Train with default qkvogdu+lm+emb on 4*A6000 lr=1e-4 pretraining
Train with t10000 map qkvogdu+lm+emb on 8*A6000 lr=1e-4 pretraining
Train with t10000 random qkvogdu+lm+emb on 4*A6000 lr=1e-4 pretraining
Train with default qkvogdu
[Chart: eval/samples_per_second plotted against Step]
Runs plotted:
Train with default qkvogdu+lm+emb on 4*A6000 lr=1e-4 pretraining
Train with t10000 map qkvogdu+lm+emb on 8*A6000 lr=1e-4 pretraining
Train with t10000 random qkvogdu+lm+emb on 4*A6000 lr=1e-4 pretraining