Skip to main content
patjj
Projects
nq_search
Log in
Sign up
Overview
Workspace
Runs
Automat.
Sweeps
Reports
Artifacts
Pj20's workspace
Personal workspace
Automated workspace
Changes are only visible to you.
Runs
5
Name
2 visualized
nq_search_3b_sparse
nq_search_3b_sparse
nq_search_3b
nq_search_3b
nq_search_3b
nq_search_3b
nq_search_3b
nq_search_3b
nq_search_3b
nq_search_3b
1-5
of 5
Settings
Add panels
val
1
actor
6
1-6 of 6
actor/ppo_kl
actor/ppo_kl
50
100
150
200
Step
-0.002
0
0.002
0.004
nq_search_3b_sparse
nq_search_3b
actor/pg_loss
actor/pg_loss
50
100
150
200
Step
-0.6
-0.4
-0.2
0
0.2
0.4
0.6
nq_search_3b_sparse
nq_search_3b
actor/pg_clipfrac
actor/pg_clipfrac
50
100
150
200
Step
0
0.002
0.004
0.006
0.008
0.01
0.012
nq_search_3b_sparse
nq_search_3b
actor/lr
actor/lr
50
100
150
200
Step
0
5e-7
0.000001
0.0000015
0.000002
nq_search_3b_sparse
nq_search_3b
actor/grad_norm
actor/grad_norm
50
100
150
200
Step
4
5
6
7
nq_search_3b_sparse
nq_search_3b
actor/entropy_loss
actor/entropy_loss
50
100
150
200
Step
0.4
0.6
0.8
1
1.2
nq_search_3b_sparse
nq_search_3b
critic
23
1-6 of 23
global_seqlen
6
1-6 of 6
mfu
2
prompt_length
4
1-4 of 4
response_length
4
1-4 of 4
reward
10
1-6 of 10
Add section