Gaochenxiao's workspace
Runs
90
Name
10 visualized
task: hopper-medium-replay-v2
task: hopper-medium-replay-v2
1
5
task: hopper-medium-expert-v2
task: hopper-medium-expert-v2
1
5
task: hopper-medium-v2
task: hopper-medium-v2
1
5
task: hopper-full-replay-v2
task: hopper-full-replay-v2
1
5
task: hopper-random-v2
task: hopper-random-v2
1
5
task: hopper-expert-v2
task: hopper-expert-v2
1
5
State
Notes
User
Tags
Created
Runtime
Sweep
UtilsRL.numpy_fp
UtilsRL.precision
UtilsRL.torch_fp
actor_lr
actor_weight_decay
aw_lambda
batch_size
critic_lr
debug
device
discount
eval_episode
eval_interval
hidden_dims
log_interval
max_action
max_epoch
name
normalize_obs
normalize_reward
save_interval
seed
step_per_epoch
task
tau
wandb.entity
wandb.project
Eval/length_mean
Eval/length_std
Eval/normalized_score_mean
Eval/normalized_score_std
loss/actor
loss/actor_loss
loss/alpha
loss/critic
misc/alpha
Finished
-
gaochenxiao
8m 9s
-
numpy.float32
float32
torch.float32
0.0003
0
0.33333
256
0.0003
false
["cuda:12","cuda:13","cuda:14","cuda:15"]
0.99
10
10
256
10
1
1000
d4rl
true
false
50
2
1000
hopper-medium-replay-v2
0.005
lamda-rl
AWAC-D4RL
936.46
87.31438
95.44885
8.62738
-0.98025
-0.98025
0
18.41725
0
Finished
-
gaochenxiao
6m 46s
-
numpy.float32
float32
torch.float32
0.0003
0
0.33333
256
0.0003
false
["cuda:12","cuda:13","cuda:14","cuda:15"]
0.99
10
10
256
10
1
1000
d4rl
true
false
50
2
1000
hopper-medium-expert-v2
0.005
lamda-rl
AWAC-D4RL
780.14
156.23403
90.55438
17.59282
-2.9406
-2.9406
0
3.09879
0
Finished
-
gaochenxiao
11m 11s
-
numpy.float32
float32
torch.float32
0.0003
0
0.33333
256
0.0003
false
["cuda:12","cuda:13","cuda:14","cuda:15"]
0.99
10
10
256
10
1
1000
d4rl
true
false
50
2
1000
hopper-medium-v2
0.005
lamda-rl
AWAC-D4RL
626.54
137.34729
65.62988
14.25194
-2.28305
-2.28305
0
1.89013
0
Failed
-
gaochenxiao
14m 30s
-
numpy.float32
float32
torch.float32
0.0003
0
0.33333
256
0.0003
false
["cuda:12","cuda:13","cuda:14","cuda:15"]
0.99
10
10
256
10
1
1000
d4rl
true
false
50
2
1000
hopper-full-replay-v2
0.005
lamda-rl
AWAC-D4RL
-
-
-
-
-
-
-
-
-
Finished
-
gaochenxiao
11m 16s
-
numpy.float32
float32
torch.float32
0.0003
0
0.33333
256
0.0003
false
["cuda:12","cuda:13","cuda:14","cuda:15"]
0.99
10
10
256
10
1
1000
d4rl
true
false
50
2
1000
hopper-random-v2
0.005
lamda-rl
AWAC-D4RL
297.14
0.64477
12.92492
0.038002
4.68404
4.68404
0
44.72119
0
Finished
-
gaochenxiao
3m 24s
-
numpy.float32
float32
torch.float32
0.0003
0
0.33333
256
0.0003
false
["cuda:12","cuda:13","cuda:14","cuda:15"]
0.99
10
10
256
10
1
1000
d4rl
true
false
50
2
1000
hopper-expert-v2
0.005
lamda-rl
AWAC-D4RL
809.34
145.00093
94.39681
16.07552
-1.78001
-1.78001
0
0.44673
0
1-6
of 6