Gaochenxiao's workspace
Runs
73
Name
task: walker2d-medium-replay-v2     6  6
task: hopper-random-v2              6  6
task: walker2d-random-v2            6  6
task: halfcheetah-medium-v2         6  6
task: hopper-medium-replay-v2       6  6
task: hopper-medium-v2              6  6
task: walker2d-medium-expert-v2     6  6
task: walker2d-medium-v2            6  6
task: halfcheetah-random-v2         6  7
task: halfcheetah-medium-replay-v2  6  6
task: hopper-medium-expert-v2       6  6
task: halfcheetah-medium-expert-v2  6  6
Columns with the same value in every row:

Notes: -
User: gaochenxiao
Tags: (empty)
Created: (empty)
Sweep: -
UtilsRL.numpy_fp: numpy.float32
UtilsRL.precision: float32
UtilsRL.torch_fp: torch.float32
actor_lr: 0.0003
anneal_step: 1000000
critic_q_lr: 0.0003
critic_v_lr: 0.0003
debug: false
device: ["cuda:0","cuda:1"]
discount: 0.99
eval_episode: 10
eval_interval: 20
exp_adv_max: 100
hidden_dims: 256
improve_method: iql
log_interval: 10
max_action: 1
max_epoch: 1000
name: ["v0.5-a10.0-u0.0-s-1.0","v0.7-a10.0-u0.0-s-0.5","v0.7-a10.0-u0.0-s-1.0","v0.7-a10.0-u0.0-s-2.0","v0.7-a15.0-u0.0-s-1.0","v0.8-a10.0-u0.0-s-1.0"]
norm_layer: true
normalize_obs: true
normalize_reward: true
policy_logstd_min: -5
replay_actor_lr: 0.0003
replay_param: 1
save_interval: 50
seed: 0
step_per_epoch: 1000
tau: 0.005
train_batch_size: 512
uniform_coef: 0
update_buffer_interval: 1
update_replay_actor_interval: 1
wandb.entity: lamda-rl
wandb.project: DAC-D4RL
warmup_epoch: 100

Columns that differ by row:

task                         | State    | Runtime | actor_param | replay_logstd_min | v_param | Eval/length_mean | Eval/length_std | Eval/normalized_score_mean | Eval/normalized_score_std
walker2d-medium-replay-v2    | Finished | 6m 13s  | 10.83333    | -1.08333          | 0.68333 | 941.93333        | 110.10764       | 88.52833                   | 11.29683
hopper-random-v2             | Finished | 5m 25s  | 10.83333    | -1.08333          | 0.68333 | 275.38333        | 9.81713         | 12.26717                   | 0.90724
walker2d-random-v2           | Finished | 4m 5s   | 10.83333    | -1.08333          | 0.68333 | 98.95            | 18.22622        | 1.51411                    | 0.6202
halfcheetah-medium-v2        | Finished | 5m 35s  | 10.83333    | -1.08333          | 0.68333 | 1000             | 0               | 53.97476                   | 0.58122
hopper-medium-replay-v2      | Finished | 4m 1s   | 10.83333    | -1.08333          | 0.68333 | 1000             | 0               | 101.56036                  | 0.17103
hopper-medium-v2             | Finished | 4m 50s  | 10.83333    | -1.08333          | 0.68333 | 945.33333        | 72.07602        | 95.92917                   | 6.99799
walker2d-medium-expert-v2    | Finished | 4m 50s  | 10.83333    | -1.08333          | 0.68333 | 1000             | 0               | 113.15157                  | 0.084045
walker2d-medium-v2           | Finished | 6m 12s  | 10.83333    | -1.08333          | 0.68333 | 1000             | 0               | 86.17948                   | 0.26722
halfcheetah-random-v2        | Crashed  | 5m 34s  | 10.71429    | -1.21429          | 0.68571 | 1000             | 0               | 19.70662                   | 1.23464
halfcheetah-medium-replay-v2 | Finished | 5m 11s  | 10.83333    | -1.08333          | 0.68333 | 1000             | 0               | 50.87172                   | 0.54632
hopper-medium-expert-v2      | Finished | 4m 6s   | 10.83333    | -1.08333          | 0.68333 | 995.91667        | 10.26787        | 111.52068                  | 1.75294
halfcheetah-medium-expert-v2 | Finished | 5m 34s  | 10.83333    | -1.08333          | 0.68333 | 1000             | 0               | 32.79722                   | 20.06958
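
The fractional values for actor_param, v_param and replay_logstd_min look like per-group averages over the runs listed in the name column. The sketch below checks that arithmetic. It assumes a name layout of v<v_param>-a<actor_param>-u<uniform_coef>-s<replay_logstd_min>, which is inferred from the numbers and not stated anywhere on the page; parse_name and group_means are hypothetical helper names. The seven-run case is also a guess: it adds a second copy of "v0.7-a10.0-u0.0-s-2.0" and happens to reproduce the halfcheetah-random-v2 values.

```python
from statistics import mean

# Run names copied from the name column of the table.
NAMES = [
    "v0.5-a10.0-u0.0-s-1.0",
    "v0.7-a10.0-u0.0-s-0.5",
    "v0.7-a10.0-u0.0-s-1.0",
    "v0.7-a10.0-u0.0-s-2.0",
    "v0.7-a15.0-u0.0-s-1.0",
    "v0.8-a10.0-u0.0-s-1.0",
]


def parse_name(name):
    """Split a run name into hyperparameters (assumed layout, see lead-in)."""
    # maxsplit=3 keeps the last field whole, so "s-1.0" retains its minus sign.
    v, a, u, s = name.split("-", 3)
    return {
        "v_param": float(v[1:]),
        "actor_param": float(a[1:]),
        "uniform_coef": float(u[1:]),
        "replay_logstd_min": float(s[1:]),
    }


def group_means(names):
    """Average each parsed hyperparameter over a group, rounded to 5 decimals."""
    rows = [parse_name(n) for n in names]
    return {key: round(mean(r[key] for r in rows), 5) for key in rows[0]}


six_runs = group_means(NAMES)
# Assumption: the seventh halfcheetah-random-v2 run repeats one configuration.
seven_runs = group_means(NAMES + ["v0.7-a10.0-u0.0-s-2.0"])

print(six_runs)
print(seven_runs)
```

With six runs the means come out as 0.68333, 10.83333 and -1.08333, matching eleven of the twelve rows; with the assumed seventh run they become 0.68571, 10.71429 and -1.21429, matching the halfcheetah-random-v2 row.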