Gaochenxiao's workspace
Runs
50
Name
10 visualized
env: Cheetah-Run-v1
env: Cheetah-Run-v1
5
env: Humanoid-v3
env: Humanoid-v3
5
env: Hopper-Hop-v1
env: Hopper-Hop-v1
5
env: Walker2d-v3
env: Walker2d-v3
5
env: Swimmer-v3
env: Swimmer-v3
5
env: Quadruped-Run-v1
env: Quadruped-Run-v1
5
env: HalfCheetah-v3
env: HalfCheetah-v3
5
env: Walker-Run-v1
env: Walker-Run-v1
5
env: Ant-v3
env: Ant-v3
5
env: Hopper-v3
env: Hopper-v3
5
State
Notes
User
Tags
Created
Runtime
Sweep
UtilsRL.numpy_fp
UtilsRL.precision
UtilsRL.torch_fp
actor_lr
actor_update_interval
batch_size
critic_lr
debug
device
discount
domain
embedding_dim
encoder_lr
env
env_type
eval_episode
eval_interval
exploration_noise
hidden_dim
log_interval
max_action
max_buffer_size
max_trajectory_length
name
noise_clip
num_epoch
policy_noise
save_interval
seed
step_before_training
step_per_epoch
target_update_interval
task
use_checkpoint
wandb.entity
wandb.project
Eval/episode_return_mean
Eval/episode_return_std
Eval/length_mean
Eval/length_std
loss/actor_loss
loss/bc_loss
loss/critic_loss
loss/encoder_loss
Finished
-
gaochenxiao
4m 38s
-
numpy.float32
float32
torch.float32
0.0001
2
1024
0.0001
false
["cuda:0","cuda:1"]
0.99
cheetah
512
0.0001
Cheetah-Run-v1
dmc
10
10
0.1
512
10
1
1000000
1000
no_checkpoint
0.5
2000
0.2
50
2
25000
1000
250
run
false
lamda-rl
td7-Online
924.64659
1.52194
1000
0
-94.11916
0
0.010756
0.0010583
Finished
-
gaochenxiao
3m 7s
-
numpy.float32
float32
torch.float32
0.0003
2
256
0.0003
false
["cuda:0","cuda:1"]
0.99
-
256
0.0003
Humanoid-v3
mujoco
10
10
0.1
256
10
1
1000000
1000
no_checkpoint
0.5
5000
0.2
50
2
25000
1000
250
Humanoid-v3
false
lamda-rl
td7-Online
9414.5288
1547.05534
878.8
142.15076
-978.86304
0
4.6282
0.001333
Finished
-
gaochenxiao
3m 37s
-
numpy.float32
float32
torch.float32
0.0001
2
1024
0.0001
false
["cuda:0","cuda:1"]
0.99
hopper
512
0.0001
Hopper-Hop-v1
dmc
10
10
0.1
512
10
1
1000000
1000
no_checkpoint
0.5
2000
0.2
50
2
25000
1000
250
hop
false
lamda-rl
td7-Online
152.81336
22.69726
1000
0
-11.46459
0
0.01027
0.0040615
Finished
-
gaochenxiao
7m 33s
-
numpy.float32
float32
torch.float32
0.0003
2
256
0.0003
false
["cuda:0","cuda:1"]
0.99
-
256
0.0003
Walker2d-v3
mujoco
10
10
0.1
256
10
1
1000000
1000
no_checkpoint
0.5
5000
0.2
50
2
25000
1000
250
Walker2d-v3
false
lamda-rl
td7-Online
6755.66182
17.55648
1000
0
-648.80568
0
2.52558
0.00068753
Finished
-
gaochenxiao
8m 59s
-
numpy.float32
float32
torch.float32
0.0003
2
256
0.0003
false
["cuda:0","cuda:1"]
0.99
-
256
0.0003
Swimmer-v3
mujoco
10
10
0.1
256
10
1
1000000
1000
no_checkpoint
0.5
5000
0.2
50
2
25000
1000
250
Swimmer-v3
false
lamda-rl
td7-Online
118.08836
12.11327
1000
0
-8.32662
0
0.00067327
0.0001369
Finished
-
gaochenxiao
7m 44s
-
numpy.float32
float32
torch.float32
0.0001
2
1024
0.0001
false
["cuda:0","cuda:1"]
0.99
quadruped
512
0.0001
Quadruped-Run-v1
dmc
10
10
0.1
512
10
1
1000000
1000
no_checkpoint
0.5
2000
0.2
50
2
25000
1000
250
run
false
lamda-rl
td7-Online
862.12334
67.52392
1000
0
-87.5761
0
0.029089
0.013275
Finished
-
gaochenxiao
5m 48s
-
numpy.float32
float32
torch.float32
0.0003
2
256
0.0003
false
["cuda:0","cuda:1"]
0.99
-
256
0.0003
HalfCheetah-v3
mujoco
10
10
0.1
256
10
1
1000000
1000
no_checkpoint
0.5
5000
0.2
50
2
25000
1000
250
HalfCheetah-v3
false
lamda-rl
td7-Online
18015.80318
46.07278
1000
0
-1595.64932
0
8.99713
0.0020839
Finished
-
gaochenxiao
2m 41s
-
numpy.float32
float32
torch.float32
0.0001
2
1024
0.0001
false
["cuda:0","cuda:1"]
0.99
walker
512
0.0001
Walker-Run-v1
dmc
10
10
0.1
512
10
1
1000000
1000
no_checkpoint
0.5
2000
0.2
50
2
25000
1000
250
run
false
lamda-rl
td7-Online
858.82313
10.15872
1000
0
-81.11054
0
0.0091705
0.0015632
Finished
-
gaochenxiao
5m 29s
-
numpy.float32
float32
torch.float32
0.0003
2
256
0.0003
false
["cuda:0","cuda:1"]
0.99
-
256
0.0003
Ant-v3
mujoco
10
10
0.1
256
10
1
1000000
1000
no_checkpoint
0.5
5000
0.2
50
2
25000
1000
250
Ant-v3
false
lamda-rl
td7-Online
9117.48273
1531.38256
980.36
58.92
-730.88065
0
8.2697
0.0089778
Finished
-
gaochenxiao
6m 12s
-
numpy.float32
float32
torch.float32
0.0003
2
256
0.0003
false
["cuda:0","cuda:1"]
0.99
-
256
0.0003
Hopper-v3
mujoco
10
10
0.1
256
10
1
1000000
1000
no_checkpoint
0.5
5000
0.2
50
2
25000
1000
250
Hopper-v3
false
lamda-rl
td7-Online
3623.46799
351.30466
836.98
78.07156
-387.26062
0
2.42984
0.00021094
1-10
of 10