Howuhh's group workspace
Group: small_scale_awac_chaotic_lstm_multiseed-v0
Name
114 visualized
alpha: null
alpha: null
114
State
Notes
User
Tags
Created
Runtime
Sweep
batch_size
character
checkpoints_path
data_mode
eval_episodes
eval_every
eval_processes
eval_seed
group
learning_rate
mlc_job_name
name
project
rnn_dropout
rnn_hidden_dim
rnn_layers
seq_len
train_seed
update_steps
use_prev_action
version
weight_decay
alpha
clip_range
gamma
tau
num_heads
expectile_tau
temperature
depth_max
depth_mean
depth_median
depth_min
depth_std
loss
reward_max
reward_mean
reward_median
reward_min
reward_std
times/backward_pass
times/batch_loading_cpu
times/batch_loading_gpu
times/evaluation_cpu
Finished
-
howuhh
2d 21h 7m 59s
64
["kni-hum-law","pri-elf-cha","val-dwa-law","val-hum-law","val-hum-neu","wiz-elf-cha","wiz-gno-neu","wiz-hum-cha","wiz-hum-neu","wiz-orc-cha"]
-
memmap
50
10000
14
50
small_scale_awac_chaotic_lstm_multiseed
0.0003
-
["awac-kni-hum-law-8b7dca62","awac-pri-elf-cha-be3cd198","awac-wiz-elf-cha-8a20b8d0","awac-wiz-elf-cha-ca4691ff","awac-wiz-gno-neu-12c7428e","awac-wiz-gno-neu-6f65df81","awac-wiz-gno-neu-98c62f36","awac-wiz-orc-cha-0514f22d","awac-wiz-orc-cha-cab38a53","awac-wiz-orc-cha-ee213c70"]
NetHack
0
2048
2
16
1
500000
true
0
0
-
10
0.999
0.005
-
-
1
0.64912
0.021579
0
0
0.11136
0.22646
2326.40351
511.20596
351.19737
2.88596
522.70602
0.015029
0.058862
0.058819
109.94654
1-1
of 1