Lucas01's workspace
Runs: 3
| Column | Run 1 | Run 2 | Run 3 |
|---|---|---|---|
| State | Crashed | Crashed | Crashed |
| Notes | - | - | - |
| User | lucas01 | lucas01 | lucas01 |
| Tags | - | - | - |
| Created | - | - | - |
| Runtime | 5d 2h 42m 42s | 8d 11h 19m 55s | 4d 11h 14m 43s |
| Sweep | - | - | - |
| _n_gpu | 0 | 0 | 0 |
| _name_or_path | - | - | - |
| _num_labels | - | - | - |
| activation_function | - | - | - |
| adafactor | false | false | false |
| adam_beta1 | 0.9 | 0.9 | 0.9 |
| adam_beta2 | 0.999 | 0.999 | 0.999 |
| adam_epsilon | 1.0000e-8 | 1.0000e-8 | 1.0000e-8 |
| add_cross_attention | false | false | false |
| architectures | - | - | - |
| attn_pdrop | - | - | - |
| author | - | - | - |
| auto_find_batch_size | false | false | false |
| bf16 | false | false | false |
| bf16_full_eval | false | false | false |
| bos_token_id | - | - | - |
| chunk_size_feed_forward | 0 | 0 | 0 |
| created_date | - | - | - |
| d_ff | 3072 | 3072 | 3072 |
| d_kv | 64 | 64 | 64 |
| d_model | 768 | 768 | 768 |
| data_seed | None | None | None |
| dataloader_drop_last | false | false | false |
| dataloader_num_workers | 0 | 0 | 0 |
| dataloader_pin_memory | true | true | true |
| ddp_bucket_cap_mb | None | None | None |
| ddp_find_unused_parameters | None | None | None |
| ddp_timeout | - | - | - |
| debug | [] | [] | [] |
| decoder_start_token_id | 0 | 0 | 0 |
| deepspeed | None | None | None |
| dense_act_fn | silu | silu | silu |
| disable_tqdm | false | false | false |
| diversity_penalty | 0 | 0 | 0 |
| do_eval | false | false | false |
| do_predict | false | false | false |
| do_sample | false | false | false |
| do_train | true | true | true |
| dropout_rate | 0.1 | 0.1 | 0.1 |
| early_stopping | false | false | false |
| embd_pdrop | - | - | - |
| encoder_no_repeat_ngram_size | 0 | 0 | 0 |
| eos_token_id | 1 | 1 | 1 |
| eval_accumulation_steps | None | None | None |
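
The three runs differ only in their Runtime values, so comparing them means turning the "5d 2h 42m 42s" style strings into numbers. The following is a minimal sketch of that conversion, assuming the runtime strings always use the d/h/m/s suffixes seen above; the helper name `runtime_to_seconds` is hypothetical and not part of any Weights & Biases API.

```python
import re

# Seconds per unit suffix used in the Runtime column.
_UNIT_SECONDS = {"d": 86400, "h": 3600, "m": 60, "s": 1}


def runtime_to_seconds(text: str) -> int:
    """Convert a runtime string such as '5d 2h 42m 42s' to whole seconds.

    Hypothetical helper: assumes only the d/h/m/s suffixes appear.
    """
    parts = re.findall(r"(\d+)\s*([dhms])", text)
    if not parts:
        raise ValueError(f"unrecognized runtime: {text!r}")
    return sum(int(amount) * _UNIT_SECONDS[unit] for amount, unit in parts)


# Runtime values copied from the three crashed runs listed above.
runtimes = ["5d 2h 42m 42s", "8d 11h 19m 55s", "4d 11h 14m 43s"]
seconds = [runtime_to_seconds(r) for r in runtimes]
total_days = sum(seconds) / 86400

for label, secs in zip(runtimes, seconds):
    print(f"{label:>16} = {secs} s")
print(f"total wall-clock time: {total_days:.2f} days")
```

Summing the converted values gives the total wall-clock time spent across the three runs before they crashed, which is easier to compare than the raw strings.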