Cstorm125's workspace
Runs
1
Name
1 visualized
State
Notes
User
Tags
Created
Runtime
Sweep
_n_gpu
_name_or_path
_num_labels
activation_dropout
activation_function
adafactor
adam_beta1
adam_beta2
adam_epsilon
add_bias_logits
add_cross_attention
add_final_layer_norm
architectures
attention_dropout
bad_words_ids
bos_token_id
chunk_size_feed_forward
classif_dropout
classifier_dropout
d_model
dataloader_drop_last
dataloader_num_workers
dataloader_pin_memory
ddp_find_unused_parameters
debug
decoder_attention_heads
decoder_ffn_dim
decoder_layerdrop
decoder_layers
decoder_start_token_id
deepspeed
disable_tqdm
diversity_penalty
do_eval
do_predict
do_sample
do_train
dropout
early_stopping
encoder_attention_heads
encoder_ffn_dim
encoder_layerdrop
encoder_layers
encoder_no_repeat_ngram_size
Failed
-
cstorm125
12h 44m 37s
-
1
modelx
3
0
swish
false
0.9
0.999
1.0000e-8
false
false
false
["MarianMTModel"]
0
[[3]]
1
0
0
0
512
false
0
true
None
[]
8
2048
0
6
3
None
false
0
true
false
false
false
0.1
false
8
2048
0
6
0
1-1
of 1