Dsuamnth17's workspace
Runs
3
State
Notes
User
Tags
Created
Runtime
Sweep
args._name
args.activation_dropout
args.activation_fn
args.adam_betas
args.adam_eps
args.adaptive_input
args.adaptive_softmax_dropout
args.all_gather_list_size
args.arch
args.attention_dropout
args.azureml_logging
args.best_checkpoint_metric
args.bf16
args.broadcast_buffers
args.bucket_cap_mb
args.checkpoint_activations
args.checkpoint_shard_count
args.checkpoint_suffix
args.clip_norm
args.cpu
args.cpu_offload
args.criterion
args.cross_self_attention
args.curriculum
args.data
args.data_buffer_size
args.ddp_backend
args.decoder_attention_heads
args.decoder_embed_dim
args.decoder_ffn_embed_dim
args.decoder_input_dim
args.decoder_layerdrop
args.decoder_layers
args.decoder_learned_pos
args.decoder_normalize_before
args.decoder_output_dim
args.device_id
args.disable_validation
args.distributed_backend
args.distributed_no_spawn
args.distributed_port
args.distributed_rank
args.distributed_world_size
args.dropout
Crashed
-
gowthamr
39m 2s
-
transformer_4x
0
relu
(0.9, 0.98)
1.0000e-8
false
0
16384
transformer_4x
0
false
loss
false
false
25
false
1
1
false
false
label_smoothed_cross_entropy
false
0
../v2-indic-en-pib/final_bin
10
pytorch_ddp
16
1536
4096
1536
0
6
false
false
1536
0
false
nccl
false
-1
0
4
0.2
Finished
-
gowthamr
3h 45m 37s
-
transformer_4x
0
relu
(0.9, 0.98)
1.0000e-8
false
0
16384
transformer_4x
0
false
loss
false
false
25
false
1
1
false
false
label_smoothed_cross_entropy
false
0
../v2-indic-en/final_bin
10
pytorch_ddp
16
1536
4096
1536
0
6
false
false
1536
0
false
nccl
false
-1
0
4
0.2
Finished
-
gowthamr
1d 2h 2m 45s
-
transformer_4x
0
relu
(0.9, 0.98)
1.0000e-8
false
0
16384
transformer_4x
0
false
loss
false
false
25
false
1
1
false
false
label_smoothed_cross_entropy
false
0
../v2-indic-en/final_bin
10
pytorch_ddp
16
1536
4096
1536
0
6
false
false
1536
0
false
nccl
false
-1
0
4
0.2
1-3
of 3