Preetham-gali's group workspace
Group: 2MDEso9ZZqQZHYCJKjxrVM_8crlh237
Name
1 visualized
State
Notes
User
Tags
Created
Runtime
Sweep
activation
adlr_autoresume
adlr_autoresume_interval
alpha_kld
alpha_lm
alpha_mse
apply_query_key_layer_scaling
apply_residual_connection_post_layernorm
attention_config
attention_dropout
attention_softmax_in_fp32
batch_size
bias_dropout_fusion
bias_gelu_fusion
checkpoint_activations
checkpoint_in_cpu
checkpoint_num_layers
checkpoint_validation_with_forward_pass
clip_grad
contiguous_checkpointing
data_impl
data_path
deepscale
deepspeed
deepspeed_activation_checkpointing
deepspeed_mpi
detect_nvlink_pairs
distributed_backend
do_distillation
dump_state
dynamic_loss_scale
eod_mask_loss
eval_interval
eval_iters
eval_results_prefix
finetune
fp16.enabled
fp16.hysteresis
fp16.loss_scale
fp16.loss_scale_window
fp16.min_loss_scale
fp16.type
fp16_lm_cross_entropy
fp32_allreduce
Finished
-
shivanshupurohit
5s
-
gelu
false
1000
1
1
0
false
-
-
0
false
3
false
false
true
false
1
false
1
false
mmap
/mnt/ssd-1/data/pile/pile_text_document
false
true
true
false
false
nccl
true
false
true
false
1000
10
true
true
2
0
1000
1
-
false
false
1-1
of 1