A0970601776's workspace
Runs: 11
Tags:
H100*8, adamw_torch_fused, batch_5, cutoff_len_1024, epochs_3, grad_acc_4, lr_5e-6, warmup_ratio_0.01, z3
H100*8, adamw_torch_fused, batch_4, cutoff_len_1024, epochs_60, grad_acc_4, lr_5e-6, warmup_ratio_0.01, z3
H100*8, adamw_torch_fused, batch_4, cutoff_len_1024, epochs_3, grad_acc_4, lr_5e-6, warmup_ratio_0.01, z3
H100*8, adamw_torch_fused, batch_4, cutoff_len_1024, epochs_3, grad_acc_4, lr_5e-6, warmup_ratio_0.01, z2
H100*8, adamw_torch_fused, batch_18, cutoff_len_1024, epochs_3, grad_acc_82, lr_5e-6, warmup_ratio_0.01, z2
H100*8, adamw_torch_fused, batch_18, cutoff_len_1024, epochs_3, grad_acc_82, lr_5e-6, warmup_ratio_0.01, z2
H100*8, adamw_torch_fused, batch_18, cutoff_len_1024, epochs_3, grad_acc_82, lr_5e-6, warmup_ratio_0.01, z2
H100*8, adamw_torch_fused, batch_18, cutoff_len_1024, epochs_10, grad_acc_82, lr_5e-6, warmup_ratio_0.01, z2
H100*8, adamw_torch_fused, batch_18, cutoff_len_1024, epochs_10, grad_acc_4, lr_5e-6, warmup_ratio_0.01, z2
H100*8, adamw_torch_fused, batch_18, cutoff_len_1024, epochs_10, grad_acc_2, lr_5e-6, warmup_ratio_0.01, z2
H100*8, adamw_torch_fused, batch_3, cutoff_len_1024, epochs_10, grad_acc_2, lr_5e-6, warmup_ratio_0.01, z2