Skip to main content
jnainani-university-of-massachusetts-amherst
Projects
acausal_cc_test2
Workspace
Log in
Sign up
Overview
Workspace
Runs
Automat.
Sweeps
Reports
Artifacts
Tim_hua's workspace
Personal workspace
Automated workspace
Changes are only visible to you.
Runs
22
Name
15 visualized
l1_lambda_2_padding_buffer_1k_refil_0.25_2025-05-05_11-39-39
l1_lambda_2_padding_buffer_1k_refil_0.25_2025-05-05_11-39-39
k120_continuous_buffer_new_padding_buffer_1k_refil_0.25_2025-05-05_11-37-06
k120_continuous_buffer_new_padding_buffer_1k_refil_0.25_2025-05-05_11-37-06
k120_continuous_buffer_new_padding_logic_2025-05-01_21-00-14
k120_continuous_buffer_new_padding_logic_2025-05-01_21-00-14
k120_continuous_buffer_seed_1052025_2025-05-01_13-02-56
k120_continuous_buffer_seed_1052025_2025-05-01_13-02-56
k120_grad_clip_10_dec_init_5e2_lr_1e5_warmup_5e2_2025-05-01_12-58-24
k120_grad_clip_10_dec_init_5e2_lr_1e5_warmup_5e2_2025-05-01_12-58-24
k120_continuous_buffer_filter_1st_token_2025-04-27_18-43-37
k120_continuous_buffer_filter_1st_token_2025-04-27_18-43-37
k120_grad_clip_warmup_1e1_2025-04-22_19-50-34
k120_grad_clip_warmup_1e1_2025-04-22_19-50-34
k120_grad_clip_lr_5e5_warmup_5e2_2025-04-22_19-34-41
k120_grad_clip_lr_5e5_warmup_5e2_2025-04-22_19-34-41
k120_grad_clip_dec_init_3e2_lr_5e5_warmup_5e2_2025-04-22_19-33-57
k120_grad_clip_dec_init_3e2_lr_5e5_warmup_5e2_2025-04-22_19-33-57
k120_grad_clip_1_2025-04-22_15-14-22
k120_grad_clip_1_2025-04-22_15-14-22
k120_dec_init_1_2025-04-22_15-12-28
k120_dec_init_1_2025-04-22_15-12-28
k120_continuous_optimized_buffer_2025-04-21_09-36-58
k120_continuous_optimized_buffer_2025-04-21_09-36-58
l1_lambda_2_2025-04-20_22-24-33
l1_lambda_2_2025-04-20_22-24-33
l1_lambda_5_2025-04-20_19-08-35
l1_lambda_5_2025-04-20_19-08-35
batchtop_k120_continuous_buffer
batchtop_k120_continuous_buffer
batchtop_k80_2025-04-13_17-58-47
batchtop_k80_2025-04-13_17-58-47
k120_2025-04-13_17-58-20
k120_2025-04-13_17-58-20
k120_lr_warmup_too_much
k120_lr_warmup_too_much
k80_lr_warmup_too_much
k80_lr_warmup_too_much
with_bigger_buffer_2025-04-13_17-03-07
with_bigger_buffer_2025-04-13_17-03-07
1-20
of 22
train/sparsity_loss
train/sparsity_loss
0
10k
20k
30k
40k
50k
60k
Step
2000
4000
6000
l1_lambda_2_padding_buffer_1k_refil_0.25_2025-05-05_11-39-39
l1_lambda_2_2025-04-20_22-24-33
l1_lambda_5_2025-04-20_19-08-35
Previous
Next