Skip to main content
akshaykalkunte
Projects
slam
Workspace
Log in
Sign up
Overview
Workspace
Runs
Automat.
Sweeps
Reports
Artifacts
Akshaykalkunte's workspace
Personal workspace
Automated workspace
Changes are only visible to you.
Runs
1,208
Name
3 visualized
_mnt_checkpoints_fast_llm_exp_slam_ssm_distill_layer_importance15B_30of50_100stp_apriel_15b_thinker_with_identity_layers_19
_mnt_checkpoints_fast_llm_exp_slam_ssm_distill_layer_importance15B_30of50_100stp_apriel_15b_thinker_with_identity_layers_19
_mnt_checkpoints_fast_llm_exp_slam_ssm_distill_layer_importance15B_30of50_100stp_apriel_15b_thinker_with_identity_layers_18
_mnt_checkpoints_fast_llm_exp_slam_ssm_distill_layer_importance15B_30of50_100stp_apriel_15b_thinker_with_identity_layers_18
_mnt_checkpoints_fast_llm_exp_slam_ssm_distill_layer_importance15B_30of50_100stp_apriel_15b_thinker_with_identity_layers_17
_mnt_checkpoints_fast_llm_exp_slam_ssm_distill_layer_importance15B_30of50_100stp_apriel_15b_thinker_with_identity_layers_17
_mnt_checkpoints_fast_llm_exp_slam_ssm_distill_layer_importance15B_30of50_100stp_apriel_15b_thinker_with_identity_layers_16
_mnt_checkpoints_fast_llm_exp_slam_ssm_distill_layer_importance15B_30of50_100stp_apriel_15b_thinker_with_identity_layers_16
_mnt_checkpoints_fast_llm_exp_slam_ssm_distill_layer_importance15B_30of50_100stp_apriel_15b_thinker_with_identity_layers_15
_mnt_checkpoints_fast_llm_exp_slam_ssm_distill_layer_importance15B_30of50_100stp_apriel_15b_thinker_with_identity_layers_15
_mnt_checkpoints_fast_llm_exp_slam_ssm_distill_layer_importance15B_30of50_100stp_apriel_15b_thinker_with_identity_layers_14
_mnt_checkpoints_fast_llm_exp_slam_ssm_distill_layer_importance15B_30of50_100stp_apriel_15b_thinker_with_identity_layers_14
_mnt_checkpoints_fast_llm_exp_slam_ssm_distill_layer_importance15B_30of50_100stp_apriel_15b_thinker_with_identity_layers_13
_mnt_checkpoints_fast_llm_exp_slam_ssm_distill_layer_importance15B_30of50_100stp_apriel_15b_thinker_with_identity_layers_13
_mnt_checkpoints_fast_llm_exp_slam_ssm_distill_layer_importance15B_30of50_100stp_apriel_15b_thinker_with_identity_layers_12
_mnt_checkpoints_fast_llm_exp_slam_ssm_distill_layer_importance15B_30of50_100stp_apriel_15b_thinker_with_identity_layers_12
_mnt_checkpoints_fast_llm_exp_slam_ssm_distill_layer_importance15B_30of50_100stp_apriel_15b_thinker_with_identity_layers_11
_mnt_checkpoints_fast_llm_exp_slam_ssm_distill_layer_importance15B_30of50_100stp_apriel_15b_thinker_with_identity_layers_11
_mnt_checkpoints_fast_llm_exp_slam_ssm_distill_layer_importance15B_30of50_100stp_apriel_15b_thinker_with_identity_layers_10
_mnt_checkpoints_fast_llm_exp_slam_ssm_distill_layer_importance15B_30of50_100stp_apriel_15b_thinker_with_identity_layers_10
_mnt_checkpoints_fast_llm_exp_slam_ssm_distill_layer_importance15B_30of50_100stp_apriel_15b_thinker_with_identity_layers_9
_mnt_checkpoints_fast_llm_exp_slam_ssm_distill_layer_importance15B_30of50_100stp_apriel_15b_thinker_with_identity_layers_9
_mnt_checkpoints_fast_llm_exp_slam_ssm_distill_layer_importance15B_30of50_100stp_apriel_15b_thinker_with_identity_layers_4
_mnt_checkpoints_fast_llm_exp_slam_ssm_distill_layer_importance15B_30of50_100stp_apriel_15b_thinker_with_identity_layers_4
_mnt_checkpoints_fast_llm_exp_slam_ssm_distill_layer_importance15B_30of50_100stp_apriel_15b_thinker_with_identity_layers_2
_mnt_checkpoints_fast_llm_exp_slam_ssm_distill_layer_importance15B_30of50_100stp_apriel_15b_thinker_with_identity_layers_2
_mnt_checkpoints_fast_llm_exp_slam_ssm_distill_layer_importance15B_30of50_100stp_apriel_15b_thinker_with_identity_layers_1
_mnt_checkpoints_fast_llm_exp_slam_ssm_distill_layer_importance15B_30of50_100stp_apriel_15b_thinker_with_identity_layers_1
_mnt_checkpoints_fast_llm_exp_slam_ssm_distill_layer_importance15B_30of50_100stp_apriel_15b_thinker_with_identity_layers_0
_mnt_checkpoints_fast_llm_exp_slam_ssm_distill_layer_importance15B_30of50_100stp_apriel_15b_thinker_with_identity_layers_0
h30distsftvrlm2f16420k
h30distsftvrlm2f16420k
test-vision-ssm-distillation-24ssm-init-mil-04-unfreeze
test-vision-ssm-distillation-24ssm-init-mil-04-unfreeze
test-vision-ssm-distillation-24ssm-init-mil-04-unfreeze-prev-checkpoint
test-vision-ssm-distillation-24ssm-init-mil-04-unfreeze-prev-checkpoint
_mnt_checkpoints_fast_llm_exp_slam_ssm_distill_layer_importance15B_0of50_100stp_apriel_15b_thinker_with_identity_layers_46
_mnt_checkpoints_fast_llm_exp_slam_ssm_distill_layer_importance15B_0of50_100stp_apriel_15b_thinker_with_identity_layers_46
test-vision-ssm-distillation-24ssm-init-mil-04-unfreeze-prev-exp
test-vision-ssm-distillation-24ssm-init-mil-04-unfreeze-prev-exp
1-20
of 1,208
Add panels
Charts
25
1-6 of 25
Training.train_iters
Training.train_iters
50
100
150
Step
0
20000
40000
60000
80000
100000
Training.tokens_per_sec_per_gpu
Training.tokens_per_sec_per_gpu
50
100
150
Step
8000
10000
12000
14000
16000
Training.step_time_ms
Training.step_time_ms
50
100
150
Step
1500
2000
2500
3000
Training.step_time_average_ms
Training.step_time_average_ms
50
100
150
Step
1500
2000
2500
3000
Training.skipped_iters
Training.skipped_iters
50
100
150
Step
-2
-1
0
1
2
Training.run
Training.run
50
100
150
Step
0
0.2
0.4
0.6
0.8
1
System
31
1-6 of 31
Add section