Muennighoff's workspace
Runs
292
State
Notes
User
Tags
Created
Runtime
Sweep
adafactor
adam_beta1
adam_beta2
adam_epsilon
auto_find_batch_size
bf16
bf16_full_eval
dataloader_drop_last
dataloader_num_workers
dataloader_pin_memory
ddp_timeout
debug
disable_tqdm
do_eval
do_predict
do_train
eval_delay
evaluation_strategy
fix_position_embedding
fp16
fp16_backend
fp16_full_eval
fp16_opt_level
fsdp
fsdp_config.min_num_params
fsdp_config.xla
fsdp_config.xla_fsdp_grad_ckpt
fsdp_min_num_params
full_determinism
gradient_accumulation_steps
gradient_checkpointing
group_by_length
half_precision_backend
hub_always_push
hub_private_repo
hub_strategy
hub_token
ignore_data_skip
include_inputs_for_metrics
include_tokens_per_second
jit_mode_eval
Finished
-
muennighoff
1d 13h 30m 59s
-
false
0.9
0.999
1.0000e-8
false
true
false
true
0
true
1800
[]
false
false
false
false
0
no
-
false
auto
false
O1
[]
0
false
false
0
false
1
true
false
auto
false
false
every_save
<HUB_TOKEN>
false
false
false
false
Finished
-
muennighoff
1d 12h 56m 40s
-
false
0.9
0.999
1.0000e-8
false
true
false
true
0
true
1800
[]
false
false
false
false
0
no
-
false
auto
false
O1
[]
0
false
false
0
false
1
true
false
auto
false
false
every_save
<HUB_TOKEN>
false
false
false
false
Finished
-
muennighoff
1d 12h 39m 21s
-
false
0.9
0.999
1.0000e-8
false
true
false
true
0
true
1800
[]
false
false
false
false
0
no
-
false
auto
false
O1
[]
0
false
false
0
false
1
true
false
auto
false
false
every_save
<HUB_TOKEN>
false
false
false
false
1-3
of 3