Chuanli11's workspace
Runs
3
Name
3 visualized
State
Notes
User
Tags
Created
Runtime
Sweep
_name_or_path
adafactor
adam_beta1
adam_beta2
adam_epsilon
add_cross_attention
architectures
auto_find_batch_size
bf16
bf16_full_eval
bos_token_id
chunk_size_feed_forward
data_seed
dataloader_drop_last
dataloader_num_workers
dataloader_pin_memory
ddp_bucket_cap_mb
ddp_find_unused_parameters
ddp_timeout
debug
deepspeed
disable_tqdm
diversity_penalty
do_eval
do_predict
do_sample
do_train
early_stopping
encoder_no_repeat_ngram_size
eos_token_id
eval_accumulation_steps
eval_batch_size
eval_delay
eval_steps
evaluation_strategy
fp16
fp16_backend
fp16_full_eval
fp16_opt_level
fsdp
fsdp_min_num_params
fsdp_transformer_layer_cls_to_wrap
full_determinism
gradient_accumulation_steps
Finished
chuanli11
18h 27m 17s
-
EleutherAI/pythia-12b-deduped
false
0.9
0.999
1.0000e-8
false
["GPTNeoXForCausalLM"]
false
false
false
0
0
None
false
0
true
None
None
1800
[]
./src/configs/deepspeed/8xA100_80GB_pythia-12b-deduped.json
false
0
true
false
false
false
false
0
0
2
4
0
500
steps
true
auto
false
O1
[]
0
None
false
1
Finished
chuanli11
18h 8m 23s
-
EleutherAI/pythia-12b-deduped
false
0.9
0.999
1.0000e-8
false
["GPTNeoXForCausalLM"]
false
false
false
0
0
None
false
0
true
None
None
1800
[]
./src/configs/deepspeed/8xA100_80GB_pythia-12b-deduped.json
false
0
true
false
false
false
false
0
0
2
4
0
500
steps
true
auto
false
O1
[]
0
None
false
1
Finished
chuanli11
17h 40m 58s
-
EleutherAI/pythia-12b-deduped
false
0.9
0.999
1.0000e-8
false
["GPTNeoXForCausalLM"]
false
false
false
0
0
None
false
0
true
None
None
1800
[]
./src/configs/deepspeed/8xA100_80GB_pythia-12b-deduped.json
false
0
true
false
false
false
false
0
0
2
4
0
500
steps
true
auto
false
O1
[]
0
None
false
1
1-3
of 3