Venetispall's workspace
Runs
3
State
Notes
User
Tags
Created
Runtime
Sweep
_name_or_path
accelerator_config.even_batches
accelerator_config.split_batches
accelerator_config.use_seedable_sampler
adafactor
adam_beta1
adam_beta2
adam_epsilon
add_cross_attention
architectures
attention_bias
attention_dropout
auto_find_batch_size
bench_dataset
bench_source_max_len
bench_split
bf16
bf16_full_eval
bos_token_id
chunk_size_feed_forward
dataloader_drop_last
dataloader_num_workers
dataloader_persistent_workers
dataloader_pin_memory
ddp_timeout
debug
disable_tqdm
diversity_penalty
do_bench_eval
do_causal_lm_eval
do_eval
do_predict
do_sample
do_train
early_stopping
encoder_no_repeat_ngram_size
eos_token_id
eval_accumulation_steps
eval_delay
eval_do_concat_batches
eval_sample_packing
eval_steps
evaluation_strategy
fp16
Finished
-
venetispall
1h 55m 9s
-
meta-llama/Meta-Llama-3-8B
true
false
true
false
0.9
0.999
1.0000e-8
false
["LlamaForCausalLM"]
false
0
false
pharaouk/dharma-1/dharma_1_mini.json
2048
eval
true
false
128000
0
false
0
false
true
1800
[]
false
0
false
false
true
false
false
false
false
0
128256
4
0
true
true
0.0625
steps
false
Finished
-
venetispall
1h 54m 38s
-
meta-llama/Meta-Llama-3-8B
true
false
true
false
0.9
0.999
1.0000e-8
false
["LlamaForCausalLM"]
false
0
false
pharaouk/dharma-1/dharma_1_mini.json
2048
eval
true
false
128000
0
false
0
false
true
1800
[]
false
0
false
false
true
false
false
false
false
0
128256
4
0
true
true
0.0625
steps
false
Finished
-
venetispall
1h 38m 12s
-
meta-llama/Meta-Llama-3-8B
true
false
true
false
0.9
0.999
1.0000e-8
false
["LlamaForCausalLM"]
false
0
false
pharaouk/dharma-1/dharma_1_mini.json
2048
eval
true
false
128000
0
false
0
false
true
1800
[]
false
0
false
false
true
false
false
false
false
0
128001
4
0
true
true
0.0625
steps
false
1-3
of 3