Andreas122001's workspace
Runs
248
Name
6 visualized
dataset: null
dataset: null
4
6
dataset: wiki_labeled
dataset: wiki_labeled
4
15
dataset: research_abstracts_labeled
dataset: research_abstracts_labeled
4
25
State
Notes
User
Tags
Created
Runtime
Sweep
_name_or_path
adafactor
adam_beta1
adam_beta2
adam_epsilon
add_cross_attention
architectures
attention_probs_dropout_prob
auto_find_batch_size
base_model
bf16
bf16_full_eval
bos_token_id
chunk_size_feed_forward
data_seed
dataloader_drop_last
dataloader_num_workers
dataloader_pin_memory
dataset
ddp_bucket_cap_mb
ddp_find_unused_parameters
ddp_timeout
debug
deepspeed
disable_tqdm
diversity_penalty
do_eval
do_predict
do_sample
do_train
early_stopping
encoder_no_repeat_ngram_size
eos_token_id
eval_accumulation_steps
eval_batch_size
eval_dataset_size
eval_delay
eval_steps
evaluation_strategy
fp16
fp16_backend
fp16_full_eval
fp16_opt_level
Failed
-
andreas122001
11h 46m 7s
-
["bigscience/bloomz-1b7","bigscience/bloomz-3b","bigscience/bloomz-560m","roberta-base"]
false
0.9
0.999
1.0000e-8
false
["BloomForCausalLM","RobertaForMaskedLM"]
0.1
true
["bigscience/bloomz-1b7","bigscience/bloomz-3b","bigscience/bloomz-560m","roberta-base"]
false
false
0.66667
0
None
false
0
true
-
None
None
1800
[]
None
false
0
true
false
false
false
false
0
2
None
8
3750
0
43
steps
false
auto
false
O1
Finished
-
andreas122001
12d 10h 17m 17s
-
["bigscience/bloomz-1b7","bigscience/bloomz-3b","bigscience/bloomz-560m","roberta-base"]
false
0.9
0.999
1.0000e-8
false
["BloomForCausalLM","RobertaForMaskedLM"]
0.1
true
["bigscience/bloomz-1b7","bigscience/bloomz-3b","bigscience/bloomz-560m","roberta-base"]
false
false
0.73333
0
None
false
0
true
wiki_labeled
None
None
1800
[]
None
false
0
true
false
false
false
false
0
2
None
8
4500
0
48.66667
steps
false
auto
false
O1
Finished
-
andreas122001
12d 3h 56m 31s
-
["bigscience/bloomz-1b7","bigscience/bloomz-3b","bigscience/bloomz-560m","roberta-base"]
false
0.9
0.999
1.0000e-8
false
["BloomForCausalLM","RobertaForMaskedLM"]
0.1
true
["bigscience/bloomz-1b7","bigscience/bloomz-3b","bigscience/bloomz-560m","roberta-base"]
false
false
0.8
0
None
false
0
true
research_abstracts_labeled
None
None
1800
[]
None
false
0
true
false
false
false
false
0
2
None
8
3000
0
33.64
steps
false
auto
false
O1
1-3
of 3