Ableal's workspace
Runs: 232
runs.summary["leaderboard_table"] (showing 5 of 54 rows)
| Row | AVG_jaster | EL | FA | MC | MR | NLI | QA | RC | chabsa_set_f1 | jamp_exact_match | janli_exact_match |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 0.1622 | 0.0312 | 0.1318 | 0 | 0 | 0 | 0.1038 | 0.8687 | 0.0312 | 0 | 0 |
| 2 | 0.3266 | 0.3113 | 0.145 | 0 | 0.26 | 0.596 | 0.1283 | 0.8453 | 0.3113 | 0.42 | 0.82 |
| 3 | 0.3109 | 0.2743 | 0.1384 | 0 | 0.14 | 0.598 | 0.1437 | 0.8821 | 0.2743 | 0.47 | 0.85 |
| 4 | 0.4431 | 0.1454 | 0.0606 | 0.79 | 0.59 | 0.29 | 0.3768 | 0.8492 | 0.1454 | 0.05 | 0.44 |
| 5 | 0.3111 | 0 | 0.048 | 0 | 1 | 0.1 | 0.03 | 1 | 0 | 0 | 0.5 |

Other columns (values not captured in this view):
jcommonsenseqa_exact_match, jemhopqa_char_f1, jnli_exact_match, jsem_exact_match, jsick_exact_match, jsquad_char_f1, mawps_exact_match, niilc_char_f1, wiki_coreference_set_f1, wiki_dependency_set_f1, wiki_ner_set_f1, wiki_pas_set_f1, wiki_reading_char_f1,
basemodel_name, model_type, instruction_tuning_method, instruction_tuning_data, num_few_shots, llm-jp-eval-version, data_type, top_p, top_k, temperature, repetition_penalty,
coding, extraction, humanities, math, reasoning, roleplay, stem, writing, AVG_mtbench, AVG
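
In the five rows captured here, AVG_jaster agrees to four decimal places with the unweighted mean of the seven category columns (EL, FA, MC, MR, NLI, QA, RC). The sketch below recomputes that mean from the values in those rows. It is only a consistency check on this view, not a confirmed statement of how the leaderboard defines the score; the helper names `category_mean` and `max_abs_error` are illustrative and do not come from the workspace.

```python
from statistics import mean

# Category columns, in the order they appear in the leaderboard table.
CATEGORIES = ("EL", "FA", "MC", "MR", "NLI", "QA", "RC")

# (reported AVG_jaster, category scores in CATEGORIES order),
# copied from the five captured rows.
ROWS = [
    (0.1622, [0.0312, 0.1318, 0, 0, 0, 0.1038, 0.8687]),
    (0.3266, [0.3113, 0.145, 0, 0.26, 0.596, 0.1283, 0.8453]),
    (0.3109, [0.2743, 0.1384, 0, 0.14, 0.598, 0.1437, 0.8821]),
    (0.4431, [0.1454, 0.0606, 0.79, 0.59, 0.29, 0.3768, 0.8492]),
    (0.3111, [0, 0.048, 0, 1, 0.1, 0.03, 1]),
]


def category_mean(scores):
    """Unweighted mean of the category scores (assumed aggregation)."""
    return mean(scores)


def max_abs_error(rows):
    """Largest gap between the reported and the recomputed average."""
    return max(abs(category_mean(scores) - reported) for reported, scores in rows)


# Print the reported and recomputed values side by side for each row.
for index, (reported, scores) in enumerate(ROWS, start=1):
    print(f"row {index}: reported={reported:.4f} recomputed={category_mean(scores):.4f}")
```

The same check cannot be run for the individual categories (for example NLI against its per-dataset scores) because most per-dataset columns are not captured in this view.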