Ableal's workspace
Runs
232
Name
232 visualized
1-20
of 232Charts
2
Tables
8
runs.summary["leaderboard_table"]
- 7 of 54
AVG_jaster
EL
FA
MC
MR
NLI
QA
RC
chabsa_set_f1
jamp_exact_match
janli_exact_match
jcommonsenseqa_exact_match
jemhopqa_char_f1
jnli_exact_match
jsem_exact_match
jsick_exact_match
jsquad_char_f1
mawps_exact_match
niilc_char_f1
wiki_coreference_set_f1
wiki_dependency_set_f1
wiki_ner_set_f1
wiki_pas_set_f1
wiki_reading_char_f1
basemodel_name
model_type
instruction_tuning_method
instruction_tuning_data
num_few_shots
llm-jp-eval-version
data_type
top_p
top_k
temperature
repetition_penalty
coding
extraction
humanities
math
reasoning
roleplay
stem
writing
AVG_mtbench
AVG
1
2
3
4
5
6
7