Ptst1110-ee10's workspace
Runs
1
Name
State
Notes
User
Tags
Created
Runtime
Sweep
cli_configs.batch_size
cli_configs.batch_sizes
cli_configs.bootstrap_iters
cli_configs.device
cli_configs.fewshot_seed
cli_configs.model
cli_configs.model_args
cli_configs.model_dtype
cli_configs.model_num_parameters
cli_configs.model_revision
cli_configs.model_sha
cli_configs.numpy_seed
cli_configs.random_seed
cli_configs.torch_seed
task_configs.tmlu_AST_biology.dataset_name
task_configs.tmlu_AST_biology.dataset_path
task_configs.tmlu_AST_biology.doc_to_choice
task_configs.tmlu_AST_biology.doc_to_target
task_configs.tmlu_AST_biology.doc_to_text
task_configs.tmlu_AST_biology.fewshot_config.sampler
task_configs.tmlu_AST_biology.fewshot_delimiter
task_configs.tmlu_AST_biology.fewshot_split
task_configs.tmlu_AST_biology.metadata.version
task_configs.tmlu_AST_biology.metric_list
task_configs.tmlu_AST_biology.num_fewshot
task_configs.tmlu_AST_biology.output_type
task_configs.tmlu_AST_biology.process_docs
task_configs.tmlu_AST_biology.repeats
task_configs.tmlu_AST_biology.should_decontaminate
task_configs.tmlu_AST_biology.tag
task_configs.tmlu_AST_biology.target_delimiter
task_configs.tmlu_AST_biology.task
task_configs.tmlu_AST_biology.task_alias
task_configs.tmlu_AST_biology.test_split
task_configs.tmlu_AST_biology.unsafe_code
task_configs.tmlu_AST_chemistry.dataset_name
task_configs.tmlu_AST_chemistry.dataset_path
task_configs.tmlu_AST_chemistry.doc_to_choice
task_configs.tmlu_AST_chemistry.doc_to_target
task_configs.tmlu_AST_chemistry.doc_to_text
task_configs.tmlu_AST_chemistry.fewshot_config.sampler
task_configs.tmlu_AST_chemistry.fewshot_delimiter
task_configs.tmlu_AST_chemistry.fewshot_split
task_configs.tmlu_AST_chemistry.metadata.version
Finished
-
ptst1110-ee10
3m 27s
-
1
[]
100000
cuda:1
1234
hf
pretrained=benchang1110/Qwen2.5-Taiwan-7B-Instruct
torch.bfloat16
7615616512
main
80a408cdd0f1062e395ec19b2692519770172e2f
1234
0
1234
AST_biology
miulab/tmlu
["A","B","C","D"]
answer
{{question.strip()}}
A. {{choices[0]}}
B. {{choices[1]}}
C. {{choices[2]}}
D. {{choices[3]}}{% if choices is defined and choices|length > 4 %}
E. {{choices[4]}}{% endif %}{% if choices is defined and choices|length > 5 %}
F. {{choices[5]}}{% endif %}
Answer:
first_n
dev
0.1
[{"metric":"acc","aggregation":"mean","higher_is_better":true}]
0
multiple_choice
def process_docs(dataset: datasets.Dataset) -> datasets.Dataset:
    def _helper(doc):
        # modifies the contents of a single
        # document in our dataset.
        answer_list = ["A", "B", "C", "D"]
        choices = [doc["A"], doc["B"], doc["C"], doc["D"]]
        if doc.get("E", None):
            answer_list.append("E")
            choices.append(doc["E"])
        if doc.get("F", None):
            answer_list.append("F")
            choices.append(doc["F"])
        out_doc = {
            "questions": doc["question"],
            "choices": choices,
            "goal": answer_list.index(doc["answer"]),
        }
        return out_doc

    return dataset.map(_helper)  # returns back a datasets.Dataset object
1
false
tmlu_stem_tasks
tmlu_AST_biology
AST biology
test
false
AST_chemistry
miulab/tmlu
["A","B","C","D","E"]
answer
{{question.strip()}}
A. {{choices[0]}}
B. {{choices[1]}}
C. {{choices[2]}}
D. {{choices[3]}}{% if choices is defined and choices|length > 4 %}
E. {{choices[4]}}{% endif %}{% if choices is defined and choices|length > 5 %}
F. {{choices[5]}}{% endif %}
Answer:
first_n
dev
0.1
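The process_docs value and the doc_to_text template recorded in this run can be tried on a single record. The following is a minimal sketch, not the harness's own code path: `helper` repeats the logged `_helper` logic on a plain dict so that it runs without the `datasets` library, `build_prompt` is a hypothetical plain-Python stand-in for the Jinja template, and the sample record is invented for illustration (only the column names `question`, `A` to `F`, and `answer` come from the logged function).

```python
def helper(doc: dict) -> dict:
    """Plain-dict version of the `_helper` logged under process_docs."""
    answer_list = ["A", "B", "C", "D"]
    choices = [doc["A"], doc["B"], doc["C"], doc["D"]]
    # Options E and F are optional; each is added only when present and non-empty.
    for letter in ("E", "F"):
        if doc.get(letter):
            answer_list.append(letter)
            choices.append(doc[letter])
    return {
        "questions": doc["question"],
        "choices": choices,
        # Index of the correct option within `choices`.
        "goal": answer_list.index(doc["answer"]),
    }


def build_prompt(question: str, choices: list) -> str:
    """Hypothetical mirror of the doc_to_text template (up to six options)."""
    lines = [question.strip()]
    for letter, choice in zip("ABCDEF", choices):
        lines.append(f"{letter}. {choice}")
    lines.append("Answer:")
    return "\n".join(lines)


# Invented sample record, for illustration only.
sample = {
    "question": "  Which organelle produces most of a cell's ATP?  ",
    "A": "Nucleus",
    "B": "Mitochondrion",
    "C": "Ribosome",
    "D": "Golgi apparatus",
    "E": "Lysosome",
    "F": None,
    "answer": "B",
}

out = helper(sample)
prompt = build_prompt(sample["question"], out["choices"])
print(prompt)
print("goal index:", out["goal"])
```

In the real task the function is applied through `dataset.map`, which keeps the original columns, so `question` and `answer` stay available to the template and to doc_to_target alongside the new `choices` field.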