Skip to main content
agentboard
Projects
llm-agent-eval-llama2-13b-all
Log in
Sign up
Project
Workspace
Runs
Automat.
Sweeps
Reports
Artifacts
Changma's workspace
Personal workspace
Automated workspace
Changes are only visible to you.
Runs
3
Name
1 visualized
vllm_meta-llama/Llama-2-13b-chat-hf
vllm_meta-llama/Llama-2-13b-chat-hf
vllm_meta-llama/Llama-2-13b-chat-hf
vllm_meta-llama/Llama-2-13b-chat-hf
vllm_meta-llama/Llama-2-13b-chat-hf
vllm_meta-llama/Llama-2-13b-chat-hf
1-3
of 3
jericho/task_reward_w.r.t_steps
0
5
10
15
20
25
30
0
10
20
30
40
50
Model Name, Is Baseline
Current Run, False
gpt-35-turbo, True
text-davinci-003, True
llama2-70b, True
gpt-4, True
lemur-70b, True
codellama-13b, True
codellama-34b, True
Average Progress Rate (%) w.r.t Steps for jericho Tasks
steps
score
plotly-logomark
Previous
Next