Changma's workspace

jericho/metrics_comparison
[Figure: Jericho Metrics Compared to Baseline Models. Metrics: Progress Rate (%), Success Rate (%), Grounding Accuracy (%). Series: gpt-4, text-davinci-003, gpt-35-turbo, codellama-34b, lemur-70b, Current Run, llama2-70b, codellama-13b. Axis scale: 0 to 1.]