
Changma's workspace

summary: 4
summary/agent_abilities
summary/avg_metrics_comparison
[Figure: Average Metrics for All Tasks Compared to Baseline Models. Legend: Gpt-4, Current Run, Codellama-34b, Vicuna-13b-16k. Metrics: Progress Rate (%), Success Rate (%).]
Average All Progress Rate: 0.3848
Average All Success Rate: 0.1721
Metric Name
Metric Value (%)
7
8
summary/all_results
tool-operation: 6
tool-query: 6