Skip to main content

Changma's workspace

summary/avg_metrics_comparison
Gpt-4Text-davinci-003Current RunCodellama-34bLemur-70b00.10.20.30.40.50.60.7
Progress Rate (%)Success Rate (%)Average Metrics for All Tasks Compared to Baseline Models