LLM-rated Answer Correctness (avg) (24/07/18 17:41:02)
Nicolas Remerscheid
Created on July 18 | Last edited on September 6
[Figure: LLM-rated Answer Correctness (avg). Runs shown: eager-spaceship-37, wise-salad-38, young-sweep-5, vital-sweep-20, dark-sweep-9. Axis ticks run from 0 to 100.]
Run set: 45