Skip to main content
nicolas-remerscheid
Projects
eval-llm-apps
Reports
LLM-rated Answer Correctness (avg) (24/07/17 12:32:55)
Log in
Sign up
Share
Comment
Star
LLM-rated Answer Correctness (avg) (24/07/17 12:32:55)
Nicolas Remerscheid
Created on July 17
|
Last edited on July 17
Comment
LLM-rated Answer Correctness (avg)
LLM-rated Answer Correctness (avg)
eager-spaceship-37
wise-salad-38
young-sweep-5
vital-sweep-20
dark-sweep-9
0
10
20
30
40
50
60
70
80
90
100
Run set
45
Add a comment
90