Evaluations
Filter
inputs
output
model_latency
response_scorer
bleu
correctness
correct
extras
label
Trace
Feedback
Status
model
self
mean
mean
true_count
true_fraction
mean
85.6981
0.1116
4
0.5
0.5
27.882
0.0793
3
0.3333
0.3333
36.4483
0.0795
3
0.3333
0.3333
1-12 of 12
Per page:
50