Evaluations
inputs
output
model_latency
response_scorer
bleu
correctness
correct
extras
label
Trace
Feedback
Status
model
self
mean     mean    true_count  true_fraction  mean
56.2023  0.1119  4           0.5            0.5
9.4036   0.0819  4           0.4444         0.4444
12.377   0.0948  4           0.4444         0.4444