Evaluations
Filter
inputs
output
DoomerBoomerScorer
is_right_based_on_human_annotation
metrics
match
Trace
Feedback
Status
model
self
accuracy
cohens_kappa
sample_size
true_count
true_fraction
N/A
N/A
N/A
N/A
N/A
N/A
N/A
N/A
N/A
N/A
N/A
N/A
N/A
N/A
N/A
N/A
N/A
N/A
15
0.5556
N/A
N/A
N/A
15
0.5556
N/A
N/A
N/A
15
0.5556
1-12 of 12
Per page:
50