Skip to main content
c-metrics
Projects
coherence_scorer
Evaluations
Log in
Sign up
Overview
Models
Workspace
Runs
More
Weave
Traces
Evals
Playground
Monitors
Assets
More
Evaluations
Filter
inputs
output
accuracy
F1Score
Trace
Feedback
Status
model
self
true_count
true_fraction
f1
precision
recall
Dataset Baseline
b2f8
DatasetCoherenceScorerModel:v0
Evaluation:v1
250
1
1
1
1
OpenAI Coherence Scorer
b39e
OpenAICoherenceScorerModel:v0
Evaluation:v1
184
0.736
0.7626
0.6928
0.848
Wandb Coherence Scorer
6d70
WandbCoherenceScorerModel:v0
Evaluation:v1
169
0.676
0.7492
0.6111
0.968
Dataset Baseline
c6c0
DatasetCoherenceScorerModel:v0
Evaluation:v0
120
1
1
1
1
OpenAI Coherence Scorer
d477
OpenAICoherenceScorerModel:v0
Evaluation:v0
103
0.8583
0.9179
0.9406
0.8962
Wandb Coherence Scorer
99f6
WandbCoherenceScorerModel:v0
Evaluation:v0
110
0.9167
0.9545
0.9211
0.9906