Skip to main content
a-sh0ts
Projects
eval_course_ch1_dev
Evaluations
Log in
Sign up
Overview
Traces
Evals
Playground
Monitors
Leaders
Threads
Assets
Evaluations
Filter
inputs
output
judge_adheres_to_privacy_guidelines
judge_overall_score
contains_pii
Trace
Feedback
Status
model
self
true_count
true_fraction
true_count
true_fraction
mean
Evaluation.evaluate
04a2
annotated_data_passthrough:v1
Evaluation:v8
N/A
N/A
4
0.8
0.6
Evaluation.evaluate
09af
annotated_data_passthrough:v1
Evaluation:v7
N/A
N/A
N/A
N/A
N/A
Evaluation.evaluate
35b5
annotated_data_passthrough:v1
Evaluation:v6
N/A
N/A
N/A
N/A
N/A
Evaluation.evaluate
dd53
annotated_data_passthrough:v1
Evaluation:v5
N/A
N/A
1
0.2
0
Evaluation.evaluate
1069
annotated_data_passthrough:v1
Evaluation:v4
N/A
N/A
4
0.8
0
Evaluation.evaluate
2c9b
annotated_data_passthrough:v1
Evaluation:v3
5
1
N/A
N/A
N/A
Evaluation.evaluate
3609
annotated_data_passthrough:v1
Evaluation:v2
N/A
N/A
N/A
N/A
N/A
Evaluation.evaluate
4a76
annotated_data_passthrough:v1
Evaluation:v1
N/A
N/A
N/A
N/A
N/A
Evaluation.evaluate
3d8b
annotated_data_passthrough:v1
Evaluation:v0
N/A
N/A
N/A
N/A
N/A
Evaluation.evaluate
59d4
annotated_data_passthrough:v0
Evaluation:v0
N/A
N/A
N/A
N/A
N/A