Evaluations
Filter
inputs
output
get_answer_correctness
answer_correctness
Trace
Feedback
Status
model
self
true_count
true_fraction
186
0.6327
249
0.8469
250
0.8503
1-42 of 42
Per page:
50
Charts
3
Score summary
3
General
Cost
$1.21
↗+ $1.19
Tokens
139K
↗+ 136.66K
Latency
1m58s
↗+ 1m55s
model_output
total_tokens.mean
9.21K
↗+ 9.21K
prompt_tokens.mean
8.49K
↗+ 8.49K
completion_tokens.mean
723.5
↗+ 723.5
time_taken.mean
87.31
↗+ 87.31
get_answer_correctness
answer_correctness.true_count
2
↗+ 2
answer_correctness.true_fraction
1
↗+ 1
model_latency
mean
93.76
↗+ 91.01