Skip to main content
a-sh0ts
Projects
google-abstract-summarization
Evaluations
Log in
Sign up
Overview
Traces
Evals
Playground
Monitors
Leaders
Threads
Assets
Evaluations
Filter
inputs
output
bert_scorer
compression_scorer
coverage_scorer
gemini_scorer
model_latency
bert_score
compression_ratio
coverage_score
gemini_score
Trace
Feedback
Status
model
self
mean
mean
mean
mean
mean
eval-2025-04-10-daring-star
b418
GeminiPro2_5_Model:v0
Evaluation:v1
0
0.9524
0.1959
4.7
49.2754
eval-2025-04-10-optimistic-stream
5315
Gemini1_5ProModel:v0
Evaluation:v1
0
0.9441
0.2018
4.6
48.4765
eval-2025-04-10-eloquent-bird
a932
GeminiFlash2Model:v0
Evaluation:v0
0
0.9368
0.2052
4.6
53.816
eval-2025-04-10-proud-rain
c182
GeminiFlash_1_5_Model:v0
Evaluation:v0
0
0.8967
0.1915
4.5
47.4638