Skip to main content
c-metrics
Projects
bleu-scorer
Evaluations
Log in
Sign up
Project
Traces
Evals
Playground
Monitors
Leaders
Threads
Assets
Evaluations
Compare
Select an eval
Select a dataset
Filter
Visualize
Columns
inputs
output
BLEUScorer
model_latency
corpus_level
sentence_level
Trace
Feedback
Status
model
self
bleu
brevity_penalty
bleu
mean
User
Called
Tokens
TruthfulQA-BLEU
3ea4
SimpleModel:v1
Evaluation:v1
6.5673
1
49.0077
0.0539
11 months ago
0