Skip to main content
c-metrics
Projects
rouge-scorer
Evaluation-definitions
Log in
Sign up
Project
Traces
Evals
Playground
Monitors
Leaders
Threads
Assets
Assets
All Assets
Models
Datasets
Prompts
Scorers
Evaluations
Ops
Other
Evaluations
Evaluation
Category
Last updated
Versions
Evaluation:v3
Evaluation
11 months ago
4 versions
longbench_gov_report_subset-evaluation:v1
Evaluation
11 months ago
2 versions