Skip to main content
simplebench
Projects
simple_bench_comp_dataset
Objects
Log in
Sign up
Project
Traces
Evals
Playground
Monitors
Leaders
Threads
Assets
Assets
All assets
Prompts
Ops
Models
Datasets
Scorers
All assets
Asset
Category
User
Last updated
Versions
extract_answer:v0
Ops
Jonas Zabel
8 months ago
1 version
openai.chat.completions.create:v0
Ops
Jonas Zabel
8 months ago
1 version
LiteLLMModel:v8
Model
Jonas Zabel
8 months ago
9 versions
competition_dataset-evaluation:v0
Evaluation
Jonas Zabel
8 months ago
1 version
LiteLLMModel.predict:v2
Ops
Jonas Zabel
8 months ago
3 versions
eval_multi_choice:v0
Ops
Jonas Zabel
8 months ago
1 version
Evaluation.predict_and_score:v0
Ops
Jonas Zabel
8 months ago
1 version
Evaluation.summarize:v0
Ops
Jonas Zabel
8 months ago
1 version
Evaluation.evaluate:v0
Ops
Jonas Zabel
8 months ago
1 version
EvaluationResults:v12
EvaluationResults
Jonas Zabel
8 months ago
13 versions
simple_bench_public-evaluation:v0
Evaluation
Jonas Zabel
8 months ago
1 version
simple_bench_competition_hard_set-evaluation:v0
Evaluation
Jonas Zabel
8 months ago
1 version
competition_dataset:v0
Dataset
9 months ago
1 version
simple_bench_public:v0
Dataset
9 months ago
1 version
simple_bench_competition_hard_set:v0
Dataset
9 months ago
1 version