Skip to main content
simplebench
Projects
simple_bench_comp_dataset
Object-versions
Log in
Sign up
Project
Traces
Evals
Playground
Monitors
Leaders
Threads
Assets
Assets
All assets
Prompts
Ops
Models
Datasets
Scorers
All assets
Asset
Category
User
Last updated
Versions
extract_answer:v0
Ops
Jonas Zabel
9 months ago
1 version
openai.chat.completions.create:v0
Ops
Jonas Zabel
9 months ago
1 version
LiteLLMModel:v8
Model
Jonas Zabel
9 months ago
9 versions
competition_dataset-evaluation:v0
Evaluation
Jonas Zabel
9 months ago
1 version
LiteLLMModel.predict:v2
Ops
Jonas Zabel
9 months ago
3 versions
eval_multi_choice:v0
Ops
Jonas Zabel
9 months ago
1 version
Evaluation.predict_and_score:v0
Ops
Jonas Zabel
9 months ago
1 version
Evaluation.summarize:v0
Ops
Jonas Zabel
9 months ago
1 version
Evaluation.evaluate:v0
Ops
Jonas Zabel
9 months ago
1 version
EvaluationResults:v12
EvaluationResults
Jonas Zabel
9 months ago
13 versions
simple_bench_public-evaluation:v0
Evaluation
Jonas Zabel
9 months ago
1 version
simple_bench_competition_hard_set-evaluation:v0
Evaluation
Jonas Zabel
9 months ago
1 version
competition_dataset:v0
Dataset
9 months ago
1 version
simple_bench_public:v0
Dataset
9 months ago
1 version
simple_bench_competition_hard_set:v0
Dataset
9 months ago
1 version