Skip to main content
simplebench
Projects
simple_bench_public
Objects
Log in
Sign up
Overview
Traces
Evals
Playground
Monitors
Leaders
Threads
Assets
Assets
All assets
Prompts
Ops
Models
Datasets
Scorers
All assets
Asset
Category
User
Last updated
Versions
extract_answer:v23
Ops
Ivica Baricevic
1 week ago
24 versions
Evaluation:v50
Evaluation
Ivica Baricevic
1 week ago
51 versions
eval_multi_choice:v41
Ops
Ivica Baricevic
1 week ago
42 versions
EvaluationResults:v8041
EvaluationResults
Ivica Baricevic
1 week ago
99+ versions
LiteLLMModel:v4843
Model
Ivica Baricevic
1 week ago
99+ versions
Evaluation.evaluate:v6
Ops
Ivica Baricevic
1 week ago
7 versions
Evaluation.summarize:v3
Ops
Ivica Baricevic
1 week ago
4 versions
LiteLLMModel.predict:v219
Ops
Ivica Baricevic
1 week ago
99+ versions
Evaluation.predict_and_score:v3
Ops
Ivica Baricevic
1 week ago
4 versions
Dataset:v24
Dataset
Ivica Baricevic
1 week ago
25 versions
openai.chat.completions.create:v11
Ops
Abinash Yadav
1 month ago
12 versions
litellm.acompletion:v59
Ops
Gareth Jones
1 month ago
60 versions
eval_multi_choice_confidence:v0
Ops
Simon McCallum
3 months ago
1 version
competition_dataset-evaluation:v36
Evaluation
Agata Mlynarczyk
5 months ago
37 versions
MajorityVoteModel:v113
Model
pepe
6 months ago
99+ versions
MajorityVoteModel.predict:v1
Ops
pepe
6 months ago
2 versions
eval_majority_vote:v2
Ops
pepe
6 months ago
3 versions
debug_scorer:v1
Ops
Josh Wand
7 months ago
2 versions