Skip to main content
simplebench
Projects
simple_bench_public
Object-versions
Log in
Sign up
Project
Traces
Evals
Playground
Monitors
Leaders
Threads
Assets
Assets
All assets
Prompts
Ops
Models
Datasets
Scorers
All assets
Asset
Category
User
Last updated
Versions
EvaluationResults:v8046
EvaluationResults
Anthony Young
1 week ago
99+ versions
LiteLLMModel:v4847
Model
Anthony Young
1 week ago
99+ versions
LiteLLMModel.predict:v223
Ops
Anthony Young
1 week ago
99+ versions
extract_answer:v24
Ops
Anthony Young
1 week ago
25 versions
openai.chat.completions.create:v12
Ops
Anthony Young
1 week ago
13 versions
Evaluation:v51
Evaluation
Anthony Young
1 week ago
52 versions
Evaluation.predict_and_score:v3
Ops
Anthony Young
1 week ago
4 versions
Evaluation.evaluate:v6
Ops
Anthony Young
1 week ago
7 versions
Evaluation.summarize:v3
Ops
Anthony Young
1 week ago
4 versions
eval_multi_choice:v42
Ops
Anthony Young
1 week ago
43 versions
Dataset:v25
Dataset
Anthony Young
1 week ago
26 versions
litellm.acompletion:v59
Ops
Gareth Jones
2 months ago
60 versions
eval_multi_choice_confidence:v0
Ops
Simon McCallum
4 months ago
1 version
competition_dataset-evaluation:v36
Evaluation
Agata Mlynarczyk
6 months ago
37 versions
MajorityVoteModel:v113
Model
pepe
7 months ago
99+ versions
MajorityVoteModel.predict:v1
Ops
pepe
7 months ago
2 versions
eval_majority_vote:v2
Ops
pepe
7 months ago
3 versions
debug_scorer:v1
Ops
Josh Wand
8 months ago
2 versions