Skip to main content
simplebench
Projects
simple_bench_public
Operations
Log in
Sign up
Overview
Traces
Evals
Playground
Monitors
Leaders
Threads
Assets
Assets
All assets
Prompts
Ops
Models
Datasets
Scorers
Operations
Op
Calls
User
Last updated
Versions
extract_answer:v23
2 calls
Ivica Baricevic
6 days ago
24 versions
eval_multi_choice:v41
2 calls
Ivica Baricevic
6 days ago
42 versions
Evaluation.evaluate:v6
4 calls
Ivica Baricevic
6 days ago
7 versions
Evaluation.summarize:v3
3 calls
Ivica Baricevic
6 days ago
4 versions
LiteLLMModel.predict:v219
40 calls
Ivica Baricevic
6 days ago
99+ versions
Evaluation.predict_and_score:v3
40 calls
Ivica Baricevic
6 days ago
4 versions
openai.chat.completions.create:v11
10 calls
Abinash Yadav
1 month ago
12 versions
litellm.acompletion:v59
30 calls
Gareth Jones
1 month ago
60 versions
eval_multi_choice_confidence:v0
20 calls
Simon McCallum
3 months ago
1 version
MajorityVoteModel.predict:v1
3049 calls
pepe
6 months ago
2 versions
eval_majority_vote:v2
102 calls
pepe
6 months ago
3 versions
debug_scorer:v1
19 calls
Josh Wand
7 months ago
2 versions
langchain.Parser.PydanticToolsParser:v0
20 calls
7 months ago
1 version
langchain.Chain.ChannelWrite-answer_parser:v0
123 calls
7 months ago
1 version
langchain.Llm.ChatAnthropic:v0
20 calls
7 months ago
1 version
langchain.Chain.ChannelWrite-question_validator:v0
123 calls
7 months ago
1 version
anthropic.Messages.create:v0
20 calls
7 months ago
1 version
langchain.Chain.RunnableSequence:v0
20 calls
7 months ago
1 version