Skip to main content
byyoung3
Projects
aime_evaluation
Op-versions
Log in
Sign up
Project
Traces
Evals
Playground
Monitors
Leaders
Threads
Assets
Assets
All Assets
Models
Datasets
Prompts
Scorers
Evaluations
Ops
Other
Operations
Op
Calls
User
Last updated
Versions
correctness:v0
9 calls
Brett Young
6 months ago
1 version
Scorer.summarize:v0
Brett Young
6 months ago
1 version
openai.chat.completions.create:v6
4 calls
Brett Young
6 months ago
7 versions
gpt4o_scorer:v38
40 calls
Brett Young
6 months ago
39 versions
qwen3_14b_openrouter_inference:v1
11 calls
Brett Young
6 months ago
2 versions
Qwen3_14B_OpenRouter_Model.predict:v9
2 calls
Brett Young
6 months ago
10 versions
Model.predict:v0
52 calls
Brett Young
6 months ago
1 version
Evaluation.summarize:v3
5 calls
Brett Young
6 months ago
4 versions
Evaluation.evaluate:v4
16 calls
Brett Young
6 months ago
5 versions
Evaluation.predict_and_score:v4
52 calls
Brett Young
6 months ago
5 versions
gpt4o_correctness:v0
10 calls
Brett Young
6 months ago
1 version
gpt4o_scorer_correctness:v0
32 calls
Brett Young
6 months ago
1 version
R1FreeModel.predict:v0
30 calls
Brett Young
6 months ago
1 version
R1DistillQwenModel.predict:v0
30 calls
Brett Young
6 months ago
1 version
google.generativeai.GenerativeModel.generate_content:v0
112 calls
Brett Young
7 months ago
1 version
Gemini20FlashModel.predict:v2
30 calls
Brett Young
7 months ago
3 versions
Gemini25ProExpModel.predict:v1
30 calls
Brett Young
7 months ago
2 versions
anthropic.Messages.stream:v0
270 calls
Brett Young
8 months ago
1 version