Skip to main content
llm-leaderboard
Projects
nejumi-leaderboard4
Traces
Log in
Sign up
Overview
Models
Workspace
Runs
More
Weave
Traces
Evals
Playground
Monitors
Assets
More
Traces
All Ops
Filter
Past 1w
inputs
output
Trace
Feedback
Status
input
model
self
store
tools
background
[ARC-AGI] openai/o3-mini-2025-01-31
b4e6
N/A
N/A
N/A
N/A
N/A
N/A
N/A
[ARC-AGI] openai/o3-mini-2025-01-31
fe2d
N/A
N/A
N/A
N/A
N/A
N/A
N/A
[Hallulens] openai/o3-mini-2025-01-31
d334
N/A
N/A
N/A
N/A
N/A
N/A
N/A
[Hallulens] openai/o3-mini-2025-01-31
11d7
N/A
N/A
N/A
N/A
N/A
N/A
N/A
[HLE] openai/o3-mini-2025-01-31
3f44
N/A
N/A
N/A
N/A
N/A
N/A
N/A
[HLE] openai/o3-mini-2025-01-31
1a12
N/A
N/A
N/A
N/A
N/A
N/A
N/A
[JTruthfulQA] openai/o3-mini-2025-01-31
7fd5
N/A
N/A
N/A
N/A
N/A
N/A
N/A
[JTruthfulQA] openai/o3-mini-2025-01-31
6c8d
N/A
N/A
N/A
N/A
N/A
N/A
N/A
[Toxicity] openai/o3-mini-2025-01-31
9423
N/A
N/A
N/A
N/A
N/A
N/A
N/A
[Toxicity] openai/o3-mini-2025-01-31
63f1
N/A
N/A
N/A
N/A
N/A
N/A
N/A
[JBBQ] openai/o3-mini-2025-01-31
24aa
N/A
N/A
N/A
N/A
N/A
N/A
N/A
[JBBQ] openai/o3-mini-2025-01-31
1d01
N/A
N/A
N/A
N/A
N/A
N/A
N/A
[MT-Bench] openai/o3-mini-2025-01-31
8cb2
N/A
N/A
N/A
N/A
N/A
N/A
N/A
[MT-Bench] openai/o3-mini-2025-01-31
4525
N/A
N/A
N/A
N/A
N/A
N/A
N/A