Skip to main content
wandb-applied-ai-team
Projects
mcp-tests
Evaluations
Log in
Sign up
Overview
Models
Workspace
Runs
More
Weave
Traces
Evals
Playground
Monitors
Assets
More
Evaluations
Filter
inputs
output
model_latency
model_output
api_call_statuses
chat_error_info
chat_success
has_error
Trace
Feedback
Status
model
self
mean
true_count
true_fraction
true_count
true_fraction
wandbot_gpt-4o-2024-11-20
a2c8
WandbotModel:v5
wandbot-eval:v2
94.1214
0
0
98
1
wandbot_less_query_enhancement
e9fc
WandbotModel:v3
wandbot-eval:v1
37.8922
1
0.0102
97
0.9898
dummy-evaluation
f4fc
my_ai_model:v0
dummy-evaluation:v0
0.0147
N/A
N/A
N/A
N/A
wandbot-eval
3296
WandbotModel:v1
wandbot-eval:v0
372.1061
4
0.0204
192
0.9796