Evaluations
Column labels: inputs, output, model_latency, Performance Metrics, Safety Metrics, Correctness, No Hallucination, Trace, Feedback, Status, model, self, mean, claude-3-haiku-20240307, gpt-4o-mini, Total Avg

Nr1_Retrieval  claude-3-haiku-20240307  3.1515  0.5556  0.2222  0.3889  0.6667  0.6667
                                        4.8782  0.7143  0.1429  0.4286  0.7143  0.8571