Skip to main content
Reports
Created by
Created On
Last edited
Top 3 Evaluations Analysis Report
Comprehensive analysis of the top-performing evaluations in mcp-tests project, comparing correctness scores, costs, and performance metrics
0
2025-05-22
Model Evaluation Analysis
Visual analysis of recent model evaluation results showing performance metrics and trends.
0
2025-03-27
Wandbot Evaluation Analysis Report
Analysis of recent Wandbot evaluations with different model configurations, including GPT-4o vs GPT-4-0125
0
2025-03-21
Weave Traces Analysis
Analysis of the last 10 Weave traces in the mcp-tests project
0
2025-03-21
Weave Traces Analysis Report
Detailed analysis of Weave traces from the mcp-tests project, focusing on operation distribution and token usage.
0
2025-03-21
Recent Weave Traces Analysis
Analysis of the most recent Weave traces in the mcp-tests project
0
2025-03-21
OpenAI API Call Activity Analysis
Analysis of OpenAI API call patterns in the mcp-tests project
0
2025-03-21
OpenAI Chat Request Vowel Analysis
Analysis of vowel counts in OpenAI chat requests over time
0
2025-03-17
OpenAI Chat Traces Analysis
Analysis of recent OpenAI chat traces in the MCP tests project
0
2025-03-16
OpenAI Model Usage Analysis
Analysis of OpenAI model usage over time in the wandb-applied-ai-team/mcp-tests Weave project
0
2025-03-15