Skip to main content
wandb
Projects
jobs
Jobs
Log in
Sign up
Project
Models
Workspace
Runs
More
Weave
Traces
Evals
Playground
Monitors
Assets
More
Jobs
Documentation
Project jobs
1-2
of 2
Job
Versions
Runs
Creation date
Last run
Evaluate API-hosted model
Run a suite of off-the-shelf benchmarks against an LLM API.
1
34
Nov 26th 2025 at 9:07pm
N/A
Evaluate model checkpoint
Run a suite of off-the-shelf benchmarks against a model checkpoint.
1
8
Nov 26th 2025 at 12:32am
N/A
Loading...