wandb
Project jobs
Job | Versions | Runs | Creation date | Last run
Evaluate model checkpoint: Run a suite of off-the-shelf benchmarks against a model checkpoint. | 7 | 607 | Oct 15th 2025 at 12:07am | N/A
Evaluate API-hosted model: Run a suite of off-the-shelf benchmarks against an LLM API. | 6 | 221 | Oct 15th 2025 at 12:06am | N/A