Skip to main content

Announcing our newest W&B SDK performance enhancements

We've made wandb-core much more performant. Learn what's new and how you can take advantage of our latest improvements.
Created on July 10|Last edited on September 23
Updated Sep 23: The new wandb-core SDK is now generally available and enabled by default for all users. You can learn more about it here.
💡
We are excited to announce that a completely redesigned wandb-core is now public preview.
wandb-core is an advanced backend for the Weights & Biases SDK. The new wandb-core reduces memory and CPU usage, delivers faster operations—like startup, shutdown, and offline syncs—and provides more stable operations, especially under high resource contention.
Some of the most significant enhancements include:
  • Lower system resources footprint: Reduced memory and file descriptor usage, improving overall efficiency.
  • System robustness: Enhanced stability on systems with high resource contention.
  • Startup / shutdown performance: Faster startup and shutdown times, particularly for short experiments.
  • Parallel run performance: More efficient CPU usage and higher performance for parallel tasks.
  • Higher performance file logging: Faster and more efficient logging operations.
  • Network utilization of filestream: More efficient data management, reducing network data transfer.
  • Offline sync speeds: Accelerated offline sync speeds for long-running experiments.

Benchmarks

We wanted to establish quantifiable benefits of the redesigned wandb-core, so we executed a set of benchmarks focused on startup / shutdown performance, parallel run performance, and logging (for reference, the SDK supports online and offline modes). Offline logging allows our users to decouple running experiments from syncing the data to the backend server. We ran the benchmarks for each mode.

Startup/shutdown performance

When initiating an experiment there is a setup cost to make sure the experiment is reliably tracked. The delays in starting and stopping an experiment is SDK overhead which takes away from compute resources which could be used for model training. This overhead is mostly noticed with very short experiments (measured in seconds). We wanted to understand the reduction in overhead using the new SDK.

Parallel run performance

The SDK has the ability to track multiple experiments in parallel (like using python multiprocessing, for example). We wanted to understand how much more efficient parallel experiments are using the new SDK.

Logging tables

W&B Tables are an important datatype that allows detailed analysis in the wandb UI. We wanted to understand how much more efficient logging of tables is using the new SDK.
Here are the results from our benchmarking (you can also find the full details on GitHub):
MetricModeImprovement with wandb-core
Startup/Shutdown TimeOffline36% improvement
Startup/Shutdown TimeOnline23% improvement
Parallel Scalar Logging PerformanceOffline83% improvement
Parallel Scalar Logging PerformanceOnline88% improvement
Table Logging PerformanceOffline18% improvement
Table Logging PerformanceOnline40% improvement


Artifacts

Another part of the redesign was to help improve artifact upload/download performance based on feedback from customers. wandb-core is now optimized for faster data transfer, significantly reducing the time required for uploading and downloading artifacts.
For example, downloading a 10MB artifact now takes 2.81 seconds compared to 6.67 seconds with the previous backend. This means datasets, models, and other large artifacts are managed more efficiently, reducing delays and improving workflow. We consistently see 33% faster artifact upload times and 28% faster artifact download times.

Get started

wandb-core is now enabled by default for all users. Check out the notebook to start using it and reach out to the support team at support@wandb.com with any questions.
Iterate on AI agents and models faster. Try Weights & Biases today.