This is the multi-page printable view of this section.
Click here to print.
Return to the regular view of this page.
Python SDK
The W&B Python SDK, accessible at wandb
, enables you to train and fine-tune models, and manage models from experimentation to production.
After performing your training and fine-tuning operations with this SDK, you can use the Public API to query and analyze the data that was logged, and the Reports and Workspaces API to generate a web-publishable report summarizing your work.
Installation and setup
Sign up and create an API key
To authenticate your machine with W&B, you must first generate an API key at https://wandb.ai/authorize.
Install and import packages
Install the W&B library.
pip install wandb
Import W&B Python SDK:
import wandb
# Specify your team entity
entity = "<team_entity>"
# Project that the run is recorded to
project = "my-awesome-project"
with wandb.init(entity=entity, project=project) as run:
run.log({"accuracy": .90, "loss": .10})
The wandb
package supports optional extras for specific features. To install these, use square brackets with the package name.
In zsh (default on macOS), you need to escape the brackets or use quotes, e.g. pip install "wandb[media]"
Available extras:
wandb[media]
- Media logging support (installs bokeh, moviepy, pillow, plotly)
wandb[workspaces]
- Workspaces functionality (installs wandb-workspaces)
wandb[sweeps]
- Hyperparameter sweeps support
wandb[launch]
- W&B Launch support
wandb[models]
- Model management features
wandb[aws]
- AWS integrations
wandb[azure]
- Azure integrations
wandb[gcp]
- Google Cloud Platform integrations
wandb[kubeflow]
- Kubeflow integrations
wandb[importers]
- Data importers
wandb[perf]
- Performance monitoring tools
Installation examples:
# Using quotes (works in all shells)
pip install "wandb[media]"
pip install "wandb[workspaces,media]"
# Escaping brackets (for zsh/bash)
pip install wandb\[media\]
# Install all common extras
pip install "wandb[media,workspaces,sweeps,launch]"
For detailed documentation on each extra, see the Optional Extras Reference.
1 - Global Functions
The W&B Python SDK provides a set of global functions that serve as the primary entry points for interacting with the platform. These functions are called directly on the wandb
module and form the foundation of most W&B workflows.
Overview
Global functions in W&B are top-level functions that you call directly, such as wandb.init()
or wandb.login()
. Unlike methods that belong to specific classes, these functions provide direct access to W&B’s core functionality without needing to instantiate objects first.
Available Functions
Function |
Description |
init() |
Start a new run to track and log to W&B. This is typically the first function you’ll call in your ML training pipeline. |
login() |
Set up W&B login credentials to authenticate your machine with the platform. |
finish() |
Complete a run and upload any remaining data to ensure all information is synced to the server. |
setup() |
Prepare W&B for use in the current process and its children. Useful for multi-process applications. |
teardown() |
Clean up W&B resources and shut down the backend process. |
sweep() |
Initialize a hyperparameter sweep to search for optimal model configurations. |
agent() |
Create a sweep agent to run hyperparameter optimization experiments. |
controller() |
Manage and control sweep agents and their execution. |
restore() |
Restore a previous run or experiment state for resuming work. |
Getting Started
The most common workflow begins with authenticating with W&B, initializing a run, and logging values (such as accuracy and loss) from your training loop:
import wandb
# Authenticate with W&B
wandb.login()
config = {
"learning_rate": 0.01,
"epochs": 10,
}
# Project where logs
project = "my-awesome-project"
# Start a new run
with wandb.init(project=project, config=config) as run:
# Your training code here...
# Log values to W&B
run.log({"accuracy": acc, "loss": loss})
## Key Concepts
- **Runs**: The fundamental unit of tracking in W&B, representing a single execution of your code
- **Authentication**: Required to sync data with the W&B platform
- **Configuration**: Store hyperparameters and metadata for your experiments
- **Sweeps**: Automated hyperparameter optimization across multiple runs
For detailed information about each function, click on the function names above to view their complete documentation, including parameters, examples, and usage patterns.
1.1 - agent()
function agent
agent(
sweep_id: str,
function: Optional[Callable] = None,
entity: Optional[str] = None,
project: Optional[str] = None,
count: Optional[int] = None
) → None
Start one or more sweep agents.
The sweep agent uses the sweep_id
to know which sweep it is a part of, what function to execute, and (optionally) how many agents to run.
Args:
sweep_id
: The unique identifier for a sweep. A sweep ID is generated by W&B CLI or Python SDK.
function
: A function to call instead of the “program” specified in the sweep config.
entity
: The username or team name where you want to send W&B runs created by the sweep to. Ensure that the entity you specify already exists. If you don’t specify an entity, the run will be sent to your default entity, which is usually your username.
project
: The name of the project where W&B runs created from the sweep are sent to. If the project is not specified, the run is sent to a project labeled “Uncategorized”.
count
: The number of sweep config trials to try.
1.2 - controller()
function controller
controller(
sweep_id_or_config: Optional[str, Dict] = None,
entity: Optional[str] = None,
project: Optional[str] = None
) → _WandbController
Public sweep controller constructor.
Examples:
import wandb
tuner = wandb.controller(...)
print(tuner.sweep_config)
print(tuner.sweep_id)
tuner.configure_search(...)
tuner.configure_stopping(...)
1.3 - finish()
function finish
finish(exit_code: 'int | None' = None, quiet: 'bool | None' = None) → None
Finish a run and upload any remaining data.
Marks the completion of a W&B run and ensures all data is synced to the server. The run’s final state is determined by its exit conditions and sync status.
Run States:
- Running: Active run that is logging data and/or sending heartbeats.
- Crashed: Run that stopped sending heartbeats unexpectedly.
- Finished: Run completed successfully (
exit_code=0
) with all data synced.
- Failed: Run completed with errors (
exit_code!=0
).
Args:
exit_code
: Integer indicating the run’s exit status. Use 0 for success, any other value marks the run as failed.
quiet
: Deprecated. Configure logging verbosity using wandb.Settings(quiet=...)
.
1.4 - init()
function init
init(
entity: 'str | None' = None,
project: 'str | None' = None,
dir: 'StrPath | None' = None,
id: 'str | None' = None,
name: 'str | None' = None,
notes: 'str | None' = None,
tags: 'Sequence[str] | None' = None,
config: 'dict[str, Any] | str | None' = None,
config_exclude_keys: 'list[str] | None' = None,
config_include_keys: 'list[str] | None' = None,
allow_val_change: 'bool | None' = None,
group: 'str | None' = None,
job_type: 'str | None' = None,
mode: "Literal['online', 'offline', 'disabled', 'shared'] | None" = None,
force: 'bool | None' = None,
anonymous: "Literal['never', 'allow', 'must'] | None" = None,
reinit: "bool | Literal[None, 'default', 'return_previous', 'finish_previous', 'create_new']" = None,
resume: "bool | Literal['allow', 'never', 'must', 'auto'] | None" = None,
resume_from: 'str | None' = None,
fork_from: 'str | None' = None,
save_code: 'bool | None' = None,
tensorboard: 'bool | None' = None,
sync_tensorboard: 'bool | None' = None,
monitor_gym: 'bool | None' = None,
settings: 'Settings | dict[str, Any] | None' = None
) → Run
Start a new run to track and log to W&B.
In an ML training pipeline, you could add wandb.init()
to the beginning of your training script as well as your evaluation script, and each piece would be tracked as a run in W&B.
wandb.init()
spawns a new background process to log data to a run, and it also syncs data to https://wandb.ai by default, so you can see your results in real-time. When you’re done logging data, call wandb.Run.finish()
to end the run. If you don’t call run.finish()
, the run will end when your script exits.
Run IDs must not contain any of the following special characters / \ # ? % :
Args:
entity
: The username or team name the runs are logged to. The entity must already exist, so ensure you create your account or team in the UI before starting to log runs. If not specified, the run will default your default entity. To change the default entity, go to your settings and update the “Default location to create new projects” under “Default team”.
project
: The name of the project under which this run will be logged. If not specified, we use a heuristic to infer the project name based on the system, such as checking the git root or the current program file. If we can’t infer the project name, the project will default to "uncategorized"
.
dir
: The absolute path to the directory where experiment logs and metadata files are stored. If not specified, this defaults to the ./wandb
directory. Note that this does not affect the location where artifacts are stored when calling download()
.
id
: A unique identifier for this run, used for resuming. It must be unique within the project and cannot be reused once a run is deleted. For a short descriptive name, use the name
field, or for saving hyperparameters to compare across runs, use config
.
name
: A short display name for this run, which appears in the UI to help you identify it. By default, we generate a random two-word name allowing easy cross-reference runs from table to charts. Keeping these run names brief enhances readability in chart legends and tables. For saving hyperparameters, we recommend using the config
field.
notes
: A detailed description of the run, similar to a commit message in Git. Use this argument to capture any context or details that may help you recall the purpose or setup of this run in the future.
tags
: A list of tags to label this run in the UI. Tags are helpful for organizing runs or adding temporary identifiers like “baseline” or “production.” You can easily add, remove tags, or filter by tags in the UI. If resuming a run, the tags provided here will replace any existing tags. To add tags to a resumed run without overwriting the current tags, use run.tags += ("new_tag",)
after calling run = wandb.init()
.
config
: Sets wandb.config
, a dictionary-like object for storing input parameters to your run, such as model hyperparameters or data preprocessing settings. The config appears in the UI in an overview page, allowing you to group, filter, and sort runs based on these parameters. Keys should not contain periods (.
), and values should be smaller than 10 MB. If a dictionary, argparse.Namespace
, or absl.flags.FLAGS
is provided, the key-value pairs will be loaded directly into wandb.config
. If a string is provided, it is interpreted as a path to a YAML file, from which configuration values will be loaded into wandb.config
.
config_exclude_keys
: A list of specific keys to exclude from wandb.config
.
config_include_keys
: A list of specific keys to include in wandb.config
.
allow_val_change
: Controls whether config values can be modified after their initial set. By default, an exception is raised if a config value is overwritten. For tracking variables that change during training, such as a learning rate, consider using wandb.log()
instead. By default, this is False
in scripts and True
in Notebook environments.
group
: Specify a group name to organize individual runs as part of a larger experiment. This is useful for cases like cross-validation or running multiple jobs that train and evaluate a model on different test sets. Grouping allows you to manage related runs collectively in the UI, making it easy to toggle and review results as a unified experiment.
job_type
: Specify the type of run, especially helpful when organizing runs within a group as part of a larger experiment. For example, in a group, you might label runs with job types such as “train” and “eval”. Defining job types enables you to easily filter and group similar runs in the UI, facilitating direct comparisons.
mode
: Specifies how run data is managed, with the following options:
"online"
(default): Enables live syncing with W&B when a network connection is available, with real-time updates to visualizations.
"offline"
: Suitable for air-gapped or offline environments; data is saved locally and can be synced later. Ensure the run folder is preserved to enable future syncing.
"disabled"
: Disables all W&B functionality, making the run’s methods no-ops. Typically used in testing to bypass W&B operations.
"shared"
: (This is an experimental feature). Allows multiple processes, possibly on different machines, to simultaneously log to the same run. In this approach you use a primary node and one or more worker nodes to log data to the same run. Within the primary node you initialize a run. For each worker node, initialize a run using the run ID used by the primary node.
force
: Determines if a W&B login is required to run the script. If True
, the user must be logged in to W&B; otherwise, the script will not proceed. If False
(default), the script can proceed without a login, switching to offline mode if the user is not logged in.
anonymous
: Specifies the level of control over anonymous data logging. Available options are:
"never"
(default): Requires you to link your W&B account before tracking the run. This prevents unintentional creation of anonymous runs by ensuring each run is associated with an account.
"allow"
: Enables a logged-in user to track runs with their account, but also allows someone running the script without a W&B account to view the charts and data in the UI.
"must"
: Forces the run to be logged to an anonymous account, even if the user is logged in.
reinit
: Shorthand for the “reinit” setting. Determines the behavior of wandb.init()
when a run is active.
resume
: Controls the behavior when resuming a run with the specified id
. Available options are:
"allow"
: If a run with the specified id
exists, it will resume from the last step; otherwise, a new run will be created.
"never"
: If a run with the specified id
exists, an error will be raised. If no such run is found, a new run will be created.
"must"
: If a run with the specified id
exists, it will resume from the last step. If no run is found, an error will be raised.
"auto"
: Automatically resumes the previous run if it crashed on this machine; otherwise, starts a new run.
True
: Deprecated. Use "auto"
instead.
False
: Deprecated. Use the default behavior (leaving resume
unset) to always start a new run. If resume
is set, fork_from
and resume_from
cannot be used. When resume
is unset, the system will always start a new run.
resume_from
: Specifies a moment in a previous run to resume a run from, using the format {run_id}?_step={step}
. This allows users to truncate the history logged to a run at an intermediate step and resume logging from that step. The target run must be in the same project. If an id
argument is also provided, the resume_from
argument will take precedence. resume
, resume_from
and fork_from
cannot be used together, only one of them can be used at a time. Note that this feature is in beta and may change in the future.
fork_from
: Specifies a point in a previous run from which to fork a new run, using the format {id}?_step={step}
. This creates a new run that resumes logging from the specified step in the target run’s history. The target run must be part of the current project. If an id
argument is also provided, it must be different from the fork_from
argument, an error will be raised if they are the same. resume
, resume_from
and fork_from
cannot be used together, only one of them can be used at a time. Note that this feature is in beta and may change in the future.
save_code
: Enables saving the main script or notebook to W&B, aiding in experiment reproducibility and allowing code comparisons across runs in the UI. By default, this is disabled, but you can change the default to enable on your settings page.
tensorboard
: Deprecated. Use sync_tensorboard
instead.
sync_tensorboard
: Enables automatic syncing of W&B logs from TensorBoard or TensorBoardX, saving relevant event files for viewing in the W&B UI.
saving relevant event files for viewing in the W&B UI. (Default
: False
)
monitor_gym
: Enables automatic logging of videos of the environment when using OpenAI Gym.
settings
: Specifies a dictionary or wandb.Settings
object with advanced settings for the run.
Returns:
A Run
object.
Raises:
Error
: If some unknown or internal error happened during the run initialization.
AuthenticationError
: If the user failed to provide valid credentials.
CommError
: If there was a problem communicating with the WandB server.
UsageError
: If the user provided invalid arguments.
KeyboardInterrupt
: If user interrupts the run.
Examples:
wandb.init()
returns a Run
object. Use the run object to log data, save artifacts, and manage the run lifecycle.
import wandb
config = {"lr": 0.01, "batch_size": 32}
with wandb.init(config=config) as run:
# Log accuracy and loss to the run
acc = 0.95 # Example accuracy
loss = 0.05 # Example loss
run.log({"accuracy": acc, "loss": loss})
1.5 - login()
function login
login(
anonymous: Optional[Literal['must', 'allow', 'never']] = None,
key: Optional[str] = None,
relogin: Optional[bool] = None,
host: Optional[str] = None,
force: Optional[bool] = None,
timeout: Optional[int] = None,
verify: bool = False,
referrer: Optional[str] = None
) → bool
Set up W&B login credentials.
By default, this will only store credentials locally without verifying them with the W&B server. To verify credentials, pass verify=True
.
Args:
anonymous
: Set to “must”, “allow”, or “never”. If set to “must”, always log a user in anonymously. If set to “allow”, only create an anonymous user if the user isn’t already logged in. If set to “never”, never log a user anonymously. Default set to “never”. Defaults to None
.
key
: The API key to use.
relogin
: If true, will re-prompt for API key.
host
: The host to connect to.
force
: If true, will force a relogin.
timeout
: Number of seconds to wait for user input.
verify
: Verify the credentials with the W&B server.
referrer
: The referrer to use in the URL login request.
Returns:
bool
: If key
is configured.
Raises:
AuthenticationError
: If api_key
fails verification with the server.
UsageError
: If api_key
cannot be configured and no tty.
1.6 - restore()
function restore
restore(
name: 'str',
run_path: 'str | None' = None,
replace: 'bool' = False,
root: 'str | None' = None
) → None | TextIO
Download the specified file from cloud storage.
File is placed into the current directory or run directory. By default, will only download the file if it doesn’t already exist.
Args:
name
: The name of the file.
run_path
: Optional path to a run to pull files from, i.e. username/project_name/run_id
if wandb.init has not been called, this is required.
replace
: Whether to download the file even if it already exists locally
root
: The directory to download the file to. Defaults to the current directory or the run directory if wandb.init was called.
Returns:
None if it can’t find the file, otherwise a file object open for reading.
Raises:
CommError
: If W&B can’t connect to the W&B backend.
ValueError
: If the file is not found or can’t find run_path.
1.7 - setup()
function setup
setup(settings: 'Settings | None' = None) → _WandbSetup
Prepares W&B for use in the current process and its children.
You can usually ignore this as it is implicitly called by wandb.init()
.
When using wandb in multiple processes, calling wandb.setup()
in the parent process before starting child processes may improve performance and resource utilization.
Note that wandb.setup()
modifies os.environ
, and it is important that child processes inherit the modified environment variables.
See also wandb.teardown()
.
Args:
settings
: Configuration settings to apply globally. These can be overridden by subsequent wandb.init()
calls.
Example:
import multiprocessing
import wandb
def run_experiment(params):
with wandb.init(config=params):
# Run experiment
pass
if __name__ == "__main__":
# Start backend and set global config
wandb.setup(settings={"project": "my_project"})
# Define experiment parameters
experiment_params = [
{"learning_rate": 0.01, "epochs": 10},
{"learning_rate": 0.001, "epochs": 20},
]
# Start multiple processes, each running a separate experiment
processes = []
for params in experiment_params:
p = multiprocessing.Process(target=run_experiment, args=(params,))
p.start()
processes.append(p)
# Wait for all processes to complete
for p in processes:
p.join()
# Optional: Explicitly shut down the backend
wandb.teardown()
1.8 - sweep()
function sweep
sweep(
sweep: Union[dict, Callable],
entity: Optional[str] = None,
project: Optional[str] = None,
prior_runs: Optional[List[str]] = None
) → str
Initialize a hyperparameter sweep.
Search for hyperparameters that optimizes a cost function of a machine learning model by testing various combinations.
Make note the unique identifier, sweep_id
, that is returned. At a later step provide the sweep_id
to a sweep agent.
See Sweep configuration structure for information on how to define your sweep.
Args:
sweep
: The configuration of a hyperparameter search. (or configuration generator). If you provide a callable, ensure that the callable does not take arguments and that it returns a dictionary that conforms to the W&B sweep config spec.
entity
: The username or team name where you want to send W&B runs created by the sweep to. Ensure that the entity you specify already exists. If you don’t specify an entity, the run will be sent to your default entity, which is usually your username.
project
: The name of the project where W&B runs created from the sweep are sent to. If the project is not specified, the run is sent to a project labeled ‘Uncategorized’.
prior_runs
: The run IDs of existing runs to add to this sweep.
Returns:
str
: A unique identifier for the sweep.
1.9 - teardown()
function teardown
teardown(exit_code: 'int | None' = None) → None
Waits for W&B to finish and frees resources.
Completes any runs that were not explicitly finished using run.finish()
and waits for all data to be uploaded.
It is recommended to call this at the end of a session that used wandb.setup()
. It is invoked automatically in an atexit
hook, but this is not reliable in certain setups such as when using Python’s multiprocessing
module.
2 - Data Types
The W&B Python SDK includes data types for logging various forms of media and structured data.
Overview
Data Types in W&B are classes that wrap media and structured data for logging to runs. They include visualization components in the W&B UI and handle data serialization, storage, and retrieval.
Available Data Types
Data Type |
Description |
Image |
Log images with support for masks, bounding boxes, and segmentation. |
Video |
Track video data for model outputs or dataset samples. |
Audio |
Log audio samples for audio processing tasks. |
Table |
Create tables that can contain mixed media types. |
Plotly |
Log Plotly charts for data visualization. |
Html |
Embed custom HTML content. |
Object3D |
Visualize 3D point clouds and meshes. |
Molecule |
Log molecular structures for computational chemistry. |
Box3D |
Track 3D bounding boxes for 3D object detection. |
Getting Started
Using Image
Data Type to log an image:
import wandb
import matplotlib.pyplot as plt
# Generate an image for demonstration purposes
path_to_img = "/path/to/cafe.png"
im = plt.imread(path_to_img)
# Initialize a new run
with wandb.init(project="awesome-project") as run:
# Log the image
run.log({"img": [wandb.Image(im, caption="Cafe")]})
Using a Table
Data Type to log a table with mixed text and labels:
import wandb
# Initialize a new run
with wandb.init(project="visualize-predictions", name="tables") as run:
# Create tabular data, using a list of lists
data = [["Cat", "1", "1"],["Dog", "0", "-1"]]
run.log({"Table 1": wandb.Table(data=data, columns=["Text", "Predicted Label", "True Label"])})
# Create tabular data, using `wandb.Table.add_data()` method
table = wandb.Table(columns=["Text", "Predicted Label", "True Label"])
table.add_data("Cat", "1", "1")
table.add_data("Dog", "0", "-1")
run.log({"Table 2": table})
2.1 - Audio
class Audio
W&B class for audio clips.
method Audio.__init__
__init__(
data_or_path: Union[str, pathlib.Path, list, ForwardRef('np.ndarray')],
sample_rate: Optional[int] = None,
caption: Optional[str] = None
)
Accept a path to an audio file or a numpy array of audio data.
Args:
data_or_path
: A path to an audio file or a NumPy array of audio data.
sample_rate
: Sample rate, required when passing in raw NumPy array of audio data.
caption
: Caption to display with audio.
classmethod Audio.durations
Calculate the duration of the audio files.
classmethod Audio.sample_rates
Get sample rates of the audio files.
2.2 - box3d()
function box3d
box3d(
center: 'npt.ArrayLike',
size: 'npt.ArrayLike',
orientation: 'npt.ArrayLike',
color: 'RGBColor',
label: 'Optional[str]' = None,
score: 'Optional[numeric]' = None
) → Box3D
Returns a Box3D.
Args:
center
: The center point of the box as a length-3 ndarray.
size
: The box’s X, Y and Z dimensions as a length-3 ndarray.
orientation
: The rotation transforming global XYZ coordinates into the box’s local XYZ coordinates, given as a length-4 ndarray [r, x, y, z] corresponding to the non-zero quaternion r + xi + yj + zk.
color
: The box’s color as an (r, g, b) tuple with 0 <= r,g,b <= 1.
label
: An optional label for the box.
score
: An optional score for the box.
2.3 - Html
class Html
W&B class for logging HTML content to W&B.
method Html.__init__
__init__(
data: Union[str, pathlib.Path, ForwardRef('TextIO')],
inject: bool = True,
data_is_not_path: bool = False
) → None
Creates a W&B HTML object.
Args:
data: A string that is a path to a file with the extension “.html”, or a string or IO object containing literal HTML.
inject
: Add a stylesheet to the HTML object. If set to False the HTML will pass through unchanged.
data_is_not_path
: If set to False, the data will be treated as a path to a file.
Examples:
It can be initialized by providing a path to a file:
with wandb.init() as run:
run.log({"html": wandb.Html("./index.html")})
Alternatively, it can be initialized by providing literal HTML, in either a string or IO object:
with wandb.init() as run:
run.log({"html": wandb.Html("<h1>Hello, world!</h1>")})
2.4 - Image
class Image
A class for logging images to W&B.
method Image.__init__
__init__(
data_or_path: 'ImageDataOrPathType',
mode: Optional[str] = None,
caption: Optional[str] = None,
grouping: Optional[int] = None,
classes: Optional[ForwardRef('Classes'), Sequence[dict]] = None,
boxes: Optional[Dict[str, ForwardRef('BoundingBoxes2D')], Dict[str, dict]] = None,
masks: Optional[Dict[str, ForwardRef('ImageMask')], Dict[str, dict]] = None,
file_type: Optional[str] = None,
normalize: bool = True
) → None
Initialize a wandb.Image
object.
Args:
data_or_path
: Accepts NumPy array/pytorch tensor of image data, a PIL image object, or a path to an image file. If a NumPy array or pytorch tensor is provided, the image data will be saved to the given file type. If the values are not in the range [0, 255] or all values are in the range [0, 1], the image pixel values will be normalized to the range [0, 255] unless normalize
is set to False.
- pytorch tensor should be in the format (channel, height, width)
- NumPy array should be in the format (height, width, channel)
mode
: The PIL mode for an image. Most common are “L”, “RGB”,
"RGBA". Full explanation at https
: //pillow.readthedocs.io/en/stable/handbook/concepts.html#modes
caption
: Label for display of image.
grouping
: The grouping number for the image.
classes
: A list of class information for the image, used for labeling bounding boxes, and image masks.
boxes
: A dictionary containing bounding box information for the image.
see
: https://docs.wandb.ai/ref/python/data-types/boundingboxes2d/
masks
: A dictionary containing mask information for the image.
see
: https://docs.wandb.ai/ref/python/data-types/imagemask/
file_type
: The file type to save the image as. This parameter has no effect if data_or_path is a path to an image file.
normalize
: If True, normalize the image pixel values to fall within the range of [0, 255]. Normalize is only applied if data_or_path is a numpy array or pytorch tensor.
Examples:
Create a wandb.Image from a numpy array
import numpy as np
import wandb
with wandb.init() as run:
examples = []
for i in range(3):
pixels = np.random.randint(low=0, high=256, size=(100, 100, 3))
image = wandb.Image(pixels, caption=f"random field {i}")
examples.append(image)
run.log({"examples": examples})
Create a wandb.Image from a PILImage
import numpy as np
from PIL import Image as PILImage
import wandb
with wandb.init() as run:
examples = []
for i in range(3):
pixels = np.random.randint(
low=0, high=256, size=(100, 100, 3), dtype=np.uint8
)
pil_image = PILImage.fromarray(pixels, mode="RGB")
image = wandb.Image(pil_image, caption=f"random field {i}")
examples.append(image)
run.log({"examples": examples})
Log .jpg rather than .png (default)
import numpy as np
import wandb
with wandb.init() as run:
examples = []
for i in range(3):
pixels = np.random.randint(low=0, high=256, size=(100, 100, 3))
image = wandb.Image(
pixels, caption=f"random field {i}", file_type="jpg"
)
examples.append(image)
run.log({"examples": examples})
property Image.image
2.5 - Molecule
class Molecule
W&B class for 3D Molecular data.
method Molecule.__init__
__init__(
data_or_path: Union[str, pathlib.Path, ForwardRef('TextIO')],
caption: Optional[str] = None,
**kwargs: str
) → None
Initialize a Molecule object.
Args:
data_or_path
: Molecule can be initialized from a file name or an io object.
caption
: Caption associated with the molecule for display.
2.6 - Object3D
class Object3D
W&B class for 3D point clouds.
method Object3D.__init__
__init__(
data_or_path: Union[ForwardRef('np.ndarray'), str, pathlib.Path, ForwardRef('TextIO'), dict],
caption: Optional[str] = None,
**kwargs: Optional[str, ForwardRef('FileFormat3D')]
) → None
Creates a W&B Object3D object.
Args:
data_or_path
: Object3D can be initialized from a file or a numpy array.
caption
: Caption associated with the object for display.
Examples:
The shape of the numpy array must be one of either
[[x y z], ...] nx3
[[x y z c], ...] nx4 where c is a category with supported range [1, 14]
[[x y z r g b], ...] nx6 where is rgb is color
2.7 - Plotly
class Plotly
W&B class for Plotly plots.
method Plotly.__init__
__init__(
val: Union[ForwardRef('plotly.Figure'), ForwardRef('matplotlib.artist.Artist')]
)
Initialize a Plotly object.
Args:
val
: Matplotlib or Plotly figure.
2.8 - Table
class Table
The Table class used to display and analyze tabular data.
Unlike traditional spreadsheets, Tables support numerous types of data: scalar values, strings, numpy arrays, and most subclasses of wandb.data_types.Media
. This means you can embed Images
, Video
, Audio
, and other sorts of rich, annotated media directly in Tables, alongside other traditional scalar values.
This class is the primary class used to generate W&B Tables https://docs.wandb.ai/guides/models/tables/.
method Table.__init__
__init__(
columns=None,
data=None,
rows=None,
dataframe=None,
dtype=None,
optional=True,
allow_mixed_types=False,
log_mode: Optional[Literal['IMMUTABLE', 'MUTABLE', 'INCREMENTAL']] = 'IMMUTABLE'
)
Initializes a Table object.
The rows is available for legacy reasons and should not be used. The Table class uses data to mimic the Pandas API.
Args:
columns
: (List[str]) Names of the columns in the table. Defaults to [“Input”, “Output”, “Expected”].
data
: (List[List[any]]) 2D row-oriented array of values.
dataframe
: (pandas.DataFrame) DataFrame object used to create the table. When set, data
and columns
arguments are ignored.
rows
: (List[List[any]]) 2D row-oriented array of values.
optional
: (Union[bool,List[bool]]) Determines if None
values are allowed. Default to True
- If a singular bool value, then the optionality is enforced for all columns specified at construction time
- If a list of bool values, then the optionality is applied to each column - should be the same length as columns
applies to all columns. A list of bool values applies to each respective column.
allow_mixed_types
: (bool) Determines if columns are allowed to have mixed types (disables type validation). Defaults to False
log_mode
: Optional[str] Controls how the Table is logged when mutations occur. Options:
- “IMMUTABLE” (default): Table can only be logged once; subsequent logging attempts after the table has been mutated will be no-ops.
- “MUTABLE”: Table can be re-logged after mutations, creating a new artifact version each time it’s logged.
- “INCREMENTAL”: Table data is logged incrementally, with each log creating a new artifact entry containing the new data since the last log.
method Table.add_column
add_column(name, data, optional=False)
Adds a column of data to the table.
Args:
name
: (str) - the unique name of the column
data
: (list | np.array) - a column of homogeneous data
optional
: (bool) - if null-like values are permitted
method Table.add_computed_columns
Adds one or more computed columns based on existing data.
Args:
fn
: A function which accepts one or two parameters, ndx (int) and row (dict), which is expected to return a dict representing new columns for that row, keyed by the new column names.
ndx
is an integer representing the index of the row. Only included if include_ndx
is set to True
.
row
is a dictionary keyed by existing columns
method Table.add_data
Adds a new row of data to the table.
The maximum amount ofrows in a table is determined by wandb.Table.MAX_ARTIFACT_ROWS
.
The length of the data should match the length of the table column.
method Table.add_row
Deprecated. Use Table.add_data
method instead.
method Table.cast
cast(col_name, dtype, optional=False)
Casts a column to a specific data type.
This can be one of the normal python classes, an internal W&B type, or an example object, like an instance of wandb.Image or wandb.Classes.
Args:
col_name
(str): The name of the column to cast.
dtype
(class, wandb.wandb_sdk.interface._dtypes.Type, any): The target dtype.
optional
(bool): If the column should allow Nones.
method Table.get_column
get_column(name, convert_to=None)
Retrieves a column from the table and optionally converts it to a NumPy object.
Args:
name
: (str) - the name of the column
convert_to
: (str, optional)
- “numpy”: will convert the underlying data to numpy object
method Table.get_dataframe
Returns a pandas.DataFrame
of the table.
method Table.get_index
Returns an array of row indexes for use in other tables to create links.
2.9 - Video
class Video
A class for logging videos to W&B.
method Video.__init__
__init__(
data_or_path: Union[str, pathlib.Path, ForwardRef('np.ndarray'), ForwardRef('TextIO'), ForwardRef('BytesIO')],
caption: Optional[str] = None,
fps: Optional[int] = None,
format: Optional[Literal['gif', 'mp4', 'webm', 'ogg']] = None
)
Initialize a W&B Video object.
Args:
data_or_path
: Video can be initialized with a path to a file or an io object. Video can be initialized with a numpy tensor. The numpy tensor must be either 4 dimensional or 5 dimensional. The dimensions should be (number of frames, channel, height, width) or (batch, number of frames, channel, height, width) The format parameter must be specified with the format argument when initializing with a numpy array or io object.
caption
: Caption associated with the video for display.
fps
: The frame rate to use when encoding raw video frames. Default value is 4. This parameter has no effect when data_or_path is a string, or bytes.
format
: Format of video, necessary if initializing with a numpy array or io object. This parameter will be used to determine the format to use when encoding the video data. Accepted values are “gif”, “mp4”, “webm”, or “ogg”. If no value is provided, the default format will be “gif”.
Examples:
Log a numpy array as a video
import numpy as np
import wandb
with wandb.init() as run:
# axes are (number of frames, channel, height, width)
frames = np.random.randint(
low=0, high=256, size=(10, 3, 100, 100), dtype=np.uint8
)
run.log({"video": wandb.Video(frames, format="mp4", fps=4)})
3 - Artifact
class Artifact
Flexible and lightweight building block for dataset and model versioning.
Construct an empty W&B Artifact. Populate an artifacts contents with methods that begin with add
. Once the artifact has all the desired files, you can call run.log_artifact()
to log it.
Args:
name
(str): A human-readable name for the artifact. Use the name to identify a specific artifact in the W&B App UI or programmatically. You can interactively reference an artifact with the use_artifact
Public API. A name can contain letters, numbers, underscores, hyphens, and dots. The name must be unique across a project.
type
(str): The artifact’s type. Use the type of an artifact to both organize and differentiate artifacts. You can use any string that contains letters, numbers, underscores, hyphens, and dots. Common types include dataset
or model
. Include model
within your type string if you want to link the artifact to the W&B Model Registry. Note that some types reserved for internal use and cannot be set by users. Such types include job
and types that start with wandb-
.
description (str | None) = None
: A description of the artifact. For Model or Dataset Artifacts, add documentation for your standardized team model or dataset card. View an artifact’s description programmatically with the Artifact.description
attribute or programmatically with the W&B App UI. W&B renders the description as markdown in the W&B App.
metadata (dict[str, Any] | None) = None
: Additional information about an artifact. Specify metadata as a dictionary of key-value pairs. You can specify no more than 100 total keys.
incremental
: Use Artifact.new_draft()
method instead to modify an existing artifact.
use_as
: Deprecated.
is_link
: Boolean indication of if the artifact is a linked artifact(True
) or source artifact(False
).
Returns:
An Artifact
object.
method Artifact.__init__
__init__(
name: 'str',
type: 'str',
description: 'str | None' = None,
metadata: 'dict[str, Any] | None' = None,
incremental: 'bool' = False,
use_as: 'str | None' = None
) → None
property Artifact.aliases
List of one or more semantically-friendly references or
identifying “nicknames” assigned to an artifact version.
Aliases are mutable references that you can programmatically reference. Change an artifact’s alias with the W&B App UI or programmatically. See Create new artifact versions for more information.
property Artifact.collection
The collection this artifact was retrieved from.
A collection is an ordered group of artifact versions. If this artifact was retrieved from a portfolio / linked collection, that collection will be returned rather than the collection that an artifact version originated from. The collection that an artifact originates from is known as the source sequence.
property Artifact.commit_hash
The hash returned when this artifact was committed.
property Artifact.created_at
Timestamp when the artifact was created.
property Artifact.description
A description of the artifact.
property Artifact.digest
The logical digest of the artifact.
The digest is the checksum of the artifact’s contents. If an artifact has the same digest as the current latest
version, then log_artifact
is a no-op.
property Artifact.entity
The name of the entity that the artifact collection belongs to.
If the artifact is a link, the entity will be the entity of the linked artifact.
property Artifact.file_count
The number of files (including references).
property Artifact.history_step
The nearest step at which history metrics were logged for the source run of the artifact.
Examples:
run = artifact.logged_by()
if run and (artifact.history_step is not None):
history = run.sample_history(
min_step=artifact.history_step,
max_step=artifact.history_step + 1,
keys=["my_metric"],
)
property Artifact.id
The artifact’s ID.
property Artifact.is_link
Boolean flag indicating if the artifact is a link artifact.
True: The artifact is a link artifact to a source artifact. False: The artifact is a source artifact.
property Artifact.linked_artifacts
Returns a list of all the linked artifacts of a source artifact.
If the artifact is a link artifact (artifact.is_link == True
), it will return an empty list. Limited to 500 results.
property Artifact.manifest
The artifact’s manifest.
The manifest lists all of its contents, and can’t be changed once the artifact has been logged.
User-defined artifact metadata.
Structured data associated with the artifact.
property Artifact.name
The artifact name and version of the artifact.
A string with the format {collection}:{alias}
. If fetched before an artifact is logged/saved, the name won’t contain the alias. If the artifact is a link, the name will be the name of the linked artifact.
property Artifact.project
The name of the project that the artifact collection belongs to.
If the artifact is a link, the project will be the project of the linked artifact.
property Artifact.qualified_name
The entity/project/name of the artifact.
If the artifact is a link, the qualified name will be the qualified name of the linked artifact path.
property Artifact.size
The total size of the artifact in bytes.
Includes any references tracked by this artifact.
property Artifact.source_artifact
Returns the source artifact. The source artifact is the original logged artifact.
If the artifact itself is a source artifact (artifact.is_link == False
), it will return itself.
property Artifact.source_collection
The artifact’s source collection.
The source collection is the collection that the artifact was logged from.
property Artifact.source_entity
The name of the entity of the source artifact.
property Artifact.source_name
The artifact name and version of the source artifact.
A string with the format {source_collection}:{alias}
. Before the artifact is saved, contains only the name since the version is not yet known.
property Artifact.source_project
The name of the project of the source artifact.
property Artifact.source_qualified_name
The source_entity/source_project/source_name of the source artifact.
property Artifact.source_version
The source artifact’s version.
A string with the format v{number}
.
property Artifact.state
The status of the artifact. One of: “PENDING”, “COMMITTED”, or “DELETED”.
List of one or more tags assigned to this artifact version.
property Artifact.ttl
The time-to-live (TTL) policy of an artifact.
Artifacts are deleted shortly after a TTL policy’s duration passes. If set to None
, the artifact deactivates TTL policies and will be not scheduled for deletion, even if there is a team default TTL. An artifact inherits a TTL policy from the team default if the team administrator defines a default TTL and there is no custom policy set on an artifact.
Raises:
ArtifactNotLoggedError
: Unable to fetch inherited TTL if the artifact has not been logged or saved.
property Artifact.type
The artifact’s type. Common types include dataset
or model
.
property Artifact.updated_at
The time when the artifact was last updated.
property Artifact.url
Constructs the URL of the artifact.
Returns:
str
: The URL of the artifact.
property Artifact.use_as
Deprecated.
property Artifact.version
The artifact’s version.
A string with the format v{number}
. If the artifact is a link artifact, the version will be from the linked collection.
method Artifact.add
add(
obj: 'WBValue',
name: 'StrPath',
overwrite: 'bool' = False
) → ArtifactManifestEntry
Add wandb.WBValue obj
to the artifact.
Args:
obj
: The object to add. Currently support one of Bokeh, JoinedTable, PartitionedTable, Table, Classes, ImageMask, BoundingBoxes2D, Audio, Image, Video, Html, Object3D
name
: The path within the artifact to add the object.
overwrite
: If True, overwrite existing objects with the same file path if applicable.
Returns:
The added manifest entry
Raises:
ArtifactFinalizedError
: You cannot make changes to the current artifact version because it is finalized. Log a new artifact version instead.
method Artifact.add_dir
add_dir(
local_path: 'str',
name: 'str | None' = None,
skip_cache: 'bool | None' = False,
policy: "Literal['mutable', 'immutable'] | None" = 'mutable',
merge: 'bool' = False
) → None
Add a local directory to the artifact.
Args:
local_path
: The path of the local directory.
name
: The subdirectory name within an artifact. The name you specify appears in the W&B App UI nested by artifact’s type
. Defaults to the root of the artifact.
skip_cache
: If set to True
, W&B will not copy/move files to the cache while uploading
policy
: By default, “mutable”.
- mutable: Create a temporary copy of the file to prevent corruption during upload.
- immutable: Disable protection, rely on the user not to delete or change the file.
merge
: If False
(default), throws ValueError if a file was already added in a previous add_dir call and its content has changed. If True
, overwrites existing files with changed content. Always adds new files and never removes files. To replace an entire directory, pass a name when adding the directory using add_dir(local_path, name=my_prefix)
and call remove(my_prefix)
to remove the directory, then add it again.
Raises:
ArtifactFinalizedError
: You cannot make changes to the current artifact version because it is finalized. Log a new artifact version instead.
ValueError
: Policy must be “mutable” or “immutable”
method Artifact.add_file
add_file(
local_path: 'str',
name: 'str | None' = None,
is_tmp: 'bool | None' = False,
skip_cache: 'bool | None' = False,
policy: "Literal['mutable', 'immutable'] | None" = 'mutable',
overwrite: 'bool' = False
) → ArtifactManifestEntry
Add a local file to the artifact.
Args:
local_path
: The path to the file being added.
name
: The path within the artifact to use for the file being added. Defaults to the basename of the file.
is_tmp
: If true, then the file is renamed deterministically to avoid collisions.
skip_cache
: If True
, do not copy files to the cache after uploading.
policy
: By default, set to “mutable”. If set to “mutable”, create a temporary copy of the file to prevent corruption during upload. If set to “immutable”, disable protection and rely on the user not to delete or change the file.
overwrite
: If True
, overwrite the file if it already exists.
Returns:
The added manifest entry.
Raises:
ArtifactFinalizedError
: You cannot make changes to the current artifact version because it is finalized. Log a new artifact version instead.
ValueError
: Policy must be “mutable” or “immutable”
method Artifact.add_reference
add_reference(
uri: 'ArtifactManifestEntry | str',
name: 'StrPath | None' = None,
checksum: 'bool' = True,
max_objects: 'int | None' = None
) → Sequence[ArtifactManifestEntry]
Add a reference denoted by a URI to the artifact.
Unlike files or directories that you add to an artifact, references are not uploaded to W&B. For more information, see Track external files.
By default, the following schemes are supported:
- http(s): The size and digest of the file will be inferred by the
Content-Length
and the ETag
response headers returned by the server.
- s3: The checksum and size are pulled from the object metadata. If bucket versioning is enabled, then the version ID is also tracked.
- gs: The checksum and size are pulled from the object metadata. If bucket versioning is enabled, then the version ID is also tracked.
- https, domain matching
*.blob.core.windows.net
- Azure: The checksum and size are be pulled from the blob metadata. If storage account versioning is enabled, then the version ID is also tracked.
- file: The checksum and size are pulled from the file system. This scheme is useful if you have an NFS share or other externally mounted volume containing files you wish to track but not necessarily upload.
For any other scheme, the digest is just a hash of the URI and the size is left blank.
Args:
uri
: The URI path of the reference to add. The URI path can be an object returned from Artifact.get_entry
to store a reference to another artifact’s entry.
name
: The path within the artifact to place the contents of this reference.
checksum
: Whether or not to checksum the resource(s) located at the reference URI. Checksumming is strongly recommended as it enables automatic integrity validation. Disabling checksumming will speed up artifact creation but reference directories will not iterated through so the objects in the directory will not be saved to the artifact. We recommend setting checksum=False
when adding reference objects, in which case a new version will only be created if the reference URI changes.
max_objects
: The maximum number of objects to consider when adding a reference that points to directory or bucket store prefix. By default, the maximum number of objects allowed for Amazon S3, GCS, Azure, and local files is 10,000,000. Other URI schemas do not have a maximum.
Returns:
The added manifest entries.
Raises:
ArtifactFinalizedError
: You cannot make changes to the current artifact version because it is finalized. Log a new artifact version instead.
method Artifact.checkout
checkout(root: 'str | None' = None) → str
Replace the specified root directory with the contents of the artifact.
WARNING: This will delete all files in root
that are not included in the artifact.
Args:
root
: The directory to replace with this artifact’s files.
Returns:
The path of the checked out contents.
Raises:
ArtifactNotLoggedError
: If the artifact is not logged.
method Artifact.delete
delete(delete_aliases: 'bool' = False) → None
Delete an artifact and its files.
If called on a linked artifact, only the link is deleted, and the source artifact is unaffected.
Use artifact.unlink()
instead of artifact.delete()
to remove a link between a source artifact and a linked artifact.
Args:
delete_aliases
: If set to True
, deletes all aliases associated with the artifact. Otherwise, this raises an exception if the artifact has existing aliases. This parameter is ignored if the artifact is linked (a member of a portfolio collection).
Raises:
ArtifactNotLoggedError
: If the artifact is not logged.
method Artifact.download
download(
root: 'StrPath | None' = None,
allow_missing_references: 'bool' = False,
skip_cache: 'bool | None' = None,
path_prefix: 'StrPath | None' = None,
multipart: 'bool | None' = None
) → FilePathStr
Download the contents of the artifact to the specified root directory.
Existing files located within root
are not modified. Explicitly delete root
before you call download
if you want the contents of root
to exactly match the artifact.
Args:
root
: The directory W&B stores the artifact’s files.
allow_missing_references
: If set to True
, any invalid reference paths will be ignored while downloading referenced files.
skip_cache
: If set to True
, the artifact cache will be skipped when downloading and W&B will download each file into the default root or specified download directory.
path_prefix
: If specified, only files with a path that starts with the given prefix will be downloaded. Uses unix format (forward slashes).
multipart
: If set to None
(default), the artifact will be downloaded in parallel using multipart download if individual file size is greater than 2GB. If set to True
or False
, the artifact will be downloaded in parallel or serially regardless of the file size.
Returns:
The path to the downloaded contents.
Raises:
ArtifactNotLoggedError
: If the artifact is not logged.
method Artifact.file
file(root: 'str | None' = None) → StrPath
Download a single file artifact to the directory you specify with root
.
Args:
root
: The root directory to store the file. Defaults to ./artifacts/self.name/
.
Returns:
The full path of the downloaded file.
Raises:
ArtifactNotLoggedError
: If the artifact is not logged.
ValueError
: If the artifact contains more than one file.
method Artifact.files
files(names: 'list[str] | None' = None, per_page: 'int' = 50) → ArtifactFiles
Iterate over all files stored in this artifact.
Args:
names
: The filename paths relative to the root of the artifact you wish to list.
per_page
: The number of files to return per request.
Returns:
An iterator containing File
objects.
Raises:
ArtifactNotLoggedError
: If the artifact is not logged.
method Artifact.finalize
Finalize the artifact version.
You cannot modify an artifact version once it is finalized because the artifact is logged as a specific artifact version. Create a new artifact version to log more data to an artifact. An artifact is automatically finalized when you log the artifact with log_artifact
.
method Artifact.get
get(name: 'str') → WBValue | None
Get the WBValue object located at the artifact relative name
.
Args:
name
: The artifact relative name to retrieve.
Returns:
W&B object that can be logged with run.log()
and visualized in the W&B UI.
Raises:
ArtifactNotLoggedError
: if the artifact isn’t logged or the run is offline.
method Artifact.get_added_local_path_name
get_added_local_path_name(local_path: 'str') → str | None
Get the artifact relative name of a file added by a local filesystem path.
Args:
local_path
: The local path to resolve into an artifact relative name.
Returns:
The artifact relative name.
method Artifact.get_entry
get_entry(name: 'StrPath') → ArtifactManifestEntry
Get the entry with the given name.
Args:
name
: The artifact relative name to get
Returns:
A W&B
object.
Raises:
ArtifactNotLoggedError
: if the artifact isn’t logged or the run is offline.
KeyError
: if the artifact doesn’t contain an entry with the given name.
method Artifact.get_path
get_path(name: 'StrPath') → ArtifactManifestEntry
Deprecated. Use get_entry(name)
.
method Artifact.is_draft
Check if artifact is not saved.
Returns:
Boolean. False
if artifact is saved. True
if artifact is not saved.
method Artifact.json_encode
json_encode() → dict[str, Any]
Returns the artifact encoded to the JSON format.
Returns:
A dict
with string
keys representing attributes of the artifact.
method Artifact.link
link(target_path: 'str', aliases: 'list[str] | None' = None) → Artifact
Link this artifact to a portfolio (a promoted collection of artifacts).
Args:
target_path
: The path to the portfolio inside a project. The target path must adhere to one of the following schemas {portfolio}
, {project}/{portfolio}
or {entity}/{project}/{portfolio}
. To link the artifact to the Model Registry, rather than to a generic portfolio inside a project, set target_path
to the following schema {"model-registry"}/{Registered Model Name}
or {entity}/{"model-registry"}/{Registered Model Name}
.
aliases
: A list of strings that uniquely identifies the artifact inside the specified portfolio.
Raises:
ArtifactNotLoggedError
: If the artifact is not logged.
Returns:
The linked artifact.
method Artifact.logged_by
Get the W&B run that originally logged the artifact.
Returns:
The name of the W&B run that originally logged the artifact.
Raises:
ArtifactNotLoggedError
: If the artifact is not logged.
method Artifact.new_draft
Create a new draft artifact with the same content as this committed artifact.
Modifying an existing artifact creates a new artifact version known as an “incremental artifact”. The artifact returned can be extended or modified and logged as a new version.
Returns:
An Artifact
object.
Raises:
ArtifactNotLoggedError
: If the artifact is not logged.
method Artifact.new_file
new_file(
name: 'str',
mode: 'str' = 'x',
encoding: 'str | None' = None
) → Iterator[IO]
Open a new temporary file and add it to the artifact.
Args:
name
: The name of the new file to add to the artifact.
mode
: The file access mode to use to open the new file.
encoding
: The encoding used to open the new file.
Returns:
A new file object that can be written to. Upon closing, the file is automatically added to the artifact.
Raises:
ArtifactFinalizedError
: You cannot make changes to the current artifact version because it is finalized. Log a new artifact version instead.
method Artifact.remove
remove(item: 'StrPath | ArtifactManifestEntry') → None
Remove an item from the artifact.
Args:
item
: The item to remove. Can be a specific manifest entry or the name of an artifact-relative path. If the item matches a directory all items in that directory will be removed.
Raises:
ArtifactFinalizedError
: You cannot make changes to the current artifact version because it is finalized. Log a new artifact version instead.
FileNotFoundError
: If the item isn’t found in the artifact.
method Artifact.save
save(
project: 'str | None' = None,
settings: 'wandb.Settings | None' = None
) → None
Persist any changes made to the artifact.
If currently in a run, that run will log this artifact. If not currently in a run, a run of type “auto” is created to track this artifact.
Args:
project
: A project to use for the artifact in the case that a run is not already in context.
settings
: A settings object to use when initializing an automatic run. Most commonly used in testing harness.
method Artifact.unlink
Unlink this artifact if it is currently a member of a promoted collection of artifacts.
Raises:
ArtifactNotLoggedError
: If the artifact is not logged.
ValueError
: If the artifact is not linked, in other words, it is not a member of a portfolio collection.
method Artifact.used_by
Get a list of the runs that have used this artifact and its linked artifacts.
Returns:
A list of Run
objects.
Raises:
ArtifactNotLoggedError
: If the artifact is not logged.
method Artifact.verify
verify(root: 'str | None' = None) → None
Verify that the contents of an artifact match the manifest.
All files in the directory are checksummed and the checksums are then cross-referenced against the artifact’s manifest. References are not verified.
Args:
root
: The directory to verify. If None artifact will be downloaded to ‘./artifacts/self.name/’.
Raises:
ArtifactNotLoggedError
: If the artifact is not logged.
ValueError
: If the verification fails.
method Artifact.wait
wait(timeout: 'int | None' = None) → Artifact
If needed, wait for this artifact to finish logging.
Args:
timeout
: The time, in seconds, to wait.
Returns:
An Artifact
object.
4 - Automations
The W&B Automations API enables programmatic creation and management of automated workflows that respond to events in your ML pipeline. Configure actions to trigger when specific conditions are met, such as model performance thresholds or artifact creation.
Overview
Automations in W&B (wandb.automations
) provide event-driven workflow automation for ML operations. Define triggers based on run metrics, artifact events, or other conditions, and specify actions such as sending notifications or webhooks. Automations execute automatically when their trigger conditions are satisfied, enabling responsive ML pipelines without manual intervention.
Available Components
Core Classes
Class |
Description |
Automation |
Represents a saved automation instance with its configuration. |
NewAutomation |
Builder class for creating new automations. |
Events (Triggers)
Event |
Description |
OnRunMetric |
Trigger when a run metric satisfies a defined condition (threshold, change, etc.). |
OnCreateArtifact |
Trigger when a new artifact is created in a collection. |
OnLinkArtifact |
Trigger when an artifact is linked to a registry. |
OnAddArtifactAlias |
Trigger when an alias is added to an artifact. |
Actions
Action |
Description |
SendNotification |
Send notifications via Slack or other integrated channels. |
SendWebhook |
Send HTTP webhook requests to external services. |
DoNothing |
Placeholder action for testing automation configurations. |
Filters
Common Use Cases
- Alert when model accuracy drops below a threshold
- Notify team when training loss plateaus
- Trigger retraining pipelines based on performance metrics
Artifact Management
- Send notifications when new model versions are created
- Trigger deployment workflows when artifacts are tagged
- Automate downstream processing when datasets are updated
Experiment Tracking
- Alert on failed or crashed runs
- Notify when long-running experiments complete
- Send daily summaries of experiment metrics
Integration Workflows
- Update external tracking systems via webhooks
- Sync model registry with deployment platforms
- Trigger CI/CD pipelines based on W&B events
Configuration
Setting Up Integrations
# Configure Slack integration
from wandb.automations import SlackIntegration
slack = SlackIntegration(
webhook_url="https://hooks.slack.com/services/..."
)
# Use in notification action
notification = SendNotification.from_integration(
integration=slack,
title="ML Alert",
text="Training completed",
level="INFO"
)
Filter Operators
Metric filters support standard comparison operators:
">"
: Greater than
"<"
: Less than
">="
: Greater than or equal
"<="
: Less than or equal
"=="
: Equal to
"!="
: Not equal to
Aggregation Options
For metric filters with windows:
"min"
: Minimum value in window
"max"
: Maximum value in window
"mean"
: Average value in window
"sum"
: Sum of values in window
Usage Notes
- Automations require appropriate permissions in the target project or organization
- Rate limits apply to action executions (notifications, webhooks)
- Filters are evaluated on the W&B backend, not locally
- Disabled automations remain saved but do not trigger
- Test automations with
DoNothing
action before deploying
Example Usage
import wandb
from wandb.automations import OnRunMetric, SendNotification, MetricThresholdFilter
# Initialize W&B
wandb.login()
# Create an automation that alerts when accuracy exceeds 0.95
automation = OnRunMetric(
filter=MetricThresholdFilter(
name="accuracy",
cmp=">",
threshold=0.95
),
scope="entity/project"
).then(
SendNotification(
title="High Accuracy Achieved",
message="Model accuracy exceeded 95%",
severity="INFO"
)
)
# Save the automation
automation.save(name="accuracy-alert", enabled=True)
# Create an automation for artifact creation
artifact_automation = OnCreateArtifact(
scope="entity/project/artifact-collection"
).then(
SendWebhook.from_integration(
integration=webhook_integration,
payload={"event": "new_artifact", "collection": "models"}
)
)
# Save with description
artifact_automation.save(
name="model-webhook",
description="Notify external service on new model creation",
enabled=True
)
# Query existing automations
from wandb.apis.public import Api
api = Api()
automations = api.project("entity/project").automations()
for auto in automations:
print(f"Automation: {auto.name}")
print(f"Enabled: {auto.enabled}")
print(f"Event: {auto.event}")
4.1 - Automation
A local instance of a saved W&B automation.
Attributes:
- action (Union): The action that will execute when this automation is triggered.
- description (Optional): An optional description of this automation.
- enabled (bool): Whether this automation is enabled. Only enabled automations will trigger.
- event (SavedEvent): The event that will trigger this automation.
- name (str): The name of this automation.
- scope (Union): The scope in which the triggering event must occur.
4.2 - DoNothing
Defines an automation action that intentionally does nothing.
Attributes:
- action_type (Literal): The kind of action to be triggered.
- no_op (bool): Placeholder field which exists only to satisfy backend schema requirements.
There should never be a need to set this field explicitly, as its value is ignored.
4.3 - MetricChangeFilter
Defines a filter that compares a change in a run metric against a user-defined threshold.
The change is calculated over “tumbling” windows, i.e. the difference
between the current window and the non-overlapping prior window.
Attributes:
- agg (Optional): Aggregate operation, if any, to apply over the window size.
- change_dir (ChangeDir): No description provided.
- change_type (ChangeType): No description provided.
- name (str): Name of the observed metric.
- prior_window (int): Size of the prior window over which the metric is aggregated (ignored if
agg is None
).
If omitted, defaults to the size of the current window.
- threshold (Union): Threshold value to compare against.
- window (int): Size of the window over which the metric is aggregated (ignored if
agg is None
).
4.4 - MetricThresholdFilter
Defines a filter that compares a run metric against a user-defined threshold value.
Attributes:
- agg (Optional): Aggregate operation, if any, to apply over the window size.
- cmp (Literal): Comparison operator used to compare the metric value (left) vs. the threshold value (right).
- name (str): Name of the observed metric.
- threshold (Union): Threshold value to compare against.
- window (int): Size of the window over which the metric is aggregated (ignored if
agg is None
).
4.5 - NewAutomation
A new automation to be created.
Attributes:
- action (Optional): The action that will execute when this automation is triggered.
- description (Optional): An optional description of this automation.
- enabled (Optional): Whether this automation is enabled. Only enabled automations will trigger.
- event (Optional): The event that will trigger this automation.
- name (Optional): The name of this automation.
4.6 - OnAddArtifactAlias
A new alias is assigned to an artifact.
Attributes:
- event_type (Literal): No description provided.
- filter (Union): Additional condition(s), if any, that must be met for this event to trigger an automation.
- scope (Union): The scope of the event.
method then
then(self, action: 'InputAction') -> 'NewAutomation'
Define a new Automation in which this event triggers the given action.
4.7 - OnCreateArtifact
A new artifact is created.
Attributes:
- event_type (Literal): No description provided.
- filter (Union): Additional condition(s), if any, that must be met for this event to trigger an automation.
- scope (Union): The scope of the event: only artifact collections are valid scopes for this event.
method then
then(self, action: 'InputAction') -> 'NewAutomation'
Define a new Automation in which this event triggers the given action.
4.8 - OnLinkArtifact
A new artifact is linked to a collection.
Attributes:
- event_type (Literal): No description provided.
- filter (Union): Additional condition(s), if any, that must be met for this event to trigger an automation.
- scope (Union): The scope of the event.
method then
then(self, action: 'InputAction') -> 'NewAutomation'
Define a new Automation in which this event triggers the given action.
4.9 - OnRunMetric
A run metric satisfies a user-defined condition.
Attributes:
- event_type (Literal): No description provided.
- filter (RunMetricFilter): Run and/or metric condition(s) that must be satisfied for this event to trigger an automation.
- scope (ProjectScope): The scope of the event: only projects are valid scopes for this event.
method then
then(self, action: 'InputAction') -> 'NewAutomation'
Define a new Automation in which this event triggers the given action.
4.10 - SendNotification
Defines an automation action that sends a (Slack) notification.
Attributes:
- action_type (Literal): The kind of action to be triggered.
- message (str): The message body of the sent notification.
- severity (AlertSeverity): The severity (
INFO
, WARN
, ERROR
) of the sent notification.
- title (str): The title of the sent notification.
method from_integration
from_integration(cls, integration: 'SlackIntegration', *, title: 'str' = '', text: 'str' = '', level: 'AlertSeverity' = <AlertSeverity.INFO: 'INFO'>) -> 'Self'
Define a notification action that sends to the given (Slack) integration.
4.11 - SendWebhook
Defines an automation action that sends a webhook request.
Attributes:
- action_type (Literal): The kind of action to be triggered.
- request_payload (Optional): The payload, possibly with template variables, to send in the webhook request.
method from_integration
from_integration(cls, integration: 'WebhookIntegration', *, payload: 'Optional[SerializedToJson[dict[str, Any]]]' = None) -> 'Self'
Define a webhook action that sends to the given (webhook) integration.
5 - Custom Charts
The W&B Python SDK includes functions for creating specialized charts from your data. These functions generate interactive visualizations that integrate with the W&B UI.
Overview
Custom Charts in W&B (wandb.plot
) are visualization functions that transform data into interactive charts. These functions handle common ML visualization requirements such as confusion matrices, ROC curves, and distribution plots. Custom charts can also be created using Vega-Lite specifications.
Available Chart Functions
Function |
Description |
confusion_matrix() |
Generate confusion matrices for classification performance visualization. |
roc_curve() |
Create Receiver Operating Characteristic curves for binary and multi-class classifiers. |
pr_curve() |
Build Precision-Recall curves for classifier evaluation. |
line() |
Construct line charts from tabular data. |
scatter() |
Create scatter plots for variable relationships. |
bar() |
Generate bar charts for categorical data. |
histogram() |
Build histograms for data distribution analysis. |
line_series() |
Plot multiple line series on a single chart. |
plot_table() |
Create custom charts using Vega-Lite specifications. |
Common Use Cases
Model Evaluation
- Classification:
confusion_matrix()
, roc_curve()
, and pr_curve()
for classifier evaluation
- Regression:
scatter()
for prediction vs. actual plots and histogram()
for residual analysis
Training Monitoring
- Learning Curves:
line()
or line_series()
for tracking metrics over epochs
- Hyperparameter Comparison:
bar()
charts for comparing configurations
Data Analysis
- Distribution Analysis:
histogram()
for feature distributions
- Correlation Analysis:
scatter()
plots for variable relationships
Custom Visualizations
- Vega-Lite Charts:
plot_table()
for domain-specific visualizations
- Multi-Chart Dashboards: Combination of multiple chart types in a single run
Example: Scatter plot
import wandb
import numpy as np
from sklearn.metrics import roc_curve as sklearn_roc_curve
# Initialize a run
wandb.init(project="custom-charts-demo")
# Example 1: Log a confusion matrix
y_true = [0, 1, 2, 0, 1, 2]
y_pred = [0, 2, 2, 0, 1, 1]
class_names = ["class_0", "class_1", "class_2"]
wandb.log({
"conf_mat": wandb.plot.confusion_matrix(
y_true=y_true,
preds=y_pred,
class_names=class_names
)
})
# Example 2: Create an ROC curve for binary classification
y_true_binary = np.array([0, 0, 1, 1, 0, 1])
y_scores = np.array([0.1, 0.4, 0.35, 0.8, 0.2, 0.9])
wandb.log({
"roc": wandb.plot.roc_curve(
y_true=y_true_binary,
y_probas=y_scores,
labels=["negative", "positive"]
)
})
# Example 3: Create a custom line chart from a table
table = wandb.Table(columns=["epoch", "accuracy", "loss"])
for epoch in range(10):
table.add_data(epoch, 0.8 + 0.02 * epoch, 1.0 - 0.1 * epoch)
wandb.log({
"training_progress": wandb.plot.line(
table, x="epoch", y="accuracy",
title="Model Accuracy Over Time"
)
})
# Example 4: Build a scatter plot for feature analysis
data_table = wandb.Table(columns=["feature_1", "feature_2", "label"])
for _ in range(100):
data_table.add_data(
np.random.randn(),
np.random.randn(),
np.random.choice(["A", "B"])
)
wandb.log({
"feature_scatter": wandb.plot.scatter(
data_table, x="feature_1", y="feature_2",
title="Feature Distribution"
)
})
wandb.finish()
5.1 - bar()
function bar
bar(
table: 'wandb.Table',
label: 'str',
value: 'str',
title: 'str' = '',
split_table: 'bool' = False
) → CustomChart
Constructs a bar chart from a wandb.Table of data.
Args:
table
: A table containing the data for the bar chart.
label
: The name of the column to use for the labels of each bar.
value
: The name of the column to use for the values of each bar.
title
: The title of the bar chart.
split_table
: Whether the table should be split into a separate section in the W&B UI. If True
, the table will be displayed in a section named “Custom Chart Tables”. Default is False
.
Returns:
CustomChart
: A custom chart object that can be logged to W&B. To log the chart, pass it to wandb.log()
.
Example:
import random
import wandb
# Generate random data for the table
data = [
["car", random.uniform(0, 1)],
["bus", random.uniform(0, 1)],
["road", random.uniform(0, 1)],
["person", random.uniform(0, 1)],
]
# Create a table with the data
table = wandb.Table(data=data, columns=["class", "accuracy"])
# Initialize a W&B run and log the bar plot
with wandb.init(project="bar_chart") as run:
# Create a bar plot from the table
bar_plot = wandb.plot.bar(
table=table,
label="class",
value="accuracy",
title="Object Classification Accuracy",
)
# Log the bar chart to W&B
run.log({"bar_plot": bar_plot})
5.2 - confusion_matrix()
function confusion_matrix
confusion_matrix(
probs: 'Sequence[Sequence[float]] | None' = None,
y_true: 'Sequence[T] | None' = None,
preds: 'Sequence[T] | None' = None,
class_names: 'Sequence[str] | None' = None,
title: 'str' = 'Confusion Matrix Curve',
split_table: 'bool' = False
) → CustomChart
Constructs a confusion matrix from a sequence of probabilities or predictions.
Args:
probs
: A sequence of predicted probabilities for each class. The sequence shape should be (N, K) where N is the number of samples and K is the number of classes. If provided, preds
should not be provided.
y_true
: A sequence of true labels.
preds
: A sequence of predicted class labels. If provided, probs
should not be provided.
class_names
: Sequence of class names. If not provided, class names will be defined as “Class_1”, “Class_2”, etc.
title
: Title of the confusion matrix chart.
split_table
: Whether the table should be split into a separate section in the W&B UI. If True
, the table will be displayed in a section named “Custom Chart Tables”. Default is False
.
Returns:
CustomChart
: A custom chart object that can be logged to W&B. To log the chart, pass it to wandb.log()
.
Raises:
ValueError
: If both probs
and preds
are provided or if the number of predictions and true labels are not equal. If the number of unique predicted classes exceeds the number of class names or if the number of unique true labels exceeds the number of class names.
wandb.Error
: If numpy is not installed.
Examples:
Logging a confusion matrix with random probabilities for wildlife classification:
import numpy as np
import wandb
# Define class names for wildlife
wildlife_class_names = ["Lion", "Tiger", "Elephant", "Zebra"]
# Generate random true labels (0 to 3 for 10 samples)
wildlife_y_true = np.random.randint(0, 4, size=10)
# Generate random probabilities for each class (10 samples x 4 classes)
wildlife_probs = np.random.rand(10, 4)
wildlife_probs = np.exp(wildlife_probs) / np.sum(
np.exp(wildlife_probs),
axis=1,
keepdims=True,
)
# Initialize W&B run and log confusion matrix
with wandb.init(project="wildlife_classification") as run:
confusion_matrix = wandb.plot.confusion_matrix(
probs=wildlife_probs,
y_true=wildlife_y_true,
class_names=wildlife_class_names,
title="Wildlife Classification Confusion Matrix",
)
run.log({"wildlife_confusion_matrix": confusion_matrix})
In this example, random probabilities are used to generate a confusion matrix.
Logging a confusion matrix with simulated model predictions and 85% accuracy:
import numpy as np
import wandb
# Define class names for wildlife
wildlife_class_names = ["Lion", "Tiger", "Elephant", "Zebra"]
# Simulate true labels for 200 animal images (imbalanced distribution)
wildlife_y_true = np.random.choice(
[0, 1, 2, 3],
size=200,
p=[0.2, 0.3, 0.25, 0.25],
)
# Simulate model predictions with 85% accuracy
wildlife_preds = [
y_t
if np.random.rand() < 0.85
else np.random.choice([x for x in range(4) if x != y_t])
for y_t in wildlife_y_true
]
# Initialize W&B run and log confusion matrix
with wandb.init(project="wildlife_classification") as run:
confusion_matrix = wandb.plot.confusion_matrix(
preds=wildlife_preds,
y_true=wildlife_y_true,
class_names=wildlife_class_names,
title="Simulated Wildlife Classification Confusion Matrix",
)
run.log({"wildlife_confusion_matrix": confusion_matrix})
In this example, predictions are simulated with 85% accuracy to generate a confusion matrix.
5.3 - histogram()
function histogram
histogram(
table: 'wandb.Table',
value: 'str',
title: 'str' = '',
split_table: 'bool' = False
) → CustomChart
Constructs a histogram chart from a W&B Table.
Args:
table
: The W&B Table containing the data for the histogram.
value
: The label for the bin axis (x-axis).
title
: The title of the histogram plot.
split_table
: Whether the table should be split into a separate section in the W&B UI. If True
, the table will be displayed in a section named “Custom Chart Tables”. Default is False
.
Returns:
CustomChart
: A custom chart object that can be logged to W&B. To log the chart, pass it to wandb.log()
.
Example:
import math
import random
import wandb
# Generate random data
data = [[i, random.random() + math.sin(i / 10)] for i in range(100)]
# Create a W&B Table
table = wandb.Table(
data=data,
columns=["step", "height"],
)
# Create a histogram plot
histogram = wandb.plot.histogram(
table,
value="height",
title="My Histogram",
)
# Log the histogram plot to W&B
with wandb.init(...) as run:
run.log({"histogram-plot1": histogram})
5.4 - line_series()
function line_series
line_series(
xs: 'Iterable[Iterable[Any]] | Iterable[Any]',
ys: 'Iterable[Iterable[Any]]',
keys: 'Iterable[str] | None' = None,
title: 'str' = '',
xname: 'str' = 'x',
split_table: 'bool' = False
) → CustomChart
Constructs a line series chart.
Args:
xs
: Sequence of x values. If a singular array is provided, all y values are plotted against that x array. If an array of arrays is provided, each y value is plotted against the corresponding x array.
ys
: Sequence of y values, where each iterable represents a separate line series.
keys
: Sequence of keys for labeling each line series. If not provided, keys will be automatically generated as “line_1”, “line_2”, etc.
title
: Title of the chart.
xname
: Label for the x-axis.
split_table
: Whether the table should be split into a separate section in the W&B UI. If True
, the table will be displayed in a section named “Custom Chart Tables”. Default is False
.
Returns:
CustomChart
: A custom chart object that can be logged to W&B. To log the chart, pass it to wandb.log()
.
Examples:
Logging a single x array where all y series are plotted against the same x values:
import wandb
# Initialize W&B run
with wandb.init(project="line_series_example") as run:
# x values shared across all y series
xs = list(range(10))
# Multiple y series to plot
ys = [
[i for i in range(10)], # y = x
[i**2 for i in range(10)], # y = x^2
[i**3 for i in range(10)], # y = x^3
]
# Generate and log the line series chart
line_series_chart = wandb.plot.line_series(
xs,
ys,
title="title",
xname="step",
)
run.log({"line-series-single-x": line_series_chart})
In this example, a single xs
series (shared x-values) is used for all ys
series. This results in each y-series being plotted against the same x-values (0-9).
Logging multiple x arrays where each y series is plotted against its corresponding x array:
import wandb
# Initialize W&B run
with wandb.init(project="line_series_example") as run:
# Separate x values for each y series
xs = [
[i for i in range(10)], # x for first series
[2 * i for i in range(10)], # x for second series (stretched)
[3 * i for i in range(10)], # x for third series (stretched more)
]
# Corresponding y series
ys = [
[i for i in range(10)], # y = x
[i**2 for i in range(10)], # y = x^2
[i**3 for i in range(10)], # y = x^3
]
# Generate and log the line series chart
line_series_chart = wandb.plot.line_series(
xs, ys, title="Multiple X Arrays Example", xname="Step"
)
run.log({"line-series-multiple-x": line_series_chart})
In this example, each y series is plotted against its own unique x series. This allows for more flexibility when the x values are not uniform across the data series.
Customizing line labels using keys
:
import wandb
# Initialize W&B run
with wandb.init(project="line_series_example") as run:
xs = list(range(10)) # Single x array
ys = [
[i for i in range(10)], # y = x
[i**2 for i in range(10)], # y = x^2
[i**3 for i in range(10)], # y = x^3
]
# Custom labels for each line
keys = ["Linear", "Quadratic", "Cubic"]
# Generate and log the line series chart
line_series_chart = wandb.plot.line_series(
xs,
ys,
keys=keys, # Custom keys (line labels)
title="Custom Line Labels Example",
xname="Step",
)
run.log({"line-series-custom-keys": line_series_chart})
This example shows how to provide custom labels for the lines using the keys
argument. The keys will appear in the legend as “Linear”, “Quadratic”, and “Cubic”.
5.5 - line()
function line
line(
table: 'wandb.Table',
x: 'str',
y: 'str',
stroke: 'str | None' = None,
title: 'str' = '',
split_table: 'bool' = False
) → CustomChart
Constructs a customizable line chart.
Args:
table
: The table containing data for the chart.
x
: Column name for the x-axis values.
y
: Column name for the y-axis values.
stroke
: Column name to differentiate line strokes (e.g., for grouping lines).
title
: Title of the chart.
split_table
: Whether the table should be split into a separate section in the W&B UI. If True
, the table will be displayed in a section named “Custom Chart Tables”. Default is False
.
Returns:
CustomChart
: A custom chart object that can be logged to W&B. To log the chart, pass it to wandb.log()
.
Example:
import math
import random
import wandb
# Create multiple series of data with different patterns
data = []
for i in range(100):
# Series 1: Sinusoidal pattern with random noise
data.append([i, math.sin(i / 10) + random.uniform(-0.1, 0.1), "series_1"])
# Series 2: Cosine pattern with random noise
data.append([i, math.cos(i / 10) + random.uniform(-0.1, 0.1), "series_2"])
# Series 3: Linear increase with random noise
data.append([i, i / 10 + random.uniform(-0.5, 0.5), "series_3"])
# Define the columns for the table
table = wandb.Table(data=data, columns=["step", "value", "series"])
# Initialize wandb run and log the line chart
with wandb.init(project="line_chart_example") as run:
line_chart = wandb.plot.line(
table=table,
x="step",
y="value",
stroke="series", # Group by the "series" column
title="Multi-Series Line Plot",
)
run.log({"line-chart": line_chart})
5.6 - plot_table()
function plot_table
plot_table(
vega_spec_name: 'str',
data_table: 'wandb.Table',
fields: 'dict[str, Any]',
string_fields: 'dict[str, Any] | None' = None,
split_table: 'bool' = False
) → CustomChart
Creates a custom charts using a Vega-Lite specification and a wandb.Table
.
This function creates a custom chart based on a Vega-Lite specification and a data table represented by a wandb.Table
object. The specification needs to be predefined and stored in the W&B backend. The function returns a custom chart object that can be logged to W&B using wandb.Run.log()
.
Args:
vega_spec_name
: The name or identifier of the Vega-Lite spec that defines the visualization structure.
data_table
: A wandb.Table
object containing the data to be visualized.
fields
: A mapping between the fields in the Vega-Lite spec and the corresponding columns in the data table to be visualized.
string_fields
: A dictionary for providing values for any string constants required by the custom visualization.
split_table
: Whether the table should be split into a separate section in the W&B UI. If True
, the table will be displayed in a section named “Custom Chart Tables”. Default is False
.
Returns:
CustomChart
: A custom chart object that can be logged to W&B. To log the chart, pass the chart object as argument to wandb.Run.log()
.
Raises:
wandb.Error
: If data_table
is not a wandb.Table
object.
Example:
# Create a custom chart using a Vega-Lite spec and the data table.
import wandb
data = [[1, 1], [2, 2], [3, 3], [4, 4], [5, 5]]
table = wandb.Table(data=data, columns=["x", "y"])
fields = {"x": "x", "y": "y", "title": "MY TITLE"}
with wandb.init() as run:
# Training code goes here
# Create a custom title with `string_fields`.
my_custom_chart = wandb.plot_table(
vega_spec_name="wandb/line/v0",
data_table=table,
fields=fields,
string_fields={"title": "Title"},
)
run.log({"custom_chart": my_custom_chart})
5.7 - pr_curve()
function pr_curve
pr_curve(
y_true: 'Iterable[T] | None' = None,
y_probas: 'Iterable[numbers.Number] | None' = None,
labels: 'list[str] | None' = None,
classes_to_plot: 'list[T] | None' = None,
interp_size: 'int' = 21,
title: 'str' = 'Precision-Recall Curve',
split_table: 'bool' = False
) → CustomChart
Constructs a Precision-Recall (PR) curve.
The Precision-Recall curve is particularly useful for evaluating classifiers on imbalanced datasets. A high area under the PR curve signifies both high precision (a low false positive rate) and high recall (a low false negative rate). The curve provides insights into the balance between false positives and false negatives at various threshold levels, aiding in the assessment of a model’s performance.
Args:
y_true
: True binary labels. The shape should be (num_samples
,).
y_probas
: Predicted scores or probabilities for each class. These can be probability estimates, confidence scores, or non-thresholded decision values. The shape should be (num_samples
, num_classes
).
labels
: Optional list of class names to replace numeric values in y_true
for easier plot interpretation. For example, labels = ['dog', 'cat', 'owl']
will replace 0 with ‘dog’, 1 with ‘cat’, and 2 with ‘owl’ in the plot. If not provided, numeric values from y_true
will be used.
classes_to_plot
: Optional list of unique class values from y_true to be included in the plot. If not specified, all unique classes in y_true will be plotted.
interp_size
: Number of points to interpolate recall values. The recall values will be fixed to interp_size
uniformly distributed points in the range [0, 1], and the precision will be interpolated accordingly.
title
: Title of the plot. Defaults to “Precision-Recall Curve”.
split_table
: Whether the table should be split into a separate section in the W&B UI. If True
, the table will be displayed in a section named “Custom Chart Tables”. Default is False
.
Returns:
CustomChart
: A custom chart object that can be logged to W&B. To log the chart, pass it to wandb.log()
.
Raises:
wandb.Error
: If NumPy, pandas, or scikit-learn is not installed.
Example:
import wandb
# Example for spam detection (binary classification)
y_true = [0, 1, 1, 0, 1] # 0 = not spam, 1 = spam
y_probas = [
[0.9, 0.1], # Predicted probabilities for the first sample (not spam)
[0.2, 0.8], # Second sample (spam), and so on
[0.1, 0.9],
[0.8, 0.2],
[0.3, 0.7],
]
labels = ["not spam", "spam"] # Optional class names for readability
with wandb.init(project="spam-detection") as run:
pr_curve = wandb.plot.pr_curve(
y_true=y_true,
y_probas=y_probas,
labels=labels,
title="Precision-Recall Curve for Spam Detection",
)
run.log({"pr-curve": pr_curve})
5.8 - roc_curve()
function roc_curve
roc_curve(
y_true: 'Sequence[numbers.Number]',
y_probas: 'Sequence[Sequence[float]] | None' = None,
labels: 'list[str] | None' = None,
classes_to_plot: 'list[numbers.Number] | None' = None,
title: 'str' = 'ROC Curve',
split_table: 'bool' = False
) → CustomChart
Constructs Receiver Operating Characteristic (ROC) curve chart.
Args:
y_true
: The true class labels (ground truth) for the target variable. Shape should be (num_samples,).
y_probas
: The predicted probabilities or decision scores for each class. Shape should be (num_samples, num_classes).
labels
: Human-readable labels corresponding to the class indices in y_true
. For example, if labels=['dog', 'cat']
, class 0 will be displayed as ‘dog’ and class 1 as ‘cat’ in the plot. If None, the raw class indices from y_true
will be used. Default is None.
classes_to_plot
: A subset of unique class labels to include in the ROC curve. If None, all classes in y_true
will be plotted. Default is None.
title
: Title of the ROC curve plot. Default is “ROC Curve”.
split_table
: Whether the table should be split into a separate section in the W&B UI. If True
, the table will be displayed in a section named “Custom Chart Tables”. Default is False
.
Returns:
CustomChart
: A custom chart object that can be logged to W&B. To log the chart, pass it to wandb.log()
.
Raises:
wandb.Error
: If numpy, pandas, or scikit-learn are not found.
Example:
import numpy as np
import wandb
# Simulate a medical diagnosis classification problem with three diseases
n_samples = 200
n_classes = 3
# True labels: assign "Diabetes", "Hypertension", or "Heart Disease" to
# each sample
disease_labels = ["Diabetes", "Hypertension", "Heart Disease"]
# 0: Diabetes, 1: Hypertension, 2: Heart Disease
y_true = np.random.choice([0, 1, 2], size=n_samples)
# Predicted probabilities: simulate predictions, ensuring they sum to 1
# for each sample
y_probas = np.random.dirichlet(np.ones(n_classes), size=n_samples)
# Specify classes to plot (plotting all three diseases)
classes_to_plot = [0, 1, 2]
# Initialize a W&B run and log a ROC curve plot for disease classification
with wandb.init(project="medical_diagnosis") as run:
roc_plot = wandb.plot.roc_curve(
y_true=y_true,
y_probas=y_probas,
labels=disease_labels,
classes_to_plot=classes_to_plot,
title="ROC Curve for Disease Classification",
)
run.log({"roc-curve": roc_plot})
5.9 - scatter()
function scatter
scatter(
table: 'wandb.Table',
x: 'str',
y: 'str',
title: 'str' = '',
split_table: 'bool' = False
) → CustomChart
Constructs a scatter plot from a wandb.Table of data.
Args:
table
: The W&B Table containing the data to visualize.
x
: The name of the column used for the x-axis.
y
: The name of the column used for the y-axis.
title
: The title of the scatter chart.
split_table
: Whether the table should be split into a separate section in the W&B UI. If True
, the table will be displayed in a section named “Custom Chart Tables”. Default is False
.
Returns:
CustomChart
: A custom chart object that can be logged to W&B. To log the chart, pass it to wandb.log()
.
Example:
import math
import random
import wandb
# Simulate temperature variations at different altitudes over time
data = [
[i, random.uniform(-10, 20) - 0.005 * i + 5 * math.sin(i / 50)]
for i in range(300)
]
# Create W&B table with altitude (m) and temperature (°C) columns
table = wandb.Table(data=data, columns=["altitude (m)", "temperature (°C)"])
# Initialize W&B run and log the scatter plot
with wandb.init(project="temperature-altitude-scatter") as run:
# Create and log the scatter plot
scatter_plot = wandb.plot.scatter(
table=table,
x="altitude (m)",
y="temperature (°C)",
title="Altitude vs Temperature",
)
run.log({"altitude-temperature-scatter": scatter_plot})
6 - Public API
The W&B Public API provides programmatic access to query, export, and update data stored in W&B. Use this API for post-hoc analysis, data export, and programmatic management of runs, artifacts, and sweeps.
Training and fine-tuning models is done elsewhere in the W&B Python SDK, not the Public API.
Overview
The Public API (wandb.apis.public
) is designed for querying and managing data after it has been logged to W&B. While the main SDK handles real-time logging during training, the Public API enables you to retrieve historical data, update metadata, manage artifacts, and perform analysis on completed experiments. Access is provided through the main Api
class which serves as the entry point to all functionality.
Available Components
Component |
Description |
Api |
Main entry point for the Public API. Query runs, projects, and artifacts across your organization. |
Runs |
Access and manage individual training runs, including history, logs, and metrics. |
Artifacts |
Query and download model artifacts, datasets, and other versioned files. |
Sweeps |
Access hyperparameter sweep data and analyze optimization results. |
Projects |
Manage projects and access project-level metadata and settings. |
Reports |
Programmatically access and manage W&B Reports. |
Teams |
Query team information and manage team-level resources. |
Users |
Access user profiles and user-specific data. |
Files |
Download and manage files associated with runs. |
History |
Access detailed time-series metrics logged during training. |
Automations |
Manage automated workflows and actions. |
Integrations |
Configure and manage third-party integrations. |
Common Use Cases
Data Export and Analysis
- Export run history as DataFrames for analysis in Jupyter notebooks
- Download metrics for custom visualization or reporting
- Aggregate results across multiple experiments
Post-Hoc Updates
- Update run metadata after completion
- Add tags or notes to completed experiments
- Modify run configurations or summaries
Artifact Management
- Query artifacts by version or alias
- Download model checkpoints programmatically
- Track artifact lineage and dependencies
Sweep Analysis
- Access sweep results and best performing runs
- Export hyperparameter search results
- Analyze parameter importance
Usage Notes
- Read-Only vs. Write Operations: Most API operations are read-only; write operations are limited to metadata updates
- Pagination: Large result sets are automatically paginated for efficient data retrieval
- Filtering: Use MongoDB-style query filters for precise data selection
- Lazy Loading: Data is fetched on-demand to minimize API calls and memory usage
- Authentication: Uses the same authentication as the main W&B SDK
Authentication
The Public API uses the same authentication mechanism as the W&B SDK:
# Option 1: Set environment variable
# export WANDB_API_KEY=your_api_key
# Option 2: Pass API key directly
api = Api(api_key="your_api_key")
# Option 3: Use wandb login
import wandb
wandb.login()
api = Api()
Example Usage
from wandb.apis.public import Api
# Initialize the API client
api = Api()
# Query runs with filters
runs = api.runs(
path="entity/project",
filters={"state": "finished", "config.learning_rate": {"$gte": 0.001}}
)
# Analyze run metrics
for run in runs:
print(f"Run: {run.name}")
print(f"Final accuracy: {run.summary.get('accuracy')}")
# Get detailed history
history = run.history(keys=["loss", "accuracy"])
# Update run metadata
run.tags.append("reviewed")
run.update()
# Access artifacts
artifact = api.artifact("entity/project/model:v1")
artifact_dir = artifact.download()
# Query sweep results
sweep = api.sweep("entity/project/sweep_id")
best_run = sweep.best_run()
print(f"Best parameters: {best_run.config}")
# Export data as DataFrame
import pandas as pd
runs_df = pd.DataFrame([
{**run.config, **run.summary}
for run in runs
])
6.1 - Api
Training and fine-tuning models is done elsewhere in the W&B Python SDK, not the Public API.
module wandb.apis.public
Use the Public API to export or update data that you have saved to W&B.
Before using this API, you’ll want to log data from your script — check the Quickstart for more details.
You might use the Public API to
- update metadata or metrics for an experiment after it has been completed,
- pull down your results as a dataframe for post-hoc analysis in a Jupyter notebook, or
- check your saved model artifacts for those tagged as
ready-to-deploy
.
For more on using the Public API, check out our guide.
class Api
Used for querying the W&B server.
Examples:
method Api.__init__
__init__(
overrides: Optional[Dict[str, Any]] = None,
timeout: Optional[int] = None,
api_key: Optional[str] = None
) → None
Initialize the API.
Args:
overrides
: You can set base_url
if you are
using a W&B server other than
https: //api.wandb.ai
. You can also set defaults for entity
, project
, and run
.
timeout
: HTTP timeout in seconds for API requests. If not specified, the default timeout will be used.
api_key
: API key to use for authentication. If not provided, the API key from the current environment or configuration will be used.
property Api.api_key
Returns W&B API key.
property Api.client
Returns the client object.
property Api.default_entity
Returns the default W&B entity.
property Api.user_agent
Returns W&B public user agent.
property Api.viewer
Returns the viewer object.
Raises:
ValueError
: If viewer data is not able to be fetched from W&B.
requests.RequestException
: If an error occurs while making the graphql request.
method Api.artifact
artifact(name: str, type: Optional[str] = None)
Returns a single artifact.
Args:
name
: The artifact’s name. The name of an artifact resembles a filepath that consists, at a minimum, the name of the project the artifact was logged to, the name of the artifact, and the artifact’s version or alias. Optionally append the entity that logged the artifact as a prefix followed by a forward slash. If no entity is specified in the name, the Run or API setting’s entity is used.
type
: The type of artifact to fetch.
Returns:
An Artifact
object.
Raises:
ValueError
: If the artifact name is not specified.
ValueError
: If the artifact type is specified but does not match the type of the fetched artifact.
Examples:
In the proceeding code snippets “entity”, “project”, “artifact”, “version”, and “alias” are placeholders for your W&B entity, name of the project the artifact is in, the name of the artifact, and artifact’s version, respectively.
import wandb
# Specify the project, artifact's name, and the artifact's alias
wandb.Api().artifact(name="project/artifact:alias")
# Specify the project, artifact's name, and a specific artifact version
wandb.Api().artifact(name="project/artifact:version")
# Specify the entity, project, artifact's name, and the artifact's alias
wandb.Api().artifact(name="entity/project/artifact:alias")
# Specify the entity, project, artifact's name, and a specific artifact version
wandb.Api().artifact(name="entity/project/artifact:version")
Note:
This method is intended for external use only. Do not call api.artifact()
within the wandb repository code.
method Api.artifact_collection
artifact_collection(type_name: str, name: str) → public.ArtifactCollection
Returns a single artifact collection by type.
You can use the returned ArtifactCollection
object to retrieve information about specific artifacts in that collection, and more.
Args:
type_name
: The type of artifact collection to fetch.
name
: An artifact collection name. Optionally append the entity that logged the artifact as a prefix followed by a forward slash.
Returns:
An ArtifactCollection
object.
Examples:
In the proceeding code snippet “type”, “entity”, “project”, and “artifact_name” are placeholders for the collection type, your W&B entity, name of the project the artifact is in, and the name of the artifact, respectively.
import wandb
collections = wandb.Api().artifact_collection(
type_name="type", name="entity/project/artifact_name"
)
# Get the first artifact in the collection
artifact_example = collections.artifacts()[0]
# Download the contents of the artifact to the specified root directory.
artifact_example.download()
method Api.artifact_collection_exists
artifact_collection_exists(name: str, type: str) → bool
Whether an artifact collection exists within a specified project and entity.
Args:
name
: An artifact collection name. Optionally append the entity that logged the artifact as a prefix followed by a forward slash. If entity or project is not specified, infer the collection from the override params if they exist. Otherwise, entity is pulled from the user settings and project will default to “uncategorized”.
type
: The type of artifact collection.
Returns:
True if the artifact collection exists, False otherwise.
Examples:
In the proceeding code snippet “type”, and “collection_name” refer to the type of the artifact collection and the name of the collection, respectively.
import wandb
wandb.Api.artifact_collection_exists(type="type", name="collection_name")
method Api.artifact_collections
artifact_collections(
project_name: str,
type_name: str,
per_page: int = 50
) → public.ArtifactCollections
Returns a collection of matching artifact collections.
Args:
project_name
: The name of the project to filter on.
type_name
: The name of the artifact type to filter on.
per_page
: Sets the page size for query pagination. None will use the default size. Usually there is no reason to change this.
Returns:
An iterable ArtifactCollections
object.
method Api.artifact_exists
artifact_exists(name: str, type: Optional[str] = None) → bool
Whether an artifact version exists within the specified project and entity.
Args:
name
: The name of artifact. Add the artifact’s entity and project as a prefix. Append the version or the alias of the artifact with a colon. If the entity or project is not specified, W&B uses override parameters if populated. Otherwise, the entity is pulled from the user settings and the project is set to “Uncategorized”.
type
: The type of artifact.
Returns:
True if the artifact version exists, False otherwise.
Examples:
In the proceeding code snippets “entity”, “project”, “artifact”, “version”, and “alias” are placeholders for your W&B entity, name of the project the artifact is in, the name of the artifact, and artifact’s version, respectively.
import wandb
wandb.Api().artifact_exists("entity/project/artifact:version")
wandb.Api().artifact_exists("entity/project/artifact:alias")
method Api.artifact_type
artifact_type(
type_name: str,
project: Optional[str] = None
) → public.ArtifactType
Returns the matching ArtifactType
.
Args:
type_name
: The name of the artifact type to retrieve.
project
: If given, a project name or path to filter on.
Returns:
An ArtifactType
object.
method Api.artifact_types
artifact_types(project: Optional[str] = None) → public.ArtifactTypes
Returns a collection of matching artifact types.
Args:
project
: The project name or path to filter on.
Returns:
An iterable ArtifactTypes
object.
method Api.artifact_versions
artifact_versions(type_name, name, per_page=50)
Deprecated. Use Api.artifacts(type_name, name)
method instead.
method Api.artifacts
artifacts(
type_name: str,
name: str,
per_page: int = 50,
tags: Optional[List[str]] = None
) → public.Artifacts
Return an Artifacts
collection.
Args:
type_name: The type of artifacts to fetch. name: The artifact’s collection name. Optionally append the entity that logged the artifact as a prefix followed by a forward slash. per_page: Sets the page size for query pagination. If set to None
, use the default size. Usually there is no reason to change this. tags: Only return artifacts with all of these tags.
Returns:
An iterable Artifacts
object.
Examples:
In the proceeding code snippet, “type”, “entity”, “project”, and “artifact_name” are placeholders for the artifact type, W&B entity, name of the project the artifact was logged to, and the name of the artifact, respectively.
import wandb
wandb.Api().artifacts(type_name="type", name="entity/project/artifact_name")
method Api.automation
automation(name: str, entity: Optional[str] = None) → Automation
Returns the only Automation matching the parameters.
Args:
name
: The name of the automation to fetch.
entity
: The entity to fetch the automation for.
Raises:
ValueError
: If zero or multiple Automations match the search criteria.
Examples:
Get an existing automation named “my-automation”:
import wandb
api = wandb.Api()
automation = api.automation(name="my-automation")
Get an existing automation named “other-automation”, from the entity “my-team”:
automation = api.automation(name="other-automation", entity="my-team")
method Api.automations
automations(
entity: Optional[str] = None,
name: Optional[str] = None,
per_page: int = 50
) → Iterator[ForwardRef('Automation')]
Returns an iterator over all Automations that match the given parameters.
If no parameters are provided, the returned iterator will contain all Automations that the user has access to.
Args:
entity
: The entity to fetch the automations for.
name
: The name of the automation to fetch.
per_page
: The number of automations to fetch per page. Defaults to 50. Usually there is no reason to change this.
Returns:
A list of automations.
Examples:
Fetch all existing automations for the entity “my-team”:
import wandb
api = wandb.Api()
automations = api.automations(entity="my-team")
method Api.create_automation
create_automation(
obj: 'NewAutomation',
fetch_existing: bool = False,
**kwargs: typing_extensions.Unpack[ForwardRef('WriteAutomationsKwargs')]
) → Automation
Create a new Automation.
Args:
obj: The automation to create. fetch_existing: If True, and a conflicting automation already exists, attempt to fetch the existing automation instead of raising an error. **kwargs: Any additional values to assign to the automation before creating it. If given, these will override any values that may already be set on the automation:
- name
: The name of the automation.
- description
: The description of the automation.
- enabled
: Whether the automation is enabled.
- scope
: The scope of the automation.
- event
: The event that triggers the automation.
- action
: The action that is triggered by the automation.
Returns:
The saved Automation.
Examples:
Create a new automation named “my-automation” that sends a Slack notification when a run within a specific project logs a metric exceeding a custom threshold:
import wandb
from wandb.automations import OnRunMetric, RunEvent, SendNotification
api = wandb.Api()
project = api.project("my-project", entity="my-team")
# Use the first Slack integration for the team
slack_hook = next(api.slack_integrations(entity="my-team"))
event = OnRunMetric(
scope=project,
filter=RunEvent.metric("custom-metric") > 10,
)
action = SendNotification.from_integration(slack_hook)
automation = api.create_automation(
event >> action,
name="my-automation",
description="Send a Slack message whenever 'custom-metric' exceeds 10.",
)
method Api.create_custom_chart
create_custom_chart(
entity: str,
name: str,
display_name: str,
spec_type: Literal['vega2'],
access: Literal['private', 'public'],
spec: Union[str, dict]
) → str
Create a custom chart preset and return its id.
Args:
entity
: The entity (user or team) that owns the chart
name
: Unique identifier for the chart preset
display_name
: Human-readable name shown in the UI
spec_type
: Type of specification. Must be “vega2” for Vega-Lite v2 specifications.
access
: Access level for the chart:
- “private”: Chart is only accessible to the entity that created it
- “public”: Chart is publicly accessible
spec
: The Vega/Vega-Lite specification as a dictionary or JSON string
Returns:
The ID of the created chart preset in the format “entity/name”
Raises:
wandb.Error
: If chart creation fails
UnsupportedError
: If the server doesn’t support custom charts
Example:
import wandb
api = wandb.Api()
# Define a simple bar chart specification
vega_spec = {
"$schema": "https://vega.github.io/schema/vega-lite/v6.json",
"mark": "bar",
"data": {"name": "wandb"},
"encoding": {
"x": {"field": "${field:x}", "type": "ordinal"},
"y": {"field": "${field:y}", "type": "quantitative"},
},
}
# Create the custom chart
chart_id = api.create_custom_chart(
entity="my-team",
name="my-bar-chart",
display_name="My Custom Bar Chart",
spec_type="vega2",
access="private",
spec=vega_spec,
)
# Use with wandb.plot_table()
chart = wandb.plot_table(
vega_spec_name=chart_id,
data_table=my_table,
fields={"x": "category", "y": "value"},
)
```
---
### <kbd>method</kbd> `Api.create_project`
```python
create_project(name: str, entity: str) → None
Create a new project.
Args:
name
: The name of the new project.
entity
: The entity of the new project.
method Api.create_registry
create_registry(
name: str,
visibility: Literal['organization', 'restricted'],
organization: Optional[str] = None,
description: Optional[str] = None,
artifact_types: Optional[List[str]] = None
) → Registry
Create a new registry.
Args:
name
: The name of the registry. Name must be unique within the organization.
visibility
: The visibility of the registry.
organization
: Anyone in the organization can view this registry. You can edit their roles later from the settings in the UI.
restricted
: Only invited members via the UI can access this registry. Public sharing is disabled.
organization
: The organization of the registry. If no organization is set in the settings, the organization will be fetched from the entity if the entity only belongs to one organization.
description
: The description of the registry.
artifact_types
: The accepted artifact types of the registry. A type is no
more than 128 characters and do not include characters
/or ``:
. If not specified, all types are accepted. Allowed types added to the registry cannot be removed later.
Returns:
A registry object.
Examples:
import wandb
api = wandb.Api()
registry = api.create_registry(
name="my-registry",
visibility="restricted",
organization="my-org",
description="This is a test registry",
artifact_types=["model"],
)
method Api.create_run
create_run(
run_id: Optional[str] = None,
project: Optional[str] = None,
entity: Optional[str] = None
) → public.Run
Create a new run.
Args:
run_id
: The ID to assign to the run. If not specified, W&B creates a random ID.
project
: The project where to log the run to. If no project is specified, log the run to a project called “Uncategorized”.
entity
: The entity that owns the project. If no entity is specified, log the run to the default entity.
Returns:
The newly created Run
.
method Api.create_run_queue
create_run_queue(
name: str,
type: 'public.RunQueueResourceType',
entity: Optional[str] = None,
prioritization_mode: Optional[ForwardRef('public.RunQueuePrioritizationMode')] = None,
config: Optional[dict] = None,
template_variables: Optional[dict] = None
) → public.RunQueue
Create a new run queue in W&B Launch.
Args:
name
: Name of the queue to create
type
: Type of resource to be used for the queue. One of “local-container”, “local-process”, “kubernetes”,“sagemaker”, or “gcp-vertex”.
entity
: Name of the entity to create the queue. If None
, use the configured or default entity.
prioritization_mode
: Version of prioritization to use. Either “V0” or None
.
config
: Default resource configuration to be used for the queue. Use handlebars (eg. {{var}}
) to specify template variables.
template_variables
: A dictionary of template variable schemas to use with the config.
Returns:
The newly created RunQueue
.
Raises:
ValueError
if any of the parameters are invalid wandb.Error
on wandb API errors
method Api.create_team
create_team(team: str, admin_username: Optional[str] = None) → public.Team
Create a new team.
Args:
team
: The name of the team
admin_username
: Username of the admin user of the team. Defaults to the current user.
Returns:
A Team
object.
method Api.create_user
create_user(email: str, admin: Optional[bool] = False)
Create a new user.
Args:
email
: The email address of the user.
admin
: Set user as a global instance administrator.
Returns:
A User
object.
method Api.delete_automation
delete_automation(obj: Union[ForwardRef('Automation'), str]) → Literal[True]
Delete an automation.
Args:
obj
: The automation to delete, or its ID.
Returns:
True if the automation was deleted successfully.
method Api.flush
Flush the local cache.
The api object keeps a local cache of runs, so if the state of the run may change while executing your script you must clear the local cache with api.flush()
to get the latest values associated with the run.
method Api.from_path
Return a run, sweep, project or report from a path.
Args:
path
: The path to the project, run, sweep or report
Returns:
A Project
, Run
, Sweep
, or BetaReport
instance.
Raises:
wandb.Error
if path is invalid or the object doesn’t exist.
Examples:
In the proceeding code snippets “project”, “team”, “run_id”, “sweep_id”, and “report_name” are placeholders for the project, team, run ID, sweep ID, and the name of a specific report, respectively.
import wandb
api = wandb.Api()
project = api.from_path("project")
team_project = api.from_path("team/project")
run = api.from_path("team/project/runs/run_id")
sweep = api.from_path("team/project/sweeps/sweep_id")
report = api.from_path("team/project/reports/report_name")
method Api.integrations
integrations(
entity: Optional[str] = None,
per_page: int = 50
) → Iterator[ForwardRef('Integration')]
Return an iterator of all integrations for an entity.
Args:
entity
: The entity (e.g. team name) for which to fetch integrations. If not provided, the user’s default entity will be used.
per_page
: Number of integrations to fetch per page. Defaults to 50. Usually there is no reason to change this.
Yields:
Iterator[SlackIntegration | WebhookIntegration]
: An iterator of any supported integrations.
method Api.job
job(name: Optional[str], path: Optional[str] = None) → public.Job
Return a Job
object.
Args:
name
: The name of the job.
path
: The root path to download the job artifact.
Returns:
A Job
object.
method Api.list_jobs
list_jobs(entity: str, project: str) → List[Dict[str, Any]]
Return a list of jobs, if any, for the given entity and project.
Args:
entity
: The entity for the listed jobs.
project
: The project for the listed jobs.
Returns:
A list of matching jobs.
method Api.project
project(name: str, entity: Optional[str] = None) → public.Project
Return the Project
with the given name (and entity, if given).
Args:
name
: The project name.
entity
: Name of the entity requested. If None, will fall back to the default entity passed to Api
. If no default entity, will raise a ValueError
.
Returns:
A Project
object.
method Api.projects
projects(entity: Optional[str] = None, per_page: int = 200) → public.Projects
Get projects for a given entity.
Args:
entity
: Name of the entity requested. If None, will fall back to the default entity passed to Api
. If no default entity, will raise a ValueError
.
per_page
: Sets the page size for query pagination. If set to None
, use the default size. Usually there is no reason to change this.
Returns:
A Projects
object which is an iterable collection of Project
objects.
method Api.queued_run
queued_run(
entity: str,
project: str,
queue_name: str,
run_queue_item_id: str,
project_queue=None,
priority=None
)
Return a single queued run based on the path.
Parses paths of the form entity/project/queue_id/run_queue_item_id
.
method Api.registries
registries(
organization: Optional[str] = None,
filter: Optional[Dict[str, Any]] = None
) → Registries
Returns a lazy iterator of Registry
objects.
Use the iterator to search and filter registries, collections, or artifact versions across your organization’s registry.
Args:
organization
: (str, optional) The organization of the registry to fetch. If not specified, use the organization specified in the user’s settings.
filter
: (dict, optional) MongoDB-style filter to apply to each object in the lazy registry iterator. Fields available to filter for registries are name
, description
, created_at
, updated_at
. Fields available to filter for collections are name
, tag
, description
, created_at
, updated_at
Fields available to filter for versions are tag
, alias
, created_at
, updated_at
, metadata
Returns:
A lazy iterator of Registry
objects.
Examples:
Find all registries with the names that contain “model”
import wandb
api = wandb.Api() # specify an org if your entity belongs to multiple orgs
api.registries(filter={"name": {"$regex": "model"}})
Find all collections in the registries with the name “my_collection” and the tag “my_tag”
api.registries().collections(filter={"name": "my_collection", "tag": "my_tag"})
Find all artifact versions in the registries with a collection name that contains “my_collection” and a version that has the alias “best”
api.registries().collections(
filter={"name": {"$regex": "my_collection"}}
).versions(filter={"alias": "best"})
Find all artifact versions in the registries that contain “model” and have the tag “prod” or alias “best”
api.registries(filter={"name": {"$regex": "model"}}).versions(
filter={"$or": [{"tag": "prod"}, {"alias": "best"}]}
)
method Api.registry
registry(name: str, organization: Optional[str] = None) → Registry
Return a registry given a registry name.
Args:
name
: The name of the registry. This is without the wandb-registry-
prefix.
organization
: The organization of the registry. If no organization is set in the settings, the organization will be fetched from the entity if the entity only belongs to one organization.
Returns:
A registry object.
Examples:
Fetch and update a registry
import wandb
api = wandb.Api()
registry = api.registry(name="my-registry", organization="my-org")
registry.description = "This is an updated description"
registry.save()
method Api.reports
reports(
path: str = '',
name: Optional[str] = None,
per_page: int = 50
) → public.Reports
Get reports for a given project path.
Note: wandb.Api.reports()
API is in beta and will likely change in future releases.
Args:
path
: The path to the project the report resides in. Specify the entity that created the project as a prefix followed by a forward slash.
name
: Name of the report requested.
per_page
: Sets the page size for query pagination. If set to None
, use the default size. Usually there is no reason to change this.
Returns:
A Reports
object which is an iterable collection of BetaReport
objects.
Examples:
import wandb
wandb.Api.reports("entity/project")
method Api.run
Return a single run by parsing path in the form entity/project/run_id
.
Args:
path
: Path to run in the form entity/project/run_id
. If api.entity
is set, this can be in the form project/run_id
and if api.project
is set this can just be the run_id.
Returns:
A Run
object.
method Api.run_queue
run_queue(entity: str, name: str)
Return the named RunQueue
for entity.
See Api.create_run_queue
for more information on how to create a run queue.
method Api.runs
runs(
path: Optional[str] = None,
filters: Optional[Dict[str, Any]] = None,
order: str = '+created_at',
per_page: int = 50,
include_sweeps: bool = True
)
Returns a Runs
object, which lazily iterates over Run
objects.
Fields you can filter by include:
createdAt
: The timestamp when the run was created. (in ISO 8601 format, e.g. “2023-01-01T12:00:00Z”)
displayName
: The human-readable display name of the run. (e.g. “eager-fox-1”)
duration
: The total runtime of the run in seconds.
group
: The group name used to organize related runs together.
host
: The hostname where the run was executed.
jobType
: The type of job or purpose of the run.
name
: The unique identifier of the run. (e.g. “a1b2cdef”)
state
: The current state of the run.
tags
: The tags associated with the run.
username
: The username of the user who initiated the run
Additionally, you can filter by items in the run config or summary metrics. Such as config.experiment_name
, summary_metrics.loss
, etc.
For more complex filtering, you can use MongoDB query operators. For details, see: https://docs.mongodb.com/manual/reference/operator/query The following operations are supported:
$and
$or
$nor
$eq
$ne
$gt
$gte
$lt
$lte
$in
$nin
$exists
$regex
Args:
path
: (str) path to project, should be in the form: “entity/project”
filters
: (dict) queries for specific runs using the MongoDB query language. You can filter by run properties such as config.key, summary_metrics.key, state, entity, createdAt, etc.
For example
: {"config.experiment_name": "foo"}
would find runs with a config entry of experiment name set to “foo”
order
: (str) Order can be created_at
, heartbeat_at
, config.*.value
, or summary_metrics.*
. If you prepend order with a + order is ascending (default). If you prepend order with a - order is descending. The default order is run.created_at from oldest to newest.
per_page
: (int) Sets the page size for query pagination.
include_sweeps
: (bool) Whether to include the sweep runs in the results.
Returns:
A Runs
object, which is an iterable collection of Run
objects.
Examples:
# Find runs in project where config.experiment_name has been set to "foo"
api.runs(path="my_entity/project", filters={"config.experiment_name": "foo"})
# Find runs in project where config.experiment_name has been set to "foo" or "bar"
api.runs(
path="my_entity/project",
filters={
"$or": [
{"config.experiment_name": "foo"},
{"config.experiment_name": "bar"},
]
},
)
# Find runs in project where config.experiment_name matches a regex
# (anchors are not supported)
api.runs(
path="my_entity/project",
filters={"config.experiment_name": {"$regex": "b.*"}},
)
# Find runs in project where the run name matches a regex
# (anchors are not supported)
api.runs(
path="my_entity/project", filters={"display_name": {"$regex": "^foo.*"}}
)
# Find runs in project sorted by ascending loss
api.runs(path="my_entity/project", order="+summary_metrics.loss")
method Api.slack_integrations
slack_integrations(
entity: Optional[str] = None,
per_page: int = 50
) → Iterator[ForwardRef('SlackIntegration')]
Returns an iterator of Slack integrations for an entity.
Args:
entity
: The entity (e.g. team name) for which to fetch integrations. If not provided, the user’s default entity will be used.
per_page
: Number of integrations to fetch per page. Defaults to 50. Usually there is no reason to change this.
Yields:
Iterator[SlackIntegration]
: An iterator of Slack integrations.
Examples:
Get all registered Slack integrations for the team “my-team”:
import wandb
api = wandb.Api()
slack_integrations = api.slack_integrations(entity="my-team")
Find only Slack integrations that post to channel names starting with “team-alerts-”:
slack_integrations = api.slack_integrations(entity="my-team")
team_alert_integrations = [
ig
for ig in slack_integrations
if ig.channel_name.startswith("team-alerts-")
]
method Api.sweep
Return a sweep by parsing path in the form entity/project/sweep_id
.
Args:
path
: Path to sweep in the form entity/project/sweep_id. If api.entity
is set, this can be in the form project/sweep_id and if api.project
is set this can just be the sweep_id.
Returns:
A Sweep
object.
method Api.sync_tensorboard
sync_tensorboard(root_dir, run_id=None, project=None, entity=None)
Sync a local directory containing tfevent files to wandb.
method Api.team
team(team: str) → public.Team
Return the matching Team
with the given name.
Args:
team
: The name of the team.
Returns:
A Team
object.
method Api.update_automation
update_automation(
obj: 'Automation',
create_missing: bool = False,
**kwargs: typing_extensions.Unpack[ForwardRef('WriteAutomationsKwargs')]
) → Automation
Update an existing automation.
Args:
obj
: The automation to update. Must be an existing automation. create_missing (bool): If True, and the automation does not exist, create it. **kwargs: Any additional values to assign to the automation before updating it. If given, these will override any values that may already be set on the automation:
- name
: The name of the automation.
- description
: The description of the automation.
- enabled
: Whether the automation is enabled.
- scope
: The scope of the automation.
- event
: The event that triggers the automation.
- action
: The action that is triggered by the automation.
Returns:
The updated automation.
Examples:
Disable and edit the description of an existing automation (“my-automation”):
import wandb
api = wandb.Api()
automation = api.automation(name="my-automation")
automation.enabled = False
automation.description = "Kept for reference, but no longer used."
updated_automation = api.update_automation(automation)
OR
import wandb
api = wandb.Api()
automation = api.automation(name="my-automation")
updated_automation = api.update_automation(
automation,
enabled=False,
description="Kept for reference, but no longer used.",
)
method Api.upsert_run_queue
upsert_run_queue(
name: str,
resource_config: dict,
resource_type: 'public.RunQueueResourceType',
entity: Optional[str] = None,
template_variables: Optional[dict] = None,
external_links: Optional[dict] = None,
prioritization_mode: Optional[ForwardRef('public.RunQueuePrioritizationMode')] = None
)
Upsert a run queue in W&B Launch.
Args:
name
: Name of the queue to create
entity
: Optional name of the entity to create the queue. If None
, use the configured or default entity.
resource_config
: Optional default resource configuration to be used for the queue. Use handlebars (eg. {{var}}
) to specify template variables.
resource_type
: Type of resource to be used for the queue. One of “local-container”, “local-process”, “kubernetes”, “sagemaker”, or “gcp-vertex”.
template_variables
: A dictionary of template variable schemas to be used with the config.
external_links
: Optional dictionary of external links to be used with the queue.
prioritization_mode
: Optional version of prioritization to use. Either “V0” or None
Returns:
The upserted RunQueue
.
Raises:
ValueError if any of the parameters are invalid wandb.Error on wandb API errors
method Api.user
user(username_or_email: str) → Optional[ForwardRef('public.User')]
Return a user from a username or email address.
This function only works for local administrators. Use api.viewer
to get your own user object.
Args:
username_or_email
: The username or email address of the user.
Returns:
A User
object or None if a user is not found.
method Api.users
users(username_or_email: str) → List[ForwardRef('public.User')]
Return all users from a partial username or email address query.
This function only works for local administrators. Use api.viewer
to get your own user object.
Args:
username_or_email
: The prefix or suffix of the user you want to find.
Returns:
An array of User
objects.
method Api.webhook_integrations
webhook_integrations(
entity: Optional[str] = None,
per_page: int = 50
) → Iterator[ForwardRef('WebhookIntegration')]
Returns an iterator of webhook integrations for an entity.
Args:
entity
: The entity (e.g. team name) for which to fetch integrations. If not provided, the user’s default entity will be used.
per_page
: Number of integrations to fetch per page. Defaults to 50. Usually there is no reason to change this.
Yields:
Iterator[WebhookIntegration]
: An iterator of webhook integrations.
Examples:
Get all registered webhook integrations for the team “my-team”:
import wandb
api = wandb.Api()
webhook_integrations = api.webhook_integrations(entity="my-team")
Find only webhook integrations that post requests to “https://my-fake-url.com”:
webhook_integrations = api.webhook_integrations(entity="my-team")
my_webhooks = [
ig
for ig in webhook_integrations
if ig.url_endpoint.startswith("https://my-fake-url.com")
]
6.2 - Artifacts
Training and fine-tuning models is done elsewhere in the W&B Python SDK, not the Public API.
module wandb.apis.public
W&B Public API for Artifact objects.
This module provides classes for interacting with W&B artifacts and their collections.
class ArtifactTypes
An lazy iterator of ArtifactType
objects for a specific project.
class ArtifactType
An artifact object that satisfies query based on the specified type.
Args:
client
: The client instance to use for querying W&B.
entity
: The entity (user or team) that owns the project.
project
: The name of the project to query for artifact types.
type_name
: The name of the artifact type.
attrs
: Optional mapping of attributes to initialize the artifact type. If not provided, the object will load its attributes from W&B upon initialization.
property ArtifactType.id
The unique identifier of the artifact type.
property ArtifactType.name
The name of the artifact type.
method ArtifactType.collection
collection(name: 'str') → ArtifactCollection
Get a specific artifact collection by name.
Args:
name
(str): The name of the artifact collection to retrieve.
method ArtifactType.collections
collections(per_page: 'int' = 50) → ArtifactCollections
Get all artifact collections associated with this artifact type.
Args:
per_page
(int): The number of artifact collections to fetch per page. Default is 50.
class ArtifactCollections
Artifact collections of a specific type in a project.
Args:
client
: The client instance to use for querying W&B.
entity
: The entity (user or team) that owns the project.
project
: The name of the project to query for artifact collections.
type_name
: The name of the artifact type for which to fetch collections.
per_page
: The number of artifact collections to fetch per page. Default is 50.
property ArtifactCollections.length
class ArtifactCollection
An artifact collection that represents a group of related artifacts.
Args:
client
: The client instance to use for querying W&B.
entity
: The entity (user or team) that owns the project.
project
: The name of the project to query for artifact collections.
name
: The name of the artifact collection.
type
: The type of the artifact collection (e.g., “dataset”, “model”).
organization
: Optional organization name if applicable.
attrs
: Optional mapping of attributes to initialize the artifact collection. If not provided, the object will load its attributes from W&B upon initialization.
property ArtifactCollection.aliases
Artifact Collection Aliases.
property ArtifactCollection.created_at
The creation date of the artifact collection.
property ArtifactCollection.description
A description of the artifact collection.
property ArtifactCollection.id
The unique identifier of the artifact collection.
property ArtifactCollection.name
The name of the artifact collection.
The tags associated with the artifact collection.
property ArtifactCollection.type
Returns the type of the artifact collection.
method ArtifactCollection.artifacts
artifacts(per_page: 'int' = 50) → Artifacts
Get all artifacts in the collection.
method ArtifactCollection.change_type
change_type(new_type: 'str') → None
Deprecated, change type directly with save
instead.
method ArtifactCollection.delete
Delete the entire artifact collection.
method ArtifactCollection.is_sequence
Return whether the artifact collection is a sequence.
method ArtifactCollection.save
Persist any changes made to the artifact collection.
class Artifacts
An iterable collection of artifact versions associated with a project.
Optionally pass in filters to narrow down the results based on specific criteria.
Args:
client
: The client instance to use for querying W&B.
entity
: The entity (user or team) that owns the project.
project
: The name of the project to query for artifacts.
collection_name
: The name of the artifact collection to query.
type
: The type of the artifacts to query. Common examples include “dataset” or “model”.
filters
: Optional mapping of filters to apply to the query.
order
: Optional string to specify the order of the results.
per_page
: The number of artifact versions to fetch per page. Default is 50.
tags
: Optional string or list of strings to filter artifacts by tags.
property Artifacts.length
class RunArtifacts
An iterable collection of artifacts associated with a specific run.
property RunArtifacts.length
class ArtifactFiles
A paginator for files in an artifact.
property ArtifactFiles.length
property ArtifactFiles.path
Returns the path of the artifact.
6.3 - Automations
Training and fine-tuning models is done elsewhere in the W&B Python SDK, not the Public API.
module wandb.apis.public
W&B Public API for Automation objects.
class Automations
An lazy iterator of Automation
objects.
6.4 - Files
Training and fine-tuning models is done elsewhere in the W&B Python SDK, not the Public API.
module wandb.apis.public
W&B Public API for File objects.
This module provides classes for interacting with files stored in W&B.
Example:
from wandb.apis.public import Api
# Get files from a specific run
run = Api().run("entity/project/run_id")
files = run.files()
# Work with files
for file in files:
print(f"File: {file.name}")
print(f"Size: {file.size} bytes")
print(f"Type: {file.mimetype}")
# Download file
if file.size < 1000000: # Less than 1MB
file.download(root="./downloads")
# Get S3 URI for large files
if file.size >= 1000000:
print(f"S3 URI: {file.path_uri}")
Note:
This module is part of the W&B Public API and provides methods to access, download, and manage files stored in W&B. Files are typically associated with specific runs and can include model weights, datasets, visualizations, and other artifacts.
class Files
A lazy iterator over a collection of File
objects.
Access and manage files uploaded to W&B during a run. Handles pagination automatically when iterating through large collections of files.
Example:
from wandb.apis.public.files import Files
from wandb.apis.public.api import Api
# Example run object
run = Api().run("entity/project/run-id")
# Create a Files object to iterate over files in the run
files = Files(api.client, run)
# Iterate over files
for file in files:
print(file.name)
print(file.url)
print(file.size)
# Download the file
file.download(root="download_directory", replace=True)
method Files.__init__
__init__(
client: 'RetryingClient',
run: 'Run',
names: 'list[str] | None' = None,
per_page: 'int' = 50,
upload: 'bool' = False,
pattern: 'str | None' = None
)
Initialize a lazy iterator over a collection of File
objects.
Files are retrieved in pages from the W&B server as needed.
Args:
client: The run object that contains the files run: The run object that contains the files names (list, optional): A list of file names to filter the files per_page (int, optional): The number of files to fetch per page upload (bool, optional): If True
, fetch the upload URL for each file pattern (str, optional): Pattern to match when returning files from W&B This pattern uses mySQL’s LIKE syntax, so matching all files that end with .json would be “%.json”. If both names and pattern are provided, a ValueError will be raised.
property Files.length
6.5 - History
Training and fine-tuning models is done elsewhere in the W&B Python SDK, not the Public API.
module wandb.apis.public
W&B Public API for Run History.
This module provides classes for efficiently scanning and sampling run history data.
Note:
This module is part of the W&B Public API and provides methods to access run history data. It handles pagination automatically and offers both complete and sampled access to metrics logged during training runs.
6.6 - Integrations
Training and fine-tuning models is done elsewhere in the W&B Python SDK, not the Public API.
module wandb.apis.public
W&B Public API for integrations.
This module provides classes for interacting with W&B integrations.
class Integrations
An lazy iterator of Integration
objects.
method Integrations.__init__
__init__(client: '_Client', variables: 'dict[str, Any]', per_page: 'int' = 50)
method Integrations.convert_objects
convert_objects() → Iterable[Integration]
Parse the page data into a list of integrations.
6.7 - Projects
Training and fine-tuning models is done elsewhere in the W&B Python SDK, not the Public API.
module wandb.apis.public
W&B Public API for Project objects.
This module provides classes for interacting with W&B projects and their associated data.
Example:
from wandb.apis.public import Api
# Get all projects for an entity
projects = Api().projects("entity")
# Access project data
for project in projects:
print(f"Project: {project.name}")
print(f"URL: {project.url}")
# Get artifact types
for artifact_type in project.artifacts_types():
print(f"Artifact Type: {artifact_type.name}")
# Get sweeps
for sweep in project.sweeps():
print(f"Sweep ID: {sweep.id}")
print(f"State: {sweep.state}")
Note:
This module is part of the W&B Public API and provides methods to access and manage projects. For creating new projects, use wandb.init() with a new project name.
class Projects
An lazy iterator of Project
objects.
An iterable interface to access projects created and saved by the entity.
Args:
client
(wandb.apis.internal.Api
): The API client instance to use.
entity
(str): The entity name (username or team) to fetch projects for.
per_page
(int): Number of projects to fetch per request (default is 50).
Example:
from wandb.apis.public.api import Api
# Find projects that belong to this entity
projects = Api().projects(entity="entity")
# Iterate over files
for project in projects:
print(f"Project: {project.name}")
print(f"- URL: {project.url}")
print(f"- Created at: {project.created_at}")
print(f"- Is benchmark: {project.is_benchmark}")
method Projects.__init__
__init__(
client: wandb.apis.public.api.RetryingClient,
entity: str,
per_page: int = 50
) → Projects
An iterable collection of Project
objects.
Args:
client
: The API client used to query W&B.
entity
: The entity which owns the projects.
per_page
: The number of projects to fetch per request to the API.
class Project
A project is a namespace for runs.
Args:
client
: W&B API client instance.
name
(str): The name of the project.
entity
(str): The entity name that owns the project.
method Project.__init__
__init__(
client: wandb.apis.public.api.RetryingClient,
entity: str,
project: str,
attrs: dict
) → Project
A single project associated with an entity.
Args:
client
: The API client used to query W&B.
entity
: The entity which owns the project.
project
: The name of the project to query.
attrs
: The attributes of the project.
property Project.id
property Project.path
Returns the path of the project. The path is a list containing the entity and project name.
property Project.url
Returns the URL of the project.
method Project.artifacts_types
artifacts_types(per_page=50)
Returns all artifact types associated with this project.
method Project.sweeps
Return a paginated collection of sweeps in this project.
Args:
per_page
: The number of sweeps to fetch per request to the API.
Returns:
A Sweeps
object, which is an iterable collection of Sweep
objects.
6.8 - Reports
Training and fine-tuning models is done elsewhere in the W&B Python SDK, not the Public API.
module wandb.apis.public
W&B Public API for Report objects.
This module provides classes for interacting with W&B reports and managing report-related data.
class Reports
Reports is a lazy iterator of BetaReport
objects.
Args:
client
(wandb.apis.internal.Api
): The API client instance to use.
project
(wandb.sdk.internal.Project
): The project to fetch reports from.
name
(str, optional): The name of the report to filter by. If None
, fetches all reports.
entity
(str, optional): The entity name for the project. Defaults to the project entity.
per_page
(int): Number of reports to fetch per page (default is 50).
method Reports.__init__
__init__(client, project, name=None, entity=None, per_page=50)
property Reports.length
method Reports.convert_objects
Converts GraphQL edges to File objects.
method Reports.update_variables
Updates the GraphQL query variables for pagination.
class BetaReport
BetaReport is a class associated with reports created in W&B.
Provides access to report attributes (name, description, user, spec, timestamps) and methods for retrieving associated runs, sections, and for rendering the report as HTML.
Attributes:
id
(string): Unique identifier of the report.
display_name
(string): Human-readable display name of the report.
name
(string): The name of the report. Use display_name
for a more user-friendly name.
description
(string): Description of the report.
user
(User): Dictionary containing user info (username, email) who created the report.
spec
(dict): The spec of the report.
url
(string): The URL of the report.
updated_at
(string): Timestamp of last update.
created_at
(string): Timestamp when the report was created.
method BetaReport.__init__
__init__(client, attrs, entity=None, project=None)
property BetaReport.created_at
property BetaReport.description
property BetaReport.display_name
property BetaReport.id
property BetaReport.name
property BetaReport.sections
Get the panel sections (groups) from the report.
property BetaReport.spec
property BetaReport.updated_at
property BetaReport.url
property BetaReport.user
method BetaReport.runs
runs(section, per_page=50, only_selected=True)
Get runs associated with a section of the report.
method BetaReport.to_html
to_html(height=1024, hidden=False)
Generate HTML containing an iframe displaying this report.
6.9 - Runs
Training and fine-tuning models is done elsewhere in the W&B Python SDK, not the Public API.
module wandb.apis.public
W&B Public API for Runs.
This module provides classes for interacting with W&B runs and their associated data.
Example:
from wandb.apis.public import Api
# Get runs matching filters
runs = Api().runs(
path="entity/project", filters={"state": "finished", "config.batch_size": 32}
)
# Access run data
for run in runs:
print(f"Run: {run.name}")
print(f"Config: {run.config}")
print(f"Metrics: {run.summary}")
# Get history with pandas
history_df = run.history(keys=["loss", "accuracy"], pandas=True)
# Work with artifacts
for artifact in run.logged_artifacts():
print(f"Artifact: {artifact.name}")
Note:
This module is part of the W&B Public API and provides read/write access to run data. For logging new runs, use the wandb.init() function from the main wandb package.
class Runs
A lazy iterator of Run
objects associated with a project and optional filter.
Runs are retrieved in pages from the W&B server as needed.
This is generally used indirectly using the Api.runs
namespace.
Args:
client
: (wandb.apis.public.RetryingClient
) The API client to use for requests.
entity
: (str) The entity (username or team) that owns the project.
project
: (str) The name of the project to fetch runs from.
filters
: (Optional[Dict[str, Any]]) A dictionary of filters to apply to the runs query.
order
: (str) Order can be created_at
, heartbeat_at
, config.*.value
, or summary_metrics.*
. If you prepend order with a + order is ascending (default). If you prepend order with a - order is descending. The default order is run.created_at from oldest to newest.
per_page
: (int) The number of runs to fetch per request (default is 50).
include_sweeps
: (bool) Whether to include sweep information in the runs. Defaults to True.
Examples:
from wandb.apis.public.runs import Runs
from wandb.apis.public import Api
# Get all runs from a project that satisfy the filters
filters = {"state": "finished", "config.optimizer": "adam"}
runs = Api().runs(
client=api.client,
entity="entity",
project="project_name",
filters=filters,
)
# Iterate over runs and print details
for run in runs:
print(f"Run name: {run.name}")
print(f"Run ID: {run.id}")
print(f"Run URL: {run.url}")
print(f"Run state: {run.state}")
print(f"Run config: {run.config}")
print(f"Run summary: {run.summary}")
print(f"Run history (samples=5): {run.history(samples=5)}")
print("----------")
# Get histories for all runs with specific metrics
histories_df = runs.histories(
samples=100, # Number of samples per run
keys=["loss", "accuracy"], # Metrics to fetch
x_axis="_step", # X-axis metric
format="pandas", # Return as pandas DataFrame
)
method Runs.__init__
__init__(
client: 'RetryingClient',
entity: 'str',
project: 'str',
filters: 'dict[str, Any] | None' = None,
order: 'str' = '+created_at',
per_page: 'int' = 50,
include_sweeps: 'bool' = True
)
property Runs.length
method Runs.histories
histories(
samples: 'int' = 500,
keys: 'list[str] | None' = None,
x_axis: 'str' = '_step',
format: "Literal['default', 'pandas', 'polars']" = 'default',
stream: "Literal['default', 'system']" = 'default'
)
Return sampled history metrics for all runs that fit the filters conditions.
Args:
samples
: The number of samples to return per run
keys
: Only return metrics for specific keys
x_axis
: Use this metric as the xAxis defaults to _step
format
: Format to return data in, options are “default”, “pandas”, “polars”
stream
: “default” for metrics, “system” for machine metrics
Returns:
pandas.DataFrame
: If format="pandas"
, returns a pandas.DataFrame
of history metrics.
polars.DataFrame
: If format="polars"
, returns a polars.DataFrame
of history metrics.
list of dicts
: If format="default"
, returns a list of dicts containing history metrics with a run_id
key.
class Run
A single run associated with an entity and project.
Args:
client
: The W&B API client.
entity
: The entity associated with the run.
project
: The project associated with the run.
run_id
: The unique identifier for the run.
attrs
: The attributes of the run.
include_sweeps
: Whether to include sweeps in the run.
Attributes:
tags
([str]): a list of tags associated with the run
url
(str): the url of this run
id
(str): unique identifier for the run (defaults to eight characters)
name
(str): the name of the run
state
(str): one of: running, finished, crashed, killed, preempting, preempted
config
(dict): a dict of hyperparameters associated with the run
created_at
(str): ISO timestamp when the run was started
system_metrics
(dict): the latest system metrics recorded for the run
summary
(dict): A mutable dict-like property that holds the current summary. Calling update will persist any changes.
project
(str): the project associated with the run
entity
(str): the name of the entity associated with the run
project_internal_id
(int): the internal id of the project
user
(str): the name of the user who created the run
path
(str): Unique identifier [entity]/[project]/[run_id]
notes
(str): Notes about the run
read_only
(boolean): Whether the run is editable
history_keys
(str): Keys of the history metrics that have been logged
with
wandb.log({key: value})
metadata
(str): Metadata about the run from wandb-metadata.json
method Run.__init__
__init__(
client: 'RetryingClient',
entity: 'str',
project: 'str',
run_id: 'str',
attrs: 'Mapping | None' = None,
include_sweeps: 'bool' = True
)
Initialize a Run object.
Run is always initialized by calling api.runs() where api is an instance of wandb.Api.
property Run.entity
The entity associated with the run.
property Run.id
The unique identifier for the run.
property Run.lastHistoryStep
Returns the last step logged in the run’s history.
Metadata about the run from wandb-metadata.json.
Metadata includes the run’s description, tags, start time, memory usage and more.
property Run.name
The name of the run.
property Run.path
The path of the run. The path is a list containing the entity, project, and run_id.
property Run.state
The state of the run. Can be one of: Finished, Failed, Crashed, or Running.
property Run.storage_id
The unique storage identifier for the run.
property Run.summary
A mutable dict-like property that holds summary values associated with the run.
property Run.url
The URL of the run.
The run URL is generated from the entity, project, and run_id. For SaaS users, it takes the form of https://wandb.ai/entity/project/run_id
.
property Run.username
This API is deprecated. Use entity
instead.
classmethod Run.create
create(
api: 'public.Api',
run_id: 'str | None' = None,
project: 'str | None' = None,
entity: 'str | None' = None,
state: "Literal['running', 'pending']" = 'running'
)
Create a run for the given project.
method Run.delete
delete(delete_artifacts=False)
Delete the given run from the wandb backend.
Args:
delete_artifacts
(bool, optional): Whether to delete the artifacts associated with the run.
method Run.file
Return the path of a file with a given name in the artifact.
Args:
name
(str): name of requested file.
Returns:
A File
matching the name argument.
method Run.files
files(
names: 'list[str] | None' = None,
pattern: 'str | None' = None,
per_page: 'int' = 50
)
Returns a Files
object for all files in the run which match the given criteria.
You can specify a list of exact file names to match, or a pattern to match against. If both are provided, the pattern will be ignored.
Args:
names
(list): names of the requested files, if empty returns all files
pattern
(str, optional): Pattern to match when returning files from W&B. This pattern uses mySQL’s LIKE syntax, so matching all files that end with .json would be “%.json”. If both names and pattern are provided, a ValueError will be raised.
per_page
(int): number of results per page.
Returns:
A Files
object, which is an iterator over File
objects.
method Run.history
history(samples=500, keys=None, x_axis='_step', pandas=True, stream='default')
Return sampled history metrics for a run.
This is simpler and faster if you are ok with the history records being sampled.
Args:
samples
: (int, optional) The number of samples to return
pandas
: (bool, optional) Return a pandas dataframe
keys
: (list, optional) Only return metrics for specific keys
x_axis
: (str, optional) Use this metric as the xAxis defaults to _step
stream
: (str, optional) “default” for metrics, “system” for machine metrics
Returns:
pandas.DataFrame
: If pandas=True returns a pandas.DataFrame
of history metrics.
list of dicts
: If pandas=False returns a list of dicts of history metrics.
method Run.load
method Run.log_artifact
log_artifact(
artifact: 'wandb.Artifact',
aliases: 'Collection[str] | None' = None,
tags: 'Collection[str] | None' = None
)
Declare an artifact as output of a run.
Args:
artifact
(Artifact
): An artifact returned from wandb.Api().artifact(name)
.
aliases
(list, optional): Aliases to apply to this artifact.
tags
: (list, optional) Tags to apply to this artifact, if any.
Returns:
A Artifact
object.
method Run.logged_artifacts
logged_artifacts(per_page: 'int' = 100) → public.RunArtifacts
Fetches all artifacts logged by this run.
Retrieves all output artifacts that were logged during the run. Returns a paginated result that can be iterated over or collected into a single list.
Args:
per_page
: Number of artifacts to fetch per API request.
Returns:
An iterable collection of all Artifact objects logged as outputs during this run.
Example:
import wandb
import tempfile
with tempfile.NamedTemporaryFile(mode="w", delete=False, suffix=".txt") as tmp:
tmp.write("This is a test artifact")
tmp_path = tmp.name
run = wandb.init(project="artifact-example")
artifact = wandb.Artifact("test_artifact", type="dataset")
artifact.add_file(tmp_path)
run.log_artifact(artifact)
run.finish()
api = wandb.Api()
finished_run = api.run(f"{run.entity}/{run.project}/{run.id}")
for logged_artifact in finished_run.logged_artifacts():
print(logged_artifact.name)
method Run.save
Persist changes to the run object to the W&B backend.
method Run.scan_history
scan_history(keys=None, page_size=1000, min_step=None, max_step=None)
Returns an iterable collection of all history records for a run.
Args:
keys
([str], optional): only fetch these keys, and only fetch rows that have all of keys defined.
page_size
(int, optional): size of pages to fetch from the api.
min_step
(int, optional): the minimum number of pages to scan at a time.
max_step
(int, optional): the maximum number of pages to scan at a time.
Returns:
An iterable collection over history records (dict).
Example:
Export all the loss values for an example run
run = api.run("entity/project-name/run-id")
history = run.scan_history(keys=["Loss"])
losses = [row["Loss"] for row in history]
method Run.to_html
to_html(height=420, hidden=False)
Generate HTML containing an iframe displaying this run.
method Run.update
Persist changes to the run object to the wandb backend.
method Run.upload_file
upload_file(path, root='.')
Upload a local file to W&B, associating it with this run.
Args:
path
(str): Path to the file to upload. Can be absolute or relative.
root
(str): The root path to save the file relative to. For example, if you want to have the file saved in the run as “my_dir/file.txt” and you’re currently in “my_dir” you would set root to “../”. Defaults to current directory (".").
Returns:
A File
object representing the uploaded file.
method Run.use_artifact
use_artifact(artifact, use_as=None)
Declare an artifact as an input to a run.
Args:
artifact
(Artifact
): An artifact returned from wandb.Api().artifact(name)
use_as
(string, optional): A string identifying how the artifact is used in the script. Used to easily differentiate artifacts used in a run, when using the beta wandb launch feature’s artifact swapping functionality.
Returns:
An Artifact
object.
method Run.used_artifacts
used_artifacts(per_page: 'int' = 100) → public.RunArtifacts
Fetches artifacts explicitly used by this run.
Retrieves only the input artifacts that were explicitly declared as used during the run, typically via run.use_artifact()
. Returns a paginated result that can be iterated over or collected into a single list.
Args:
per_page
: Number of artifacts to fetch per API request.
Returns:
An iterable collection of Artifact objects explicitly used as inputs in this run.
Example:
import wandb
run = wandb.init(project="artifact-example")
run.use_artifact("test_artifact:latest")
run.finish()
api = wandb.Api()
finished_run = api.run(f"{run.entity}/{run.project}/{run.id}")
for used_artifact in finished_run.used_artifacts():
print(used_artifact.name)
test_artifact
method Run.wait_until_finished
Check the state of the run until it is finished.
6.10 - Sweeps
Training and fine-tuning models is done elsewhere in the W&B Python SDK, not the Public API.
module wandb.apis.public
W&B Public API for Sweeps.
This module provides classes for interacting with W&B hyperparameter optimization sweeps.
Example:
from wandb.apis.public import Api
# Get a specific sweep
sweep = Api().sweep("entity/project/sweep_id")
# Access sweep properties
print(f"Sweep: {sweep.name}")
print(f"State: {sweep.state}")
print(f"Best Loss: {sweep.best_loss}")
# Get best performing run
best_run = sweep.best_run()
print(f"Best Run: {best_run.name}")
print(f"Metrics: {best_run.summary}")
Note:
This module is part of the W&B Public API and provides read-only access to sweep data. For creating and controlling sweeps, use the wandb.sweep() and wandb.agent() functions from the main wandb package.
class Sweeps
A lazy iterator over a collection of Sweep
objects.
Examples:
from wandb.apis.public import Api
sweeps = Api().project(name="project_name", entity="entity").sweeps()
# Iterate over sweeps and print details
for sweep in sweeps:
print(f"Sweep name: {sweep.name}")
print(f"Sweep ID: {sweep.id}")
print(f"Sweep URL: {sweep.url}")
print("----------")
method Sweeps.__init__
__init__(
client: wandb.apis.public.api.RetryingClient,
entity: str,
project: str,
per_page: int = 50
) → Sweeps
An iterable collection of Sweep
objects.
Args:
client
: The API client used to query W&B.
entity
: The entity which owns the sweeps.
project
: The project which contains the sweeps.
per_page
: The number of sweeps to fetch per request to the API.
property Sweeps.length
class Sweep
The set of runs associated with the sweep.
Attributes:
runs
(Runs): List of runs
id
(str): Sweep ID
project
(str): The name of the project the sweep belongs to
config
(dict): Dictionary containing the sweep configuration
state
(str): The state of the sweep. Can be “Finished”, “Failed”, “Crashed”, or “Running”.
expected_run_count
(int): The number of expected runs for the sweep
method Sweep.__init__
__init__(client, entity, project, sweep_id, attrs=None)
property Sweep.config
The sweep configuration used for the sweep.
property Sweep.entity
The entity associated with the sweep.
property Sweep.expected_run_count
Return the number of expected runs in the sweep or None for infinite runs.
property Sweep.name
The name of the sweep.
Returns the first name that exists in the following priority order:
- User-edited display name 2. Name configured at creation time 3. Sweep ID
property Sweep.order
Return the order key for the sweep.
property Sweep.path
Returns the path of the project.
The path is a list containing the entity, project name, and sweep ID.
property Sweep.url
The URL of the sweep.
The sweep URL is generated from the entity, project, the term “sweeps”, and the sweep ID.run_id. For SaaS users, it takes the form of https://wandb.ai/entity/project/sweeps/sweeps_ID
.
property Sweep.username
Deprecated. Use Sweep.entity
instead.
method Sweep.best_run
Return the best run sorted by the metric defined in config or the order passed in.
classmethod Sweep.get
get(
client: 'RetryingClient',
entity: Optional[str] = None,
project: Optional[str] = None,
sid: Optional[str] = None,
order: Optional[str] = None,
query: Optional[str] = None,
**kwargs
)
Execute a query against the cloud backend.
Args:
client
: The client to use to execute the query.
entity
: The entity (username or team) that owns the project.
project
: The name of the project to fetch sweep from.
sid
: The sweep ID to query.
order
: The order in which the sweep’s runs are returned.
query
: The query to use to execute the query.
**kwargs
: Additional keyword arguments to pass to the query.
method Sweep.to_html
to_html(height=420, hidden=False)
Generate HTML containing an iframe displaying this sweep.
6.11 - Teams
Training and fine-tuning models is done elsewhere in the W&B Python SDK, not the Public API.
module wandb.apis.public
W&B Public API for managing teams and team members.
This module provides classes for managing W&B teams and their members.
Note:
This module is part of the W&B Public API and provides methods to manage teams and their members. Team management operations require appropriate permissions.
class Member
A member of a team.
Args:
client
(wandb.apis.internal.Api
): The client instance to use
team
(str): The name of the team this member belongs to
attrs
(dict): The member attributes
method Member.__init__
__init__(client, team, attrs)
method Member.delete
Remove a member from a team.
Returns:
Boolean indicating success
class Team
A class that represents a W&B team.
This class provides methods to manage W&B teams, including creating teams, inviting members, and managing service accounts. It inherits from Attrs to handle team attributes.
Args:
client
(wandb.apis.public.Api
): The api instance to use
name
(str): The name of the team
attrs
(dict): Optional dictionary of team attributes
Note:
Team management requires appropriate permissions.
method Team.__init__
__init__(client, name, attrs=None)
classmethod Team.create
create(api, team, admin_username=None)
Create a new team.
Args:
api
: (Api
) The api instance to use
team
: (str) The name of the team
admin_username
: (str) optional username of the admin user of the team, defaults to the current user.
Returns:
A Team
object
method Team.create_service_account
create_service_account(description)
Create a service account for the team.
Args:
description
: (str) A description for this service account
Returns:
The service account Member
object, or None on failure
method Team.invite
invite(username_or_email, admin=False)
Invite a user to a team.
Args:
username_or_email
: (str) The username or email address of the user you want to invite.
admin
: (bool) Whether to make this user a team admin. Defaults to False
.
Returns:
True
on success, False
if user was already invited or didn’t exist.
6.12 - Users
Training and fine-tuning models is done elsewhere in the W&B Python SDK, not the Public API.
module wandb.apis.public
W&B Public API for managing users and API keys.
This module provides classes for managing W&B users and their API keys.
Note:
This module is part of the W&B Public API and provides methods to manage users and their authentication. Some operations require admin privileges.
class User
A class representing a W&B user with authentication and management capabilities.
This class provides methods to manage W&B users, including creating users, managing API keys, and accessing team memberships. It inherits from Attrs to handle user attributes.
Args:
client
: (wandb.apis.internal.Api
) The client instance to use
attrs
: (dict) The user attributes
Note:
Some operations require admin privileges
method User.__init__
property User.api_keys
List of API key names associated with the user.
Returns:
list[str]
: Names of API keys associated with the user. Empty list if user has no API keys or if API key data hasn’t been loaded.
property User.teams
List of team names that the user is a member of.
Returns:
list
(list): Names of teams the user belongs to. Empty list if user has no team memberships or if teams data hasn’t been loaded.
property User.user_api
An instance of the api using credentials from the user.
classmethod User.create
create(api, email, admin=False)
Create a new user.
Args:
api
(Api
): The api instance to use
email
(str): The name of the team
admin
(bool): Whether this user should be a global instance admin
Returns:
A User
object
method User.delete_api_key
Delete a user’s api key.
Args:
api_key
(str): The name of the API key to delete. This should be one of the names returned by the api_keys
property.
Returns:
Boolean indicating success
Raises:
ValueError if the api_key couldn’t be found
method User.generate_api_key
generate_api_key(description=None)
Generate a new api key.
Args:
description
(str, optional): A description for the new API key. This can be used to identify the purpose of the API key.
Returns:
The new api key, or None on failure
7 - Run
class Run
A unit of computation logged by W&B. Typically, this is an ML experiment.
Call wandb.init()
to create a new run. wandb.init()
starts a new run and returns a wandb.Run
object. Each run is associated with a unique ID (run ID). W&B recommends using a context (with
statement) manager to automatically finish the run.
For distributed training experiments, you can either track each process separately using one run per process or track all processes to a single run. See Log distributed training experiments for more information.
You can log data to a run with wandb.Run.log()
. Anything you log using wandb.Run.log()
is sent to that run. See Create an experiment or wandb.init
API reference page or more information.
There is a another Run
object in the wandb.apis.public
namespace. Use this object is to interact with runs that have already been created.
Attributes:
summary
: (Summary) A summary of the run, which is a dictionary-like object. For more information, see
[Log summary metrics](https
: //docs.wandb.ai/guides/track/log/log-summary/).
Examples:
Create a run with wandb.init()
:
import wandb
# Start a new run and log some data
# Use context manager (`with` statement) to automatically finish the run
with wandb.init(entity="entity", project="project") as run:
run.log({"accuracy": acc, "loss": loss})
property Run.config
Config object associated with this run.
property Run.config_static
Static config object associated with this run.
property Run.dir
The directory where files associated with the run are saved.
property Run.disabled
True if the run is disabled, False otherwise.
property Run.entity
The name of the W&B entity associated with the run.
Entity can be a username or the name of a team or organization.
property Run.group
Returns the name of the group associated with this run.
Grouping runs together allows related experiments to be organized and visualized collectively in the W&B UI. This is especially useful for scenarios such as distributed training or cross-validation, where multiple runs should be viewed and managed as a unified experiment.
In shared mode, where all processes share the same run object, setting a group is usually unnecessary, since there is only one run and no grouping is required.
property Run.id
Identifier for this run.
property Run.job_type
Name of the job type associated with the run.
View a run’s job type in the run’s Overview page in the W&B App.
You can use this to categorize runs by their job type, such as “training”, “evaluation”, or “inference”. This is useful for organizing and filtering runs in the W&B UI, especially when you have multiple runs with different job types in the same project. For more information, see Organize runs.
property Run.name
Display name of the run.
Display names are not guaranteed to be unique and may be descriptive. By default, they are randomly generated.
property Run.notes
Notes associated with the run, if there are any.
Notes can be a multiline string and can also use markdown and latex equations inside $$
, like $x + 3$
.
property Run.offline
True if the run is offline, False otherwise.
property Run.path
Path to the run.
Run paths include entity, project, and run ID, in the format entity/project/run_id
.
property Run.project
Name of the W&B project associated with the run.
property Run.project_url
URL of the W&B project associated with the run, if there is one.
Offline runs do not have a project URL.
property Run.resumed
True if the run was resumed, False otherwise.
property Run.settings
A frozen copy of run’s Settings object.
property Run.start_time
Unix timestamp (in seconds) of when the run started.
property Run.sweep_id
Identifier for the sweep associated with the run, if there is one.
property Run.sweep_url
URL of the sweep associated with the run, if there is one.
Offline runs do not have a sweep URL.
Tags associated with the run, if there are any.
property Run.url
The url for the W&B run, if there is one.
Offline runs will not have a url.
method Run.alert
alert(
title: 'str',
text: 'str',
level: 'str | AlertLevel | None' = None,
wait_duration: 'int | float | timedelta | None' = None
) → None
Create an alert with the given title and text.
Args:
title
: The title of the alert, must be less than 64 characters long.
text
: The text body of the alert.
level
: The alert level to use, either: INFO
, WARN
, or ERROR
.
wait_duration
: The time to wait (in seconds) before sending another alert with this title.
method Run.define_metric
define_metric(
name: 'str',
step_metric: 'str | wandb_metric.Metric | None' = None,
step_sync: 'bool | None' = None,
hidden: 'bool | None' = None,
summary: 'str | None' = None,
goal: 'str | None' = None,
overwrite: 'bool | None' = None
) → wandb_metric.Metric
Customize metrics logged with wandb.Run.log()
.
Args:
name
: The name of the metric to customize.
step_metric
: The name of another metric to serve as the X-axis for this metric in automatically generated charts.
step_sync
: Automatically insert the last value of step_metric into wandb.Run.log()
if it is not provided explicitly. Defaults to True if step_metric is specified.
hidden
: Hide this metric from automatic plots.
summary
: Specify aggregate metrics added to summary. Supported aggregations include “min”, “max”, “mean”, “last”, “first”, “best”, “copy” and “none”. “none” prevents a summary from being generated. “best” is used together with the goal parameter, “best” is deprecated and should not be used, use “min” or “max” instead. “copy” is deprecated and should not be used.
goal
: Specify how to interpret the “best” summary type. Supported options are “minimize” and “maximize”. “goal” is deprecated and should not be used, use “min” or “max” instead.
overwrite
: If false, then this call is merged with previous define_metric
calls for the same metric by using their values for any unspecified parameters. If true, then unspecified parameters overwrite values specified by previous calls.
Returns:
An object that represents this call but can otherwise be discarded.
method Run.display
display(height: 'int' = 420, hidden: 'bool' = False) → bool
Display this run in Jupyter.
method Run.finish
finish(exit_code: 'int | None' = None, quiet: 'bool | None' = None) → None
Finish a run and upload any remaining data.
Marks the completion of a W&B run and ensures all data is synced to the server. The run’s final state is determined by its exit conditions and sync status.
Run States:
- Running: Active run that is logging data and/or sending heartbeats.
- Crashed: Run that stopped sending heartbeats unexpectedly.
- Finished: Run completed successfully (
exit_code=0
) with all data synced.
- Failed: Run completed with errors (
exit_code!=0
).
- Killed: Run was forcibly stopped before it could finish.
Args:
exit_code
: Integer indicating the run’s exit status. Use 0 for success, any other value marks the run as failed.
quiet
: Deprecated. Configure logging verbosity using wandb.Settings(quiet=...)
.
method Run.finish_artifact
finish_artifact(
artifact_or_path: 'Artifact | str',
name: 'str | None' = None,
type: 'str | None' = None,
aliases: 'list[str] | None' = None,
distributed_id: 'str | None' = None
) → Artifact
Finishes a non-finalized artifact as output of a run.
Subsequent “upserts” with the same distributed ID will result in a new version.
Args:
artifact_or_path
: A path to the contents of this artifact, can be in the following forms:
- /local/directory
- /local/directory/file.txt
- s3://bucket/path
You can also pass an Artifact object created by calling wandb.Artifact
.
name
: An artifact name. May be prefixed with entity/project. Valid names can be in the following forms:
- name:version
- name:alias
- digest This will default to the basename of the path prepended with the current run id if not specified.
type
: The type of artifact to log, examples include dataset
, model
aliases
: Aliases to apply to this artifact, defaults to ["latest"]
distributed_id
: Unique string that all distributed jobs share. If None, defaults to the run’s group name.
Returns:
An Artifact
object.
method Run.link_artifact
link_artifact(
artifact: 'Artifact',
target_path: 'str',
aliases: 'list[str] | None' = None
) → Artifact
Link the given artifact to a portfolio (a promoted collection of artifacts).
Linked artifacts are visible in the UI for the specified portfolio.
Args:
artifact
: the (public or local) artifact which will be linked
target_path
: str
- takes the following forms: {portfolio}
, {project}/{portfolio}
, or {entity}/{project}/{portfolio}
aliases
: List[str]
- optional alias(es) that will only be applied on this linked artifact inside the portfolio. The alias “latest” will always be applied to the latest version of an artifact that is linked.
Returns:
The linked artifact.
method Run.link_model
link_model(
path: 'StrPath',
registered_model_name: 'str',
name: 'str | None' = None,
aliases: 'list[str] | None' = None
) → Artifact | None
Log a model artifact version and link it to a registered model in the model registry.
Linked model versions are visible in the UI for the specified registered model.
This method will:
- Check if ’name’ model artifact has been logged. If so, use the artifact version that matches the files located at ‘path’ or log a new version. Otherwise log files under ‘path’ as a new model artifact, ’name’ of type ‘model’.
- Check if registered model with name ‘registered_model_name’ exists in the ‘model-registry’ project. If not, create a new registered model with name ‘registered_model_name’.
- Link version of model artifact ’name’ to registered model, ‘registered_model_name’.
- Attach aliases from ‘aliases’ list to the newly linked model artifact version.
Args:
path
: (str) A path to the contents of this model, can be in the following forms:
/local/directory
/local/directory/file.txt
s3://bucket/path
registered_model_name
: The name of the registered model that the model is to be linked to. A registered model is a collection of model versions linked to the model registry, typically representing a team’s specific ML Task. The entity that this registered model belongs to will be derived from the run.
name
: The name of the model artifact that files in ‘path’ will be logged to. This will default to the basename of the path prepended with the current run id if not specified.
aliases
: Aliases that will only be applied on this linked artifact inside the registered model. The alias “latest” will always be applied to the latest version of an artifact that is linked.
Raises:
AssertionError
: If registered_model_name is a path or if model artifact ’name’ is of a type that does not contain the substring ‘model’.
ValueError
: If name has invalid special characters.
Returns:
The linked artifact if linking was successful, otherwise None
.
method Run.log
log(
data: 'dict[str, Any]',
step: 'int | None' = None,
commit: 'bool | None' = None
) → None
Upload run data.
Use log
to log data from runs, such as scalars, images, video, histograms, plots, and tables. See Log objects and media for code snippets, best practices, and more.
Basic usage:
import wandb
with wandb.init() as run:
run.log({"train-loss": 0.5, "accuracy": 0.9})
The previous code snippet saves the loss and accuracy to the run’s history and updates the summary values for these metrics.
Visualize logged data in a workspace at wandb.ai, or locally on a self-hosted instance of the W&B app, or export data to visualize and explore locally, such as in a Jupyter notebook, with the Public API.
Logged values don’t have to be scalars. You can log any W&B supported Data Type such as images, audio, video, and more. For example, you can use wandb.Table
to log structured data. See Log tables, visualize and query data tutorial for more details.
W&B organizes metrics with a forward slash (/
) in their name into sections named using the text before the final slash. For example, the following results in two sections named “train” and “validate”:
with wandb.init() as run:
# Log metrics in the "train" section.
run.log(
{
"train/accuracy": 0.9,
"train/loss": 30,
"validate/accuracy": 0.8,
"validate/loss": 20,
}
)
Only one level of nesting is supported; run.log({"a/b/c": 1})
produces a section named “a/b”.
run.log()
is not intended to be called more than a few times per second. For optimal performance, limit your logging to once every N iterations, or collect data over multiple iterations and log it in a single step.
By default, each call to log
creates a new “step”. The step must always increase, and it is not possible to log to a previous step. You can use any metric as the X axis in charts. See Custom log axes for more details.
In many cases, it is better to treat the W&B step like you’d treat a timestamp rather than a training step.
with wandb.init() as run:
# Example: log an "epoch" metric for use as an X axis.
run.log({"epoch": 40, "train-loss": 0.5})
It is possible to use multiple wandb.Run.log()
invocations to log to the same step with the step
and commit
parameters. The following are all equivalent:
with wandb.init() as run:
# Normal usage:
run.log({"train-loss": 0.5, "accuracy": 0.8})
run.log({"train-loss": 0.4, "accuracy": 0.9})
# Implicit step without auto-incrementing:
run.log({"train-loss": 0.5}, commit=False)
run.log({"accuracy": 0.8})
run.log({"train-loss": 0.4}, commit=False)
run.log({"accuracy": 0.9})
# Explicit step:
run.log({"train-loss": 0.5}, step=current_step)
run.log({"accuracy": 0.8}, step=current_step)
current_step += 1
run.log({"train-loss": 0.4}, step=current_step)
run.log({"accuracy": 0.9}, step=current_step)
Args:
data
: A dict
with str
keys and values that are serializable
Python objects including
: int
, float
and string
; any of the wandb.data_types
; lists, tuples and NumPy arrays of serializable Python objects; other dict
s of this structure.
step
: The step number to log. If None
, then an implicit auto-incrementing step is used. See the notes in the description.
commit
: If true, finalize and upload the step. If false, then accumulate data for the step. See the notes in the description. If step
is None
, then the default is commit=True
; otherwise, the default is commit=False
.
Examples:
For more and more detailed examples, see our guides to logging.
Basic usage
import wandb
with wandb.init() as run:
run.log({"train-loss": 0.5, "accuracy": 0.9
Incremental logging
import wandb
with wandb.init() as run:
run.log({"loss": 0.2}, commit=False)
# Somewhere else when I'm ready to report this step:
run.log({"accuracy": 0.8})
Histogram
import numpy as np
import wandb
# sample gradients at random from normal distribution
gradients = np.random.randn(100, 100)
with wandb.init() as run:
run.log({"gradients": wandb.Histogram(gradients)})
Image from NumPy
import numpy as np
import wandb
with wandb.init() as run:
examples = []
for i in range(3):
pixels = np.random.randint(low=0, high=256, size=(100, 100, 3))
image = wandb.Image(pixels, caption=f"random field {i}")
examples.append(image)
run.log({"examples": examples})
Image from PIL
import numpy as np
from PIL import Image as PILImage
import wandb
with wandb.init() as run:
examples = []
for i in range(3):
pixels = np.random.randint(
low=0,
high=256,
size=(100, 100, 3),
dtype=np.uint8,
)
pil_image = PILImage.fromarray(pixels, mode="RGB")
image = wandb.Image(pil_image, caption=f"random field {i}")
examples.append(image)
run.log({"examples": examples})
Video from NumPy
import numpy as np
import wandb
with wandb.init() as run:
# axes are (time, channel, height, width)
frames = np.random.randint(
low=0,
high=256,
size=(10, 3, 100, 100),
dtype=np.uint8,
)
run.log({"video": wandb.Video(frames, fps=4)})
Matplotlib plot
from matplotlib import pyplot as plt
import numpy as np
import wandb
with wandb.init() as run:
fig, ax = plt.subplots()
x = np.linspace(0, 10)
y = x * x
ax.plot(x, y) # plot y = x^2
run.log({"chart": fig})
PR Curve
import wandb
with wandb.init() as run:
run.log({"pr": wandb.plot.pr_curve(y_test, y_probas, labels)})
3D Object
import wandb
with wandb.init() as run:
run.log(
{
"generated_samples": [
wandb.Object3D(open("sample.obj")),
wandb.Object3D(open("sample.gltf")),
wandb.Object3D(open("sample.glb")),
]
}
)
Raises:
wandb.Error
: If called before wandb.init()
.
ValueError
: If invalid data is passed.
method Run.log_artifact
log_artifact(
artifact_or_path: 'Artifact | StrPath',
name: 'str | None' = None,
type: 'str | None' = None,
aliases: 'list[str] | None' = None,
tags: 'list[str] | None' = None
) → Artifact
Declare an artifact as an output of a run.
Args:
artifact_or_path
: (str or Artifact) A path to the contents of this artifact, can be in the following forms:
- /local/directory
- /local/directory/file.txt
- s3://bucket/path
You can also pass an Artifact object created by calling wandb.Artifact
.
name
: (str, optional) An artifact name. Valid names can be in the following forms:
- name:version
- name:alias
- digest This will default to the basename of the path prepended with the current run id if not specified.
type
: (str) The type of artifact to log, examples include dataset
, model
aliases
: (list, optional) Aliases to apply to this artifact, defaults to ["latest"]
tags
: (list, optional) Tags to apply to this artifact, if any.
Returns:
An Artifact
object.
method Run.log_code
log_code(
root: 'str | None' = '.',
name: 'str | None' = None,
include_fn: 'Callable[[str, str], bool] | Callable[[str], bool]' = <function _is_py_requirements_or_dockerfile at 0x10342a8c0>,
exclude_fn: 'Callable[[str, str], bool] | Callable[[str], bool]' = <function exclude_wandb_fn at 0x1050f4ee0>
) → Artifact | None
Save the current state of your code to a W&B Artifact.
By default, it walks the current directory and logs all files that end with .py
.
Args:
root
: The relative (to os.getcwd()
) or absolute path to recursively find code from.
name
: (str, optional) The name of our code artifact. By default, we’ll name the artifact source-$PROJECT_ID-$ENTRYPOINT_RELPATH
. There may be scenarios where you want many runs to share the same artifact. Specifying name allows you to achieve that.
include_fn
: A callable that accepts a file path and (optionally) root path and returns True when it should be included and False otherwise. This
defaults to
lambda path, root: path.endswith(".py")
.
exclude_fn
: A callable that accepts a file path and (optionally) root path and returns True
when it should be excluded and False
otherwise. This defaults to a function that excludes all files within <root>/.wandb/
and <root>/wandb/
directories.
Examples:
Basic usage
import wandb
with wandb.init() as run:
run.log_code()
Advanced usage
import wandb
with wandb.init() as run:
run.log_code(
root="../",
include_fn=lambda path: path.endswith(".py") or path.endswith(".ipynb"),
exclude_fn=lambda path, root: os.path.relpath(path, root).startswith(
"cache/"
),
)
Returns:
An Artifact
object if code was logged
method Run.log_model
log_model(
path: 'StrPath',
name: 'str | None' = None,
aliases: 'list[str] | None' = None
) → None
Logs a model artifact containing the contents inside the ‘path’ to a run and marks it as an output to this run.
The name of model artifact can only contain alphanumeric characters, underscores, and hyphens.
Args:
path
: (str) A path to the contents of this model, can be in the following forms:
- /local/directory
- /local/directory/file.txt
- s3://bucket/path
name
: A name to assign to the model artifact that the file contents will be added to. This will default to the basename of the path prepended with the current run id if not specified.
aliases
: Aliases to apply to the created model artifact, defaults to ["latest"]
Raises:
ValueError
: If name has invalid special characters.
Returns:
None
method Run.mark_preempting
Mark this run as preempting.
Also tells the internal process to immediately report this to server.
method Run.restore
restore(
name: 'str',
run_path: 'str | None' = None,
replace: 'bool' = False,
root: 'str | None' = None
) → None | TextIO
Download the specified file from cloud storage.
File is placed into the current directory or run directory. By default, will only download the file if it doesn’t already exist.
Args:
name
: The name of the file.
run_path
: Optional path to a run to pull files from, i.e. username/project_name/run_id
if wandb.init has not been called, this is required.
replace
: Whether to download the file even if it already exists locally
root
: The directory to download the file to. Defaults to the current directory or the run directory if wandb.init was called.
Returns:
None if it can’t find the file, otherwise a file object open for reading.
Raises:
CommError
: If W&B can’t connect to the W&B backend.
ValueError
: If the file is not found or can’t find run_path.
method Run.save
save(
glob_str: 'str | os.PathLike',
base_path: 'str | os.PathLike | None' = None,
policy: 'PolicyName' = 'live'
) → bool | list[str]
Sync one or more files to W&B.
Relative paths are relative to the current working directory.
A Unix glob, such as “myfiles/*”, is expanded at the time save
is called regardless of the policy
. In particular, new files are not picked up automatically.
A base_path
may be provided to control the directory structure of uploaded files. It should be a prefix of glob_str
, and the directory structure beneath it is preserved.
When given an absolute path or glob and no base_path
, one directory level is preserved as in the example above.
Args:
glob_str
: A relative or absolute path or Unix glob.
base_path
: A path to use to infer a directory structure; see examples.
policy
: One of live
, now
, or end
.
- live: upload the file as it changes, overwriting the previous version
- now: upload the file once now
- end: upload file when the run ends
Returns:
Paths to the symlinks created for the matched files.
For historical reasons, this may return a boolean in legacy code.
import wandb
run = wandb.init()
run.save("these/are/myfiles/*")
# => Saves files in a "these/are/myfiles/" folder in the run.
run.save("these/are/myfiles/*", base_path="these")
# => Saves files in an "are/myfiles/" folder in the run.
run.save("/User/username/Documents/run123/*.txt")
# => Saves files in a "run123/" folder in the run. See note below.
run.save("/User/username/Documents/run123/*.txt", base_path="/User")
# => Saves files in a "username/Documents/run123/" folder in the run.
run.save("files/*/saveme.txt")
# => Saves each "saveme.txt" file in an appropriate subdirectory
# of "files/".
# Explicitly finish the run since a context manager is not used.
run.finish()
method Run.status
Get sync info from the internal backend, about the current run’s sync status.
method Run.unwatch
unwatch(
models: 'torch.nn.Module | Sequence[torch.nn.Module] | None' = None
) → None
Remove pytorch model topology, gradient and parameter hooks.
Args:
models
: Optional list of pytorch models that have had watch called on them.
method Run.upsert_artifact
upsert_artifact(
artifact_or_path: 'Artifact | str',
name: 'str | None' = None,
type: 'str | None' = None,
aliases: 'list[str] | None' = None,
distributed_id: 'str | None' = None
) → Artifact
Declare (or append to) a non-finalized artifact as output of a run.
Note that you must call run.finish_artifact() to finalize the artifact. This is useful when distributed jobs need to all contribute to the same artifact.
Args:
artifact_or_path
: A path to the contents of this artifact, can be in the following forms:
/local/directory
/local/directory/file.txt
s3://bucket/path
name
: An artifact name. May be prefixed with “entity/project”. Defaults to the basename of the path prepended with the current run ID if not specified. Valid names can be in the following forms:
- name:version
- name:alias
- digest
type
: The type of artifact to log. Common examples include dataset
, model
.
aliases
: Aliases to apply to this artifact, defaults to ["latest"]
.
distributed_id
: Unique string that all distributed jobs share. If None, defaults to the run’s group name.
Returns:
An Artifact
object.
method Run.use_artifact
use_artifact(
artifact_or_name: 'str | Artifact',
type: 'str | None' = None,
aliases: 'list[str] | None' = None,
use_as: 'str | None' = None
) → Artifact
Declare an artifact as an input to a run.
Call download
or file
on the returned object to get the contents locally.
Args:
artifact_or_name
: The name of the artifact to use. May be prefixed with the name of the project the artifact was logged to ("" or “/”). If no entity is specified in the name, the Run or API setting’s entity is used. Valid names can be in the following forms
type
: The type of artifact to use.
aliases
: Aliases to apply to this artifact
use_as
: This argument is deprecated and does nothing.
Returns:
An Artifact
object.
Examples:
import wandb
run = wandb.init(project="<example>")
# Use an artifact by name and alias
artifact_a = run.use_artifact(artifact_or_name="<name>:<alias>")
# Use an artifact by name and version
artifact_b = run.use_artifact(artifact_or_name="<name>:v<version>")
# Use an artifact by entity/project/name:alias
artifact_c = run.use_artifact(
artifact_or_name="<entity>/<project>/<name>:<alias>"
)
# Use an artifact by entity/project/name:version
artifact_d = run.use_artifact(
artifact_or_name="<entity>/<project>/<name>:v<version>"
)
# Explicitly finish the run since a context manager is not used.
run.finish()
method Run.use_model
use_model(name: 'str') → FilePathStr
Download the files logged in a model artifact ’name’.
Args:
name
: A model artifact name. ’name’ must match the name of an existing logged model artifact. May be prefixed with entity/project/
. Valid names can be in the following forms
- model_artifact_name:version
- model_artifact_name:alias
Returns:
path
(str): Path to downloaded model artifact file(s).
Raises:
AssertionError
: If model artifact ’name’ is of a type that does not contain the substring ‘model’.
method Run.watch
watch(
models: 'torch.nn.Module | Sequence[torch.nn.Module]',
criterion: 'torch.F | None' = None,
log: "Literal['gradients', 'parameters', 'all'] | None" = 'gradients',
log_freq: 'int' = 1000,
idx: 'int | None' = None,
log_graph: 'bool' = False
) → None
Hook into given PyTorch model to monitor gradients and the model’s computational graph.
This function can track parameters, gradients, or both during training.
Args:
models
: A single model or a sequence of models to be monitored.
criterion
: The loss function being optimized (optional).
log
: Specifies whether to log “gradients”, “parameters”, or “all”. Set to None to disable logging. (default=“gradients”).
log_freq
: Frequency (in batches) to log gradients and parameters. (default=1000)
idx
: Index used when tracking multiple models with wandb.watch
. (default=None)
log_graph
: Whether to log the model’s computational graph. (default=False)
Raises:
ValueError: If wandb.init()
has not been called or if any of the models are not instances of torch.nn.Module
.
8 - Settings
Settings for the W&B SDK.
This class manages configuration settings for the W&B SDK,
ensuring type safety and validation of all settings. Settings are accessible
as attributes and can be initialized programmatically, through environment
variables (WANDB_ prefix
), and with configuration files.
The settings are organized into three categories:
- Public settings: Core configuration options that users can safely modify to customize
W&B’s behavior for their specific needs.
- Internal settings: Settings prefixed with ‘x_’ that handle low-level SDK behavior.
These settings are primarily for internal use and debugging. While they can be modified,
they are not considered part of the public API and may change without notice in future
versions.
- Computed settings: Read-only settings that are automatically derived from other settings or
the environment.
Attributes:
-
allow_offline_artifacts (bool): Flag to allow table artifacts to be synced in offline mode.
To revert to the old behavior, set this to False.
-
allow_val_change (bool): Flag to allow modification of Config
values after they’ve been set.
-
anonymous (Optional): Controls anonymous data logging.
Possible values are:
- “never”: requires you to link your W&B account before
tracking the run, so you don’t accidentally create an anonymous
run.
- “allow”: lets a logged-in user track runs with their account, but
lets someone who is running the script without a W&B account see
the charts in the UI.
- “must”: sends the run to an anonymous account instead of to a
signed-up user account.
-
api_key (Optional): The W&B API key.
-
azure_account_url_to_access_key (Optional): Mapping of Azure account URLs to their corresponding access keys for Azure integration.
-
base_url (str): The URL of the W&B backend for data synchronization.
-
code_dir (Optional): Directory containing the code to be tracked by W&B.
-
config_paths (Optional): Paths to files to load configuration from into the Config
object.
-
console (Literal): The type of console capture to be applied.
Possible values are:
“auto” - Automatically selects the console capture method based on the
system environment and settings.
“off” - Disables console capture.
“redirect” - Redirects low-level file descriptors for capturing output.
“wrap” - Overrides the write methods of sys.stdout/sys.stderr. Will be
mapped to either “wrap_raw” or “wrap_emu” based on the state of the system.
“wrap_raw” - Same as “wrap” but captures raw output directly instead of
through an emulator. Derived from the wrap
setting and should not be set manually.
“wrap_emu” - Same as “wrap” but captures output through an emulator.
Derived from the wrap
setting and should not be set manually.
-
console_multipart (bool): Whether to produce multipart console log files.
-
credentials_file (str): Path to file for writing temporary access tokens.
-
disable_code (bool): Whether to disable capturing the code.
-
disable_git (bool): Whether to disable capturing the git state.
-
disable_job_creation (bool): Whether to disable the creation of a job artifact for W&B Launch.
-
docker (Optional): The Docker image used to execute the script.
-
email (Optional): The email address of the user.
-
entity (Optional): The W&B entity, such as a user or a team.
-
force (bool): Whether to pass the force
flag to wandb.login()
.
-
fork_from (Optional): Specifies a point in a previous execution of a run to fork from.
The point is defined by the run ID, a metric, and its value.
Currently, only the metric ‘_step’ is supported.
-
git_commit (Optional): The git commit hash to associate with the run.
-
git_remote (str): The git remote to associate with the run.
-
git_remote_url (Optional): The URL of the git remote repository.
-
git_root (Optional): Root directory of the git repository.
-
host (Optional): Hostname of the machine running the script.
-
http_proxy (Optional): Custom proxy servers for http requests to W&B.
-
https_proxy (Optional): Custom proxy servers for https requests to W&B.
-
identity_token_file (Optional): Path to file containing an identity token (JWT) for authentication.
-
ignore_globs (Sequence): Unix glob patterns relative to files_dir
specifying files to exclude from upload.
-
init_timeout (float): Time in seconds to wait for the wandb.init
call to complete before timing out.
-
insecure_disable_ssl (bool): Whether to insecurely disable SSL verification.
-
job_name (Optional): Name of the Launch job running the script.
-
job_source (Optional): Source type for Launch.
-
label_disable (bool): Whether to disable automatic labeling features.
-
launch_config_path (Optional): Path to the launch configuration file.
-
login_timeout (Optional): Time in seconds to wait for login operations before timing out.
-
max_end_of_run_history_metrics (int): Maximum number of history sparklines to display at the end of a run.
-
max_end_of_run_summary_metrics (int): Maximum number of summary metrics to display at the end of a run.
-
mode (Literal): The operating mode for W&B logging and synchronization.
-
notebook_name (Optional): Name of the notebook if running in a Jupyter-like environment.
-
organization (Optional): The W&B organization.
-
program (Optional): Path to the script that created the run, if available.
-
program_abspath (Optional): The absolute path from the root repository directory to the script that
created the run.
Root repository directory is defined as the directory containing the
.git directory, if it exists. Otherwise, it’s the current working directory.
-
program_relpath (Optional): The relative path to the script that created the run.
-
project (Optional): The W&B project ID.
-
quiet (bool): Flag to suppress non-essential output.
-
reinit (Union): What to do when wandb.init()
is called while a run is active.
Options:
- “default”: Use “finish_previous” in notebooks and “return_previous”
otherwise.
- “return_previous”: Return the most recently created run
that is not yet finished. This does not update
wandb.run
; see
the “create_new” option.
- “finish_previous”: Finish all active runs, then return a new run.
- “create_new”: Create a new run without modifying other active runs.
Does not update
wandb.run
and top-level functions like wandb.log
.
Because of this, some older integrations that rely on the global run
will not work.
Can also be a boolean, but this is deprecated. False is the same as
“return_previous”, and True is the same as “finish_previous”.
-
relogin (bool): Flag to force a new login attempt.
-
resume (Optional): Specifies the resume behavior for the run.
Options:
- “must”: Resumes from an existing run with the same ID. If no such run exists,
it will result in failure.
- “allow”: Attempts to resume from an existing run with the same ID. If none is
found, a new run will be created.
- “never”: Always starts a new run. If a run with the same ID already exists,
it will result in failure.
- “auto”: Automatically resumes from the most recent failed run on the same
machine.
-
resume_from (Optional): Specifies a point in a previous execution of a run to resume from.
The point is defined by the run ID, a metric, and its value.
Currently, only the metric ‘_step’ is supported.
-
root_dir (str): The root directory to use as the base for all run-related paths.
In particular, this is used to derive the wandb directory and the run directory.
-
run_group (Optional): Group identifier for related runs.
Used for grouping runs in the UI.
-
run_id (Optional): The ID of the run.
-
run_job_type (Optional): Type of job being run (e.g., training, evaluation).
-
run_name (Optional): Human-readable name for the run.
-
run_notes (Optional): Additional notes or description for the run.
-
run_tags (Optional): Tags to associate with the run for organization and filtering.
-
sagemaker_disable (bool): Flag to disable SageMaker-specific functionality.
-
save_code (Optional): Whether to save the code associated with the run.
-
settings_system (Optional): Path to the system-wide settings file.
-
show_errors (bool): Whether to display error messages.
-
show_info (bool): Whether to display informational messages.
-
show_warnings (bool): Whether to display warning messages.
-
silent (bool): Flag to suppress all output.
-
strict (Optional): Whether to enable strict mode for validation and error checking.
-
summary_timeout (int): Time in seconds to wait for summary operations before timing out.
-
sweep_id (Optional): Identifier of the sweep this run belongs to.
-
sweep_param_path (Optional): Path to the sweep parameters configuration.
-
symlink (bool): Whether to use symlinks (True by default except on Windows).
-
sync_tensorboard (Optional): Whether to synchronize TensorBoard logs with W&B.
-
table_raise_on_max_row_limit_exceeded (bool): Whether to raise an exception when table row limits are exceeded.
-
username (Optional): Username.
-
x_disable_meta (bool): Flag to disable the collection of system metadata.
-
x_disable_stats (bool): Flag to disable the collection of system metrics.
-
x_extra_http_headers (Optional): Additional headers to add to all outgoing HTTP requests.
-
x_label (Optional): Label to assign to system metrics and console logs collected for the run.
This is used to group data by on the frontend and can be used to distinguish data
from different processes in a distributed training job.
-
x_primary (bool): Determines whether to save internal wandb files and metadata.
In a distributed setting, this is useful for avoiding file overwrites
from secondary processes when only system metrics and logs are needed,
as the primary process handles the main logging.
-
x_save_requirements (bool): Flag to save the requirements file.
-
x_server_side_derived_summary (bool): Flag to delegate automatic computation of summary from history to the server.
This does not disable user-provided summary updates.
-
x_service_wait (float): Time in seconds to wait for the wandb-core internal service to start.
-
x_skip_transaction_log (bool): Whether to skip saving the run events to the transaction log.
This is only relevant for online runs. Can be used to reduce the amount of
data written to disk.
Should be used with caution, as it removes the gurantees about
recoverability.
-
x_stats_cpu_count (Optional): System CPU count.
If set, overrides the auto-detected value in the run metadata.
-
x_stats_cpu_logical_count (Optional): Logical CPU count.
If set, overrides the auto-detected value in the run metadata.
-
x_stats_disk_paths (Optional): System paths to monitor for disk usage.
-
x_stats_gpu_count (Optional): GPU device count.
If set, overrides the auto-detected value in the run metadata.
-
x_stats_gpu_device_ids (Optional): GPU device indices to monitor.
If not set, the system monitor captures metrics for all GPUs.
Assumes 0-based indexing matching CUDA/ROCm device enumeration.
-
x_stats_gpu_type (Optional): GPU device type.
If set, overrides the auto-detected value in the run metadata.
-
x_stats_open_metrics_endpoints (Optional): OpenMetrics /metrics
endpoints to monitor for system metrics.
-
x_stats_open_metrics_filters (Union): Filter to apply to metrics collected from OpenMetrics /metrics
endpoints.
Supports two formats:
- {“metric regex pattern, including endpoint name as prefix”: {“label”: “label value regex pattern”}}
- (“metric regex pattern 1”, “metric regex pattern 2”, …)
-
x_stats_open_metrics_http_headers (Optional): HTTP headers to add to OpenMetrics requests.
-
x_stats_sampling_interval (float): Sampling interval for the system monitor in seconds.
-
x_stats_track_process_tree (bool): Monitor the entire process tree for resource usage, starting from x_stats_pid
.
When True
, the system monitor aggregates the RSS, CPU%, and thread count
from the process with PID x_stats_pid
and all of its descendants.
This can have a performance overhead and is disabled by default.
-
x_update_finish_state (bool): Flag to indicate whether this process can update the run’s final state on the server.
Set to False in distributed training when only the main process should determine the final state.
9 - Optional Extras
The wandb
package provides optional extras that enable additional functionality for specific use cases. These extras install additional dependencies required for specialized features.
Installation
Install extras using pip with square brackets notation:
# Single extra
pip install "wandb[media]"
# Multiple extras
pip install "wandb[media,sweeps,launch]"
# Note: In zsh (default on macOS), use quotes or escape brackets
pip install wandb\[media\]
Enables advanced media logging capabilities for visualizations and multimedia content.
Installed packages:
bokeh
- Interactive visualization library
moviepy
- Video editing and processing
pillow
- Image processing (PIL fork)
plotly
- Interactive graphing library
imageio
- Image I/O operations
rdkit
- Cheminformatics and molecule visualization
soundfile
- Audio file I/O
Enabled features:
- Advanced plotting with
wandb.plot
functions
- Video logging with
wandb.Video
- Audio logging with
wandb.Audio
- Molecule visualization for chemistry/drug discovery
- Enhanced image processing capabilities
Example usage:
import wandb
import plotly.graph_objects as go
# Requires wandb[media]
wandb.init()
# Log interactive Plotly figures
fig = go.Figure(data=go.Bar(x=['A', 'B', 'C'], y=[1, 3, 2]))
wandb.log({"plotly_chart": wandb.Plotly(fig)})
# Log videos
wandb.log({"video": wandb.Video("path/to/video.mp4")})
wandb[workspaces]
Provides the Workspaces API for programmatically creating and managing W&B workspaces and reports.
Installed packages:
wandb-workspaces
- Official W&B workspaces library
Documentation:
See the dedicated Workspaces API Reference
Example usage:
import wandb_workspaces.workspaces as ws
workspace = ws.Workspace(
entity="team",
project="my-project",
sections=[
ws.Section(
name="Metrics",
panels=[
ws.LinePlot(x="Step", y=["loss", "accuracy"])
]
)
]
)
workspace.save()
wandb[sweeps]
Enhanced support for hyperparameter sweeps and optimization.
Enabled features:
- Advanced sweep configuration options
- Additional sweep controllers
- Enhanced parameter sampling strategies
Example usage:
import wandb
sweep_config = {
'method': 'bayes',
'metric': {'name': 'loss', 'goal': 'minimize'},
'parameters': {
'learning_rate': {'min': 0.001, 'max': 0.1},
'batch_size': {'values': [16, 32, 64]}
}
}
sweep_id = wandb.sweep(sweep_config, project="my-project")
wandb.agent(sweep_id, function=train_fn, count=10)
wandb[launch]
Enables W&B Launch for job orchestration and deployment.
Enabled features:
- Job queueing and scheduling
- Resource management
- Deployment to various compute backends
- Job templates and configurations
Example usage:
import wandb
from wandb.sdk.launch import launch
# Launch a job
launch(
uri="https://github.com/my-org/my-repo",
project="my-project",
entity="my-team",
config={"learning_rate": 0.01}
)
wandb[models]
Enhanced model management and versioning capabilities.
Enabled features:
- Model registry operations
- Model lineage tracking
- Advanced model artifact handling
- Model performance comparison tools
Example usage:
import wandb
run = wandb.init()
# Log model with enhanced metadata
model_artifact = wandb.Artifact(
name="my-model",
type="model",
metadata={
"framework": "pytorch",
"architecture": "resnet50",
"dataset": "imagenet"
}
)
model_artifact.add_file("model.pth")
run.log_artifact(model_artifact)
Cloud Provider Integrations
wandb[aws]
AWS integration for S3 artifact storage and SageMaker support.
Enabled features:
- Direct S3 artifact upload/download
- SageMaker training job integration
- AWS authentication helpers
- Optimized data transfer for AWS regions
wandb[azure]
Azure integration for blob storage and Azure ML support.
Enabled features:
- Azure Blob Storage for artifacts
- Azure ML workspace integration
- Azure authentication helpers
- Optimized data transfer for Azure regions
wandb[gcp]
Google Cloud Platform integration.
Enabled features:
- GCS (Google Cloud Storage) for artifacts
- Vertex AI integration
- GCP authentication helpers
- Optimized data transfer for GCP regions
wandb[kubeflow]
Integration with Kubeflow pipelines and workflows.
Enabled features:
- Kubeflow pipeline component wrappers
- Automatic pipeline metadata logging
- Kubeflow artifact handling
- Pipeline visualization in W&B
Example usage:
from wandb.integration.kubeflow import wandb_log
@wandb_log(project="my-project")
def training_component(learning_rate: float):
# Your training code
pass
wandb[importers]
Tools for importing data from other experiment tracking systems.
Enabled features:
- TensorBoard log importer
- MLflow run importer
- CSV/JSON data importers
- Legacy format converters
Example usage:
from wandb.apis.importers import import_tensorboard
import_tensorboard(
log_dir="./tb_logs",
project="imported-project",
entity="my-team"
)
wandb[perf]
Performance monitoring and profiling tools.
Enabled features:
- GPU memory profiling
- CPU/Memory profiling
- Training bottleneck detection
- Performance regression tracking
Example usage:
import wandb
from wandb.profiler import profile
run = wandb.init()
with profile("training_step"):
# Your training code
model.train()
loss.backward()
optimizer.step()
To verify which extras are installed:
import wandb
import pkg_resources
# Check wandb version
print(f"wandb version: {wandb.__version__}")
# Check for specific extra dependencies
extras_to_check = {
'media': ['plotly', 'bokeh', 'moviepy'],
'workspaces': ['wandb_workspaces'],
'sweeps': ['wandb'], # Core functionality, check for version
}
for extra, packages in extras_to_check.items():
print(f"\n{extra} extra:")
for package in packages:
try:
version = pkg_resources.get_distribution(package).version
print(f" ✓ {package} {version}")
except pkg_resources.DistributionNotFound:
print(f" ✗ {package} not installed")
Best Practices
-
Install only what you need: Each extra brings additional dependencies. Install only the extras required for your use case.
-
Version compatibility: Some extras may have specific version requirements. Use a virtual environment to avoid conflicts.
-
Production deployments: For production, explicitly specify versions:
pip install "wandb[media]==0.21.3"
-
Docker images: When building Docker images, install extras in a single layer:
RUN pip install "wandb[media,sweeps,launch]==0.21.3"
Troubleshooting
If you encounter import errors after installing extras:
-
Verify installation:
-
Reinstall with forced upgrade:
pip install --upgrade --force-reinstall "wandb[extra_name]"
-
Check for conflicting dependencies:
Shell escaping issues
If you get “no matches found” errors:
# Instead of:
pip install wandb[media] # May fail in zsh
# Use:
pip install "wandb[media]" # Works in all shells
# Or:
pip install wandb\[media\] # Escaped brackets
API Documentation Links
For detailed API documentation of features enabled by each extra:
Contributing
The extras are defined in the wandb
package’s setup.py
or pyproject.toml
. To request new extras or report issues:
- Check the W&B GitHub repository
- Open an issue with the “extras” label
- For
wandb-workspaces
, see its dedicated repository
10 - System Metrics Reference
Metrics automatically logged by W&B.
This page provides detailed information about the system metrics that are tracked by the W&B SDK.
wandb
automatically logs system metrics every 15 seconds.
CPU
Process CPU Percent (CPU)
Percentage of CPU usage by the process, normalized by the number of available CPUs.
W&B assigns a cpu
tag to this metric.
Process CPU Threads
The number of threads utilized by the process.
W&B assigns a proc.cpu.threads
tag to this metric.
Disk
By default, the usage metrics are collected for the /
path. To configure the paths to be monitored, use the following setting:
run = wandb.init(
settings=wandb.Settings(
x_stats_disk_paths=("/System/Volumes/Data", "/home", "/mnt/data"),
),
)
Disk Usage Percent
Represents the total system disk usage in percentage for specified paths.
W&B assigns a disk.{path}.usagePercent
tag to this metric.
Disk Usage
Represents the total system disk usage in gigabytes (GB) for specified paths.
The paths that are accessible are sampled, and the disk usage (in GB) for each path is appended to the samples.
W&B assigns a disk.{path}.usageGB
tag to this metric.
Disk In
Indicates the total system disk read in megabytes (MB).
The initial disk read bytes are recorded when the first sample is taken. Subsequent samples calculate the difference between the current read bytes and the initial value.
W&B assigns a disk.in
tag to this metric.
Disk Out
Represents the total system disk write in megabytes (MB).
Similar to Disk In, the initial disk write bytes are recorded when the first sample is taken. Subsequent samples calculate the difference between the current write bytes and the initial value.
W&B assigns a disk.out
tag to this metric.
Memory
Represents the Memory Resident Set Size (RSS) in megabytes (MB) for the process. RSS is the portion of memory occupied by a process that is held in main memory (RAM).
W&B assigns a proc.memory.rssMB
tag to this metric.
Process Memory Percent
Indicates the memory usage of the process as a percentage of the total available memory.
W&B assigns a proc.memory.percent
tag to this metric.
Memory Percent
Represents the total system memory usage as a percentage of the total available memory.
W&B assigns a memory_percent
tag to this metric.
Memory Available
Indicates the total available system memory in megabytes (MB).
W&B assigns a proc.memory.availableMB
tag to this metric.
Network
Network Sent
Represents the total bytes sent over the network.
The initial bytes sent are recorded when the metric is first initialized. Subsequent samples calculate the difference between the current bytes sent and the initial value.
W&B assigns a network.sent
tag to this metric.
Network Received
Indicates the total bytes received over the network.
Similar to Network Sent, the initial bytes received are recorded when the metric is first initialized. Subsequent samples calculate the difference between the current bytes received and the initial value.
W&B assigns a network.recv
tag to this metric.
NVIDIA GPU
In addition to the metrics described below, if the process and/or its descendants use a particular GPU, W&B captures the corresponding metrics as gpu.process.{gpu_index}.{metric_name}
GPU Memory Utilization
Represents the GPU memory utilization in percent for each GPU.
W&B assigns a gpu.{gpu_index}.memory
tag to this metric.
GPU Memory Allocated
Indicates the GPU memory allocated as a percentage of the total available memory for each GPU.
W&B assigns a gpu.{gpu_index}.memoryAllocated
tag to this metric.
GPU Memory Allocated Bytes
Specifies the GPU memory allocated in bytes for each GPU.
W&B assigns a gpu.{gpu_index}.memoryAllocatedBytes
tag to this metric.
GPU Utilization
Reflects the GPU utilization in percent for each GPU.
W&B assigns a gpu.{gpu_index}.gpu
tag to this metric.
GPU Temperature
The GPU temperature in Celsius for each GPU.
W&B assigns a gpu.{gpu_index}.temp
tag to this metric.
GPU Power Usage Watts
Indicates the GPU power usage in Watts for each GPU.
W&B assigns a gpu.{gpu_index}.powerWatts
tag to this metric.
GPU Power Usage Percent
Reflects the GPU power usage as a percentage of its power capacity for each GPU.
W&B assigns a gpu.{gpu_index}.powerPercent
tag to this metric.
GPU SM Clock Speed
Represents the clock speed of the Streaming Multiprocessor (SM) on the GPU in MHz. This metric is indicative of the processing speed within the GPU cores responsible for computation tasks.
W&B assigns a gpu.{gpu_index}.smClock
tag to this metric.
GPU Memory Clock Speed
Represents the clock speed of the GPU memory in MHz, which influences the rate of data transfer between the GPU memory and processing cores.
W&B assigns a gpu.{gpu_index}.memoryClock
tag to this metric.
GPU Graphics Clock Speed
Represents the base clock speed for graphics rendering operations on the GPU, expressed in MHz. This metric often reflects performance during visualization or rendering tasks.
W&B assigns a gpu.{gpu_index}.graphicsClock
tag to this metric.
GPU Corrected Memory Errors
Tracks the count of memory errors on the GPU that W&B automatically corrects by error-checking protocols, indicating recoverable hardware issues.
W&B assigns a gpu.{gpu_index}.correctedMemoryErrors
tag to this metric.
GPU Uncorrected Memory Errors
Tracks the count of memory errors on the GPU that W&B uncorrected, indicating non-recoverable errors which can impact processing reliability.
W&B assigns a gpu.{gpu_index}.unCorrectedMemoryErrors
tag to this metric.
GPU Encoder Utilization
Represents the percentage utilization of the GPU’s video encoder, indicating its load when encoding tasks (for example, video rendering) are running.
W&B assigns a gpu.{gpu_index}.encoderUtilization
tag to this metric.
AMD GPU
W&B extracts metrics from the output of the rocm-smi
tool supplied by AMD (rocm-smi -a --json
).
ROCm 6.x (latest) and 5.x formats are supported. Learn more about ROCm formats in the AMD ROCm documentation. The newer format includes more details.
AMD GPU Utilization
Represents the GPU utilization in percent for each AMD GPU device.
W&B assigns a gpu.{gpu_index}.gpu
tag to this metric.
AMD GPU Memory Allocated
Indicates the GPU memory allocated as a percentage of the total available memory for each AMD GPU device.
W&B assigns a gpu.{gpu_index}.memoryAllocated
tag to this metric.
AMD GPU Temperature
The GPU temperature in Celsius for each AMD GPU device.
W&B assigns a gpu.{gpu_index}.temp
tag to this metric.
AMD GPU Power Usage Watts
The GPU power usage in Watts for each AMD GPU device.
W&B assigns a gpu.{gpu_index}.powerWatts
tag to this metric.
AMD GPU Power Usage Percent
Reflects the GPU power usage as a percentage of its power capacity for each AMD GPU device.
W&B assigns a gpu.{gpu_index}.powerPercent
to this metric.
Apple ARM Mac GPU
Apple GPU Utilization
Indicates the GPU utilization in percent for Apple GPU devices, specifically on ARM Macs.
W&B assigns a gpu.0.gpu
tag to this metric.
Apple GPU Memory Allocated
The GPU memory allocated as a percentage of the total available memory for Apple GPU devices on ARM Macs.
W&B assigns a gpu.0.memoryAllocated
tag to this metric.
Apple GPU Temperature
The GPU temperature in Celsius for Apple GPU devices on ARM Macs.
W&B assigns a gpu.0.temp
tag to this metric.
Apple GPU Power Usage Watts
The GPU power usage in Watts for Apple GPU devices on ARM Macs.
W&B assigns a gpu.0.powerWatts
tag to this metric.
Apple GPU Power Usage Percent
The GPU power usage as a percentage of its power capacity for Apple GPU devices on ARM Macs.
W&B assigns a gpu.0.powerPercent
tag to this metric.
Graphcore IPU
Graphcore IPUs (Intelligence Processing Units) are unique hardware accelerators designed specifically for machine intelligence tasks.
IPU Device Metrics
These metrics represent various statistics for a specific IPU device. Each metric has a device ID (device_id
) and a metric key (metric_key
) to identify it. W&B assigns a ipu.{device_id}.{metric_key}
tag to this metric.
Metrics are extracted using the proprietary gcipuinfo
library, which interacts with Graphcore’s gcipuinfo
binary. The sample
method fetches these metrics for each IPU device associated with the process ID (pid
). Only the metrics that change over time, or the first time a device’s metrics are fetched, are logged to avoid logging redundant data.
For each metric, the method parse_metric
is used to extract the metric’s value from its raw string representation. The metrics are then aggregated across multiple samples using the aggregate
method.
The following lists available metrics and their units:
- Average Board Temperature (
average board temp (C)
): Temperature of the IPU board in Celsius.
- Average Die Temperature (
average die temp (C)
): Temperature of the IPU die in Celsius.
- Clock Speed (
clock (MHz)
): The clock speed of the IPU in MHz.
- IPU Power (
ipu power (W)
): Power consumption of the IPU in Watts.
- IPU Utilization (
ipu utilisation (%)
): Percentage of IPU utilization.
- IPU Session Utilization (
ipu utilisation (session) (%)
): IPU utilization percentage specific to the current session.
- Data Link Speed (
speed (GT/s)
): Speed of data transmission in Giga-transfers per second.
Google Cloud TPU
Tensor Processing Units (TPUs) are Google’s custom-developed ASICs (Application Specific Integrated Circuits) used to accelerate machine learning workloads.
TPU Memory usage
The current High Bandwidth Memory usage in bytes per TPU core.
W&B assigns a tpu.{tpu_index}.memoryUsageBytes
tag to this metric.
TPU Memory usage percentage
The current High Bandwidth Memory usage in percent per TPU core.
W&B assigns a tpu.{tpu_index}.memoryUsageBytes
tag to this metric.
TPU Duty cycle
TensorCore duty cycle percentage per TPU device. Tracks the percentage of time over the sample period during which the accelerator TensorCore was actively processing. A larger value means better TensorCore utilization.
W&B assigns a tpu.{tpu_index}.dutyCycle
tag to this metric.
AWS Trainium
AWS Trainium is a specialized hardware platform offered by AWS that focuses on accelerating machine learning workloads. The neuron-monitor
tool from AWS is used to capture the AWS Trainium metrics.
Trainium Neuron Core Utilization
The utilization percentage of each NeuronCore, reported on a per-core basis.
W&B assigns a trn.{core_index}.neuroncore_utilization
tag to this metric.
Trainium Host Memory Usage, Total
The total memory consumption on the host in bytes.
W&B assigns a trn.host_total_memory_usage
tag to this metric.
Trainium Neuron Device Total Memory Usage
The total memory usage on the Neuron device in bytes.
W&B assigns a trn.neuron_device_total_memory_usage)
tag to this metric.
Trainium Host Memory Usage Breakdown:
The following is a breakdown of memory usage on the host:
- Application Memory (
trn.host_total_memory_usage.application_memory
): Memory used by the application.
- Constants (
trn.host_total_memory_usage.constants
): Memory used for constants.
- DMA Buffers (
trn.host_total_memory_usage.dma_buffers
): Memory used for Direct Memory Access buffers.
- Tensors (
trn.host_total_memory_usage.tensors
): Memory used for tensors.
Trainium Neuron Core Memory Usage Breakdown
Detailed memory usage information for each NeuronCore:
- Constants (
trn.{core_index}.neuroncore_memory_usage.constants
)
- Model Code (
trn.{core_index}.neuroncore_memory_usage.model_code
)
- Model Shared Scratchpad (
trn.{core_index}.neuroncore_memory_usage.model_shared_scratchpad
)
- Runtime Memory (
trn.{core_index}.neuroncore_memory_usage.runtime_memory
)
- Tensors (
trn.{core_index}.neuroncore_memory_usage.tensors
)
OpenMetrics
Capture and log metrics from external endpoints that expose OpenMetrics / Prometheus-compatible data with support for custom regex-based metric filters to be applied to the consumed endpoints.
Refer to Monitoring GPU cluster performance in W&B for a detailed example of how to use this feature in a particular case of monitoring GPU cluster performance with the NVIDIA DCGM-Exporter.