Tables Tutorial: Recreating Whale Melodies on Orchestral Instruments

Interactively exploring ML data and predictions in the audio domain with our new Tables feature
Created on July 11|Last edited on September 13
Comment
﻿W&B Tables—our latest feature for dataset and prediction visualization—enables interactive exploration and analysis of audio data. In this short example, I render whale song as human music. The idea here is to synthesize melodies from the vocalization of whales and other marine mammals as they would sound on a violin, trumpet, etc. 
Specifically, I use Differentiable Digital Signal Processing from Tensorflow's Magenta (resources, colab demo) to generate the music from recordings in the Watkins Marine Mammal Sound Database (via the Woods Hole Oceanographic Institution).
And yes: you can try it yourself with your own audio. 
﻿Try it with your own audio recording →﻿﻿﻿﻿
Now though, let's dig into how Tables works for audio. 
Visualize & explore your dataI chose five of the more coherent whale songs as initial samples. With a dataset visualization table, you can see and interact with your data directly: listen to audio samples, play videos, see images, and more. 
This means you don't need to fill up local storage, wait for files to download, open media in a different app, or navigate multiple windows/browser tabs of file directories. Below, you can see those five whale songs in Tables: 
﻿
﻿
Simplified sample code﻿﻿﻿﻿To see and interact with audio files, log them directly into a wandb.Table. To visualize a piece of media such as an image, video, or song (audio file) in the browser, we need to wrap it in a wandb object of the matching type—in this case, wandb.Audio(). The wandb object takes in the raw contents or a file path to render the contents of the file. 
wandb.init(project="whalesong", name="upload_songs")
columns = ["id", "song_file","spec_plot", "species"]
data = [["001", "001.wav", "001_spec.png", "Bowhead Whale"],
        ["002", "002.wav", "002_spec.png", "Harp Seal"]]
﻿
# create a Table with the specified columns
table = wandb.Table(columns=columns)
for song_id, song_file, song_plot, species in data:
   # combine song metadata and interactive media
  table.add_row(song_id, wandb.Audio(song_file, sample_rate=16000), \
                wandb.Image(song_plot), species)
﻿
wandb.run.log({"whale_songs" : table}) 
You can also associate an interactive Table with a specific versioned folder of data through Artifacts (see the end of this report or a live example & documentation →).﻿﻿
Filter and organize the data tableYou can group by any column: say, group by "species" to listen to different samples or compare spectrograms (scroll right) from the same marine mammal in one row.
﻿
﻿
Interactively analyze resultsBeyond raw data, you may want to visualize training results: model predictions over the course of training, examples generated with different hyperparameters, etc. Tables does that too. 
You can join these to existing data tables to set up powerful interactive visualizations and analysis. Here I have synthesized a few renditions of the marine mammal melodies in different human instruments like violin, flute, and tenor sax, via the amazing DDSP library and Colab Notebook from Magenta for timbre transfer. These synthetic songs are local .wav files created in a Colab or my local dev environment. Each file is associated with the original song_id and the target instrument.
Here are all the generated results. Scroll down for more songs & right to view more synthesis parameters.
﻿
Run set4
﻿
Group by relevant fields to analyzeYou can group by id—or even by the file itself, using the unique hash of the audio data in orig_song—to see all the transformations of a given song in one row (the same melody played on a flute, violin, trumpet, or tenor sax). Alternatively, you can also group by instrument to compare timbre across melodies and species.
Find the header of the column you'd like to group by, click on the three dot menu on the right of the column name, and select "Group by" from the dropdown. You can "Reset & Automate Columns" to reload the default view.
﻿
Run: full upload1
﻿
Compare original and synthetic songs﻿﻿To listen to both song versions side-by-side, you can join the table of original songs to the table of generated songs:
Query across existing tables to create a new wandb.JoinedTable without duplicating data
Join flexibly across data tablesNext, I'll join across tables logged in earlier artifacts to efficiently create new views for analysis—without duplicating your data. 
I've logged all the information about the original marine songs in a song_samples table of my playable_songs artifact and about the synthesized songs in a synth_song_samples table of my synth_samples artifact. To compare the original and synthesized versions, I can join these tables on a single key (or a list of two keys) and even change the join type for the sub-tables (inner, outer, etc) from the browser:
run = wandb.init(project="whale-songs", job_type="explore")
﻿
# original songs table
orig_songs_at = run.use_artifact('playable_songs:latest') 
orig_table = orig_songs_at.get("song_samples")
﻿
# synth songs table
synth_songs_at = run.use_artifact('synth_samples:latest')
synth_table = synth_songs_at.get("synth_song_samples")
﻿
# join the tables on song_id
join_table = wandb.JoinedTable(synth_table, orig_table, "song_id")
join_at = wandb.Artifact("synth_summary", "analysis")
join_at.add(join_table, "synth_explore")
run.log_artifact(join_at)
﻿
Next stepsThis is an early proof of concept to illustrate the power of W&B Dataset and Prediction Visualization for the audio space. I hope to explore timber transfer, DDSP, and the Tensorflow Magenta toolset in more depth—and more serious applications like identifying and tracking different marine mammal species based on underwater recordings—in future reports.
P.S. Version data by referenceFor this project, I can also store my toy dataset in a remote bucket and ﻿﻿version it in a reference artifact﻿﻿. The change from logging a regular W&B Artifact is minimal: instead of adding a local path with artifact.add_artifact([your local file path]), add a remote path (generally a URI) with artifact.add_reference([your remote path]). You can read more about reference artifacts here.
import wandb
run = wandb.init(project="whale-songs", job_type="upload")
# path to my remote data directory in Google Cloud Storage
bucket = "gs://wandb-artifact-refs-public-test/whalesong"
# create a regular artifact
dataset_at = wandb.Artifact('sample_songs',type="raw_data")
# creates a checksum for each file and adds a reference to the bucket
# instead of uploading all of the contents
dataset_at.add_reference(bucket)
run.log_artifact(dataset_at)
List of file paths and sizes in this reference bucket. Note that these are merely references to the contents, not actual files stored in W&B, so they are not available for download from this view
Download data from the cloudOf course you can still fetch the files from the reference artifact and use the data locally:
import wandb
run = wandb.init(project="whale-songs", job_type="show_samples")
dataset_at = run.use_artifact("sample_songs:latest")
songs_dir = dataset_at.download()
# all files available locally in songs_dir
Sample code to add a Table to an Artifact﻿﻿(assuming my songs live in a local folder called whalesong/synth)
import os
import wandb
﻿
run = wandb.init(project="whale-songs", job_type="log_synth")
# full path to the specific folder of synthetic songs:
synth_songs_dir = "whalesong/synth"
﻿
# track all the files in the specific folder of synth songs
dataset_at = wandb.Artifact('synth_songs',type="generated_data")
dataset_at.add_dir(synth_songs_dir)
﻿
# create a table to hold audio samples and metadata in columns
columns = ["song_id", "song_name", "audio", "instrument"]
table = wandb.Table(columns=columns)
﻿
# iterate over all the songs and add them to the data table
for synth_song in os.listdir(synth_songs_dir)
  # song filenames have the form [string id]_[instrument].wav
  song_name = synth_song.split("/")[-1]
  song_path = os.path.join(synth_songs_dir, song_name)
﻿
  # create a wandb.Audio object to show the audio file
  audio = wandb.Audio(song_path, sample_rate=16000)
﻿
  # extract instrument from the filename
  orig_song_id, instrument = song_name.split("_")
  table.add_data(orig_song_id, song_name, audio, instrument.split(".")[0])
﻿
# log the table via a new artifact
songs_at = wandb.Artifact("synth_samples", type="synth_ddsp")
songs_at.add(table, "synth_song_samples")
run.log_artifact(songs_at)
﻿
Learn more about Tables: 
Tables Tutorial: Visualize Data for Image Classification
How to version and interactively explore data and predictions across train/val/test with W&B's new Tables feature
Tables Tutorial: Visualize Text Data & Predictions 
A guide on how to log and organize text data and language model predictions with our old friend William Shakespeare
SBX Robotics: Synthetic Training Data & Scene Composition with Tables
Exploring the impact of scene composition on segmentation model performance with the new Tables view in W&B.
Announcing W&B Tables: Iterate on Your Data
Today, we're excited to launch W&B Tables, a new tool for data iteration and model evaluation. Here's how it works:
﻿
﻿