
Data Code



Log Data to #🪄🐝



Pull Data from Woven Planet and then Log to WANDB for Reuse across Different Models

Setup

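A minimal setup sketch, assuming the only packages needed here are l5kit and wandb (versions are not pinned):

```python
# Install the packages used below (uncomment in a notebook).
# !pip install l5kit wandb

import os

import numpy as np
import wandb
from l5kit.configs import load_config_data
from l5kit.data import ChunkedDataset, LocalDataManager
```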

Download Data

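The prediction dataset itself is distributed through the Woven Planet (Lyft Level 5) website and has to be downloaded after registering; the sketch below only assumes it has been unpacked locally (the path is a placeholder):

```python
# Point l5kit at the local copy of the dataset (path is a placeholder).
os.environ["L5KIT_DATA_FOLDER"] = "/path/to/l5kit_data"

dm = LocalDataManager(None)
# Resolve the sample scenes relative to L5KIT_DATA_FOLDER.
sample_zarr_path = dm.require("scenes/sample.zarr")
```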

Log Data to WANDB

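A sketch of starting a run and creating a dataset artifact; the project and artifact names are placeholders:

```python
# Start a W&B run and create an artifact that will hold the raw dataset files.
run = wandb.init(project="l5kit-prediction", job_type="upload-data")
artifact = wandb.Artifact(name="woven-planet-sample", type="dataset")
```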

Enrich Run Information and Artifact Information

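Runs and artifacts can carry extra context that makes them easier to find and reuse later. A sketch, with example values for the fields:

```python
# Attach searchable context to the run and the artifact (values are examples).
run.config.update({"dataset": "sample.zarr", "source": "Woven Planet Level 5 Prediction"})
run.notes = "Raw sample scenes pulled from the Woven Planet dataset"

artifact.description = "Sample .zarr scenes from the Woven Planet prediction dataset"
artifact.metadata = {"format": "zarr", "split": "sample"}
```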

Log Data to WANDB

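Finally, the downloaded files are added to the artifact and logged, so later runs can pull them back down with use_artifact. A sketch:

```python
# Add the dataset directory to the artifact, upload it, and close the run.
artifact.add_dir(sample_zarr_path, name="scenes/sample.zarr")
run.log_artifact(artifact)
run.finish()
```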

Visualize the Logged Data Back in #🪄🐝



Visualisation Examples

This notebook shows some of the visualisation utility of our toolkit.

The core packages for visualisation are:

rasterization

contains classes for getting visual data as multi-channel tensors and turning them into interpretable RGB images. Every class has at least a rasterize method to get the tensor and a to_rgb method to convert it into an image. A few examples are:

  • BoxRasterizer: this object renders agents (e.g. vehicles or pedestrians) as oriented 2D boxes
  • SatelliteRasterizer: this object renders an oriented crop from a satellite map

visualization

contains utilities to draw additional information (e.g. trajectories) onto RGB images. These utilities are commonly used after a to_rgb call to add other information to the final visualisation. One example is:

  • draw_trajectory: this function draws 2D trajectories from coordinates and yaws offset on an image

Setup

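A sketch of the imports this walkthrough relies on (module paths follow recent l5kit releases):

```python
import os

import matplotlib.pyplot as plt
import numpy as np

from l5kit.configs import load_config_data
from l5kit.data import ChunkedDataset, LocalDataManager
from l5kit.dataset import AgentDataset, EgoDataset
from l5kit.geometry import transform_points
from l5kit.rasterization import build_rasterizer
from l5kit.visualization import TARGET_POINTS_COLOR, draw_trajectory
```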

First, let's configure where our data lives!

The data is expected to live in a folder that can be configured using the L5KIT_DATA_FOLDER env variable. Your data folder is expected to contain subfolders for the aerial and semantic maps as well as the scenes (.zarr files). In this example, the env variable is set to the local data folder; make sure the path points to the correct location for you.

We built our code to work with a human-readable yaml config. This config file holds a lot of useful information; however, here we will only focus on the parts concerning loading and visualization.

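A sketch of the configuration step; the data folder path and the yaml file name are placeholders for your local setup:

```python
# Tell l5kit where the data lives and load the human-readable config.
os.environ["L5KIT_DATA_FOLDER"] = "/path/to/l5kit_data"
dm = LocalDataManager(None)
cfg = load_config_data("./visualisation_config.yaml")
print(cfg)
```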

We can look into our current configuration for interesting fields: when loaded in Python, the yaml file is converted into a Python dict.

raster_params contains all the information related to the transformation of the 3D world onto an image plane:

  • raster_size: the image plane size
  • pixel_size: how many meters correspond to a pixel
  • ego_center: our raster is centered around an agent, we can move the agent in the image plane with this param
  • map_type: the rasterizer to be employed. We currently support a satellite-based and a semantic-based one. We will look at the differences further down in this script
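Since the loaded config is a plain dict, these fields can be inspected directly; a sketch:

```python
# Inspect the rasterisation parameters of the loaded config.
for key, value in cfg["raster_params"].items():
    print(f"{key}: {value}")
```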

Load the data

The same config file is also used to load the data. Every split in the data has its own section, and multiple datasets can be used (as a whole or sliced). In this short example we will only use the first dataset from the sample set. You can change this by configuring the 'train_data_loader' variable in the config.

You may also have noticed that we're building a LocalDataManager object. This will resolve relative paths from the config using the L5KIT_DATA_FOLDER env variable we have just set.

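A sketch of opening the first dataset referenced by the config (the config key follows the layout used in the l5kit sample configs):

```python
# Resolve the dataset path from the config and open the zarr dataset.
dataset_path = dm.require(cfg["train_data_loader"]["key"])
zarr_dataset = ChunkedDataset(dataset_path)
zarr_dataset.open()
print(zarr_dataset)
```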

Working with the raw data

.zarr files support most of the traditional numpy array operations. In the following cell we iterate over the frames to get a scatter plot of the AV locations:

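A sketch of that loop, assuming the frames table exposes an ego_translation field as in the public dataset schema:

```python
# Iterate over the frames and collect the ego (AV) centroid for each one.
frames = zarr_dataset.frames
coords = np.zeros((len(frames), 2))
for idx in range(len(frames)):
    coords[idx] = frames[idx]["ego_translation"][:2]

plt.scatter(coords[:, 0], coords[:, 1], marker=".")
plt.title("AV locations")
plt.xlabel("x [m]")
plt.ylabel("y [m]")
plt.show()
```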

Another easy thing to try is to get an idea of the agent type distribution.

We can take all the agents' label_probabilities and get the argmax for each row. Because .zarr files map to numpy arrays, we can use all the traditional numpy operations and functions.

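A sketch of counting the predicted label for every agent; PERCEPTION_LABELS is the label list shipped with l5kit:

```python
from collections import Counter

from l5kit.data import PERCEPTION_LABELS

# Load the agents table into memory and take the argmax of the label probabilities.
agents = zarr_dataset.agents[:]
labels_indexes = np.argmax(agents["label_probabilities"], axis=1)

for label_idx, count in Counter(labels_indexes).most_common():
    print(f"{PERCEPTION_LABELS[label_idx]}: {count}")
```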

Working with data abstraction

Even though it's absolutely fine to work with the raw data, we also provide classes that abstract data access to offer an easier way to generate inputs and targets.

Core Objects

Along with the rasterizer, our toolkit contains other classes you may want to use while you build your solution. The dataset package, for example, already implements PyTorch ready datasets, so you can hit the ground running and start coding immediately.

Dataset package

We will use two classes from the dataset package for this example. Both of them can be iterated and return multi-channel images from the rasterizer along with future trajectories offsets and other information.

  • EgoDataset: this dataset iterates over the AV annotations
  • AgentDataset: this dataset iterates over other agents' annotations

Both support multi-threading (through the PyTorch DataLoader) out of the box.

What if I want to visualise the Autonomous Vehicle (AV)?

Let's get a sample from the dataset and use our rasterizer to get an RGB image we can plot.

If we want to plot the ground truth trajectory, we can convert the dataset's target_position (displacements in meters in agent coordinates) into pixel coordinates in the image space, and call our utility function draw_trajectory (note that you can use this function for the predicted trajectories, as well).

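A sketch of that flow; the exact sample keys and the draw_trajectory signature vary slightly across l5kit versions, so treat this as a recent-version layout:

```python
# Build the rasterizer from the config and wrap the zarr dataset in an EgoDataset.
rast = build_rasterizer(cfg, dm)
dataset = EgoDataset(cfg, zarr_dataset, rast)
data = dataset[50]  # an arbitrary sample index

# Convert the multi-channel tensor into an RGB image.
im = dataset.rasterizer.to_rgb(data["image"].transpose(1, 2, 0))

# Project the ground-truth displacements from agent coordinates into pixel space.
target_positions_pixels = transform_points(data["target_positions"], data["raster_from_agent"])
draw_trajectory(im, target_positions_pixels, TARGET_POINTS_COLOR)

plt.imshow(im)
plt.show()
```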

What if I want to change the rasterizer?

We can do so easily by building a new rasterizer and new dataset for it. In this example, we change the value to py_satellite which renders boxes on an aerial image.

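A sketch of swapping in the satellite rasterizer:

```python
# Change the map type and rebuild the rasterizer and dataset.
cfg["raster_params"]["map_type"] = "py_satellite"
rast = build_rasterizer(cfg, dm)
dataset = EgoDataset(cfg, zarr_dataset, rast)
data = dataset[50]

im = dataset.rasterizer.to_rgb(data["image"].transpose(1, 2, 0))
plt.imshow(im)
plt.show()
```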

What if I want to visualise an agent?

Glad you asked! We can just replace the EgoDataset with an AgentDataset. Now we're iterating over agents and not the AV anymore, and the first one happens to be the pace car (you will see this one around a lot in the dataset).

Semantic

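A sketch of the semantic view of the first agent; the image is kept around as im_semantic so it can be logged to a table later:

```python
# Iterate over agents instead of the AV by swapping in an AgentDataset.
cfg["raster_params"]["map_type"] = "py_semantic"
rast = build_rasterizer(cfg, dm)
dataset = AgentDataset(cfg, zarr_dataset, rast)
data = dataset[0]  # the first agent is the pace car

im_semantic = dataset.rasterizer.to_rgb(data["image"].transpose(1, 2, 0))
plt.imshow(im_semantic)
plt.show()
```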

Satellite

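The satellite view only differs in the map type; a sketch:

```python
# Same agent, rendered on the aerial map.
cfg["raster_params"]["map_type"] = "py_satellite"
dataset = AgentDataset(cfg, zarr_dataset, build_rasterizer(cfg, dm))
im_satellite = dataset.rasterizer.to_rgb(dataset[0]["image"].transpose(1, 2, 0))
plt.imshow(im_satellite)
plt.show()
```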

Join into Table

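A sketch of joining the two renderings into a W&B Table so they can be browsed side by side in the UI; the run, table, and column names are placeholders, and im_semantic/im_satellite come from the sketches above:

```python
# Log the semantic and satellite renderings side by side.
run = wandb.init(project="l5kit-prediction", job_type="visualisation")
table = wandb.Table(columns=["agent_index", "semantic", "satellite"])
table.add_data(0, wandb.Image(im_semantic), wandb.Image(im_satellite))
run.log({"agent_views": table})
run.finish()
```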

System Origin and Orientation

~At this point you may have noticed that we vertically flip the image before plotting it.~

Vertical flipping is not required anymore as it's already performed inside the rasteriser.

Further, all our rotations are counter-clockwise for positive values of the angle.

What does an entire scene look like?

It's easy to visualise an individual scene using our toolkit. Both EgoDataset and AgentDataset provide 2 methods for getting interesting indices:

  • get_frame_indices returns the indices for a given frame. For the EgoDataset this matches a single observation, while more than one index could be available for the AgentDataset, as that given frame may contain more than one valid agent
  • get_scene_indices returns indices for a given scene. For both datasets, these might return more than one index

In this example, we visualise a scene from the ego's point of view:

Semantic

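A sketch of rendering an entire scene frame by frame on the semantic map (the scene index is arbitrary):

```python
# Render every frame of one scene from the ego's point of view.
cfg["raster_params"]["map_type"] = "py_semantic"
dataset = EgoDataset(cfg, zarr_dataset, build_rasterizer(cfg, dm))

scene_idx = 0
frames_rgb = []
for idx in dataset.get_scene_indices(scene_idx):
    data = dataset[idx]
    frames_rgb.append(dataset.rasterizer.to_rgb(data["image"].transpose(1, 2, 0)))

# Show every 20th frame of the scene as a quick strip.
subset = frames_rgb[::20]
fig, axes = plt.subplots(1, len(subset), figsize=(20, 4))
for ax, frame in zip(np.atleast_1d(axes), subset):
    ax.imshow(frame)
    ax.axis("off")
plt.show()
```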

Satellite

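The satellite version of the same scene again only changes the map type; a sketch showing a single frame:

```python
# First frame of the same scene on the aerial map.
cfg["raster_params"]["map_type"] = "py_satellite"
dataset = EgoDataset(cfg, zarr_dataset, build_rasterizer(cfg, dm))
data = dataset[dataset.get_scene_indices(scene_idx)[0]]
plt.imshow(dataset.rasterizer.to_rgb(data["image"].transpose(1, 2, 0)))
plt.show()
```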

Introducing a new visualizer

Starting from l5kit v1.3.0, you can use an interactive visualiser (based on Bokeh) to inspect the scene.

The visualization can be built starting from individual scenes and allows for a closer inspection over ego, agents and trajectories.

PRO TIP: try hovering over an agent to show information about it.

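A sketch of the interactive visualiser, following the module layout introduced around l5kit v1.3.0:

```python
from bokeh.io import output_notebook, show

from l5kit.data import MapAPI
from l5kit.visualization.visualizer.visualizer import visualize
from l5kit.visualization.visualizer.zarr_utils import zarr_to_visualizer_scene

output_notebook()
map_api = MapAPI.from_cfg(dm, cfg)

# Build an interactive Bokeh figure for the first couple of scenes.
for scene_idx in range(2):
    scene_dataset = zarr_dataset.get_scene_dataset(scene_idx)
    vis_in = zarr_to_visualizer_scene(scene_dataset, map_api)
    show(visualize(scene_idx, vis_in))
```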