
Hazy Image Restoration Using Keras

An introduction to building an image restoration model using TensorFlow, Keras, and Weights & Biases.

Introduction

The existence of haze in the atmosphere, caused by aerosols such as dust, mist, and smoke, adds complicated noise to photographs. It dramatically degrades the visibility of outdoor images (and sometimes even indoor ones): contrast is reduced and surface colors become faint.
From an ML perspective, a hazy image jeopardizes the effectiveness of many subsequent high-level computer vision tasks such as object detection, recognition, and segmentation.

The presence of atmospheric haze significantly impacts the performance of state-of-the-art object detection systems such as YOLOv6. This makes the restoration of images an extremely crucial aspect of any computer vision pipeline.

It follows, then, that image dehazing is a widely explored problem in computer vision. In this report, we'll explore a fast and simple end-to-end dehazing method proposed in the paper An All-in-One Network for Dehazing and Beyond.
The paper, albeit quite old by the blazing-fast standards of ML research, was the first to propose an end-to-end trainable image dehazing model that directly produces a clean image from a hazy one. It formulates a simple neural network architecture called AODNet (All-in-One Dehazing Network), which we'll train using TensorFlow and Keras. We'll also share and visualize the results of our experiments using Weights & Biases.
This article was written as a Weights & Biases Report, a project management and collaboration tool for machine learning projects. Reports let you organize and embed visualizations, describe your findings, share updates with collaborators, and more. To learn more about reports, check out Collaborative Reports.
💡

What We'll Be Covering

  • Looking at the Dataset
  • The Data Input Pipeline
  • The All-in-One Dehazing Network
  • Training AODNet
  • Evaluation Results
  • Interactive Demo

Looking at the Dataset

We'll train the AODNet model primarily on synthetic hazy images and test on both synthetic and real natural images from multiple datasets. Below, we list the datasets along with Weave panels to visualize and explore them:

The Image-dehazing Dataset




The Data Input Pipeline

To train our model, we build a simple TensorFlow-based data input pipeline using the tf.data API. The tf.data API makes it possible to handle large amounts of data, read from different data formats, and perform complex transformations.

The Data Input Pipeline
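As a rough sketch of what such a pipeline can look like, here's a minimal tf.data setup for paired hazy/clean images. The image size, batch size, and PNG file format below are illustrative assumptions, and the exact code in the panel above may differ:

```python
import tensorflow as tf

AUTOTUNE = tf.data.AUTOTUNE
IMAGE_SIZE = 256  # assumed training resolution
BATCH_SIZE = 16   # assumed batch size

def load_image(path):
    """Reads, decodes, resizes, and normalizes a single image to [0, 1]."""
    image = tf.io.read_file(path)
    image = tf.image.decode_png(image, channels=3)  # assuming PNG-encoded images
    image = tf.image.resize(image, (IMAGE_SIZE, IMAGE_SIZE))
    return image / 255.0

def load_image_pair(hazy_path, clean_path):
    """Loads a (hazy, clean) training pair from their file paths."""
    return load_image(hazy_path), load_image(clean_path)

def build_dataset(hazy_paths, clean_paths, shuffle=True):
    """Builds a batched, prefetched tf.data pipeline over paired image paths."""
    dataset = tf.data.Dataset.from_tensor_slices((hazy_paths, clean_paths))
    dataset = dataset.map(load_image_pair, num_parallel_calls=AUTOTUNE)
    if shuffle:
        dataset = dataset.shuffle(buffer_size=512)
    return dataset.batch(BATCH_SIZE).prefetch(AUTOTUNE)
```

Given lists of file paths, train_dataset = build_dataset(train_hazy_paths, train_clean_paths) then yields batches of (hazy, clean) tensors ready to be passed to model.fit().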




The All-in-One Dehazing Network

Now that we have the data input pipeline in place, let's take a detailed look at the AODNet architecture.

The Mathematical Model for Hazy Images

The classical description of hazy image generation (and, conversely, of image dehazing) is the atmospheric scattering model. According to this model, a hazy image is given by...
I(x) = J(x)t(x) + A(1 - t(x))

...where:
  • I(x) is the observed hazy image
  • J(x) is the ideal clean image
  • A denotes the global atmospheric light
  • t(x) is the transmission matrix
The image dehazing problem can be formulated by rearranging the atmospheric scattering model to express the clean image in terms of the hazy one:
J(x) = \frac{1}{t(x)} I(x) - A \frac{1}{t(x)} + A

Many prior approaches to image dehazing, such as DehazeNet and multi-scale CNNs, follow the same three-step procedure:
  1. Estimate the transmission matrix t(x) from the hazy image I(x) using a sophisticated deep model.
  2. Estimate the global atmospheric light A using some kind of empirical method.
  3. Estimate the clean image J(x) using the formulation above.
However, this procedure leads to a sub-optimal solution that does not directly minimize the image reconstruction error: estimating t(x) and A separately causes accumulated, or even amplified, errors when the two are combined to compute the clean image J(x).

The Core Idea Behind AODNet

The core idea behind AODNet is to unify the transmission matrix t(x) and the global atmospheric light A into a single variable K(x). The formulation of the clean image then becomes...
J(x) = K(x)I(x) - K(x) + 1

Since K(x) depends on I(x), we can build a deep convolutional model that takes hazy images as input and minimizes the reconstruction error between its output J(x) and the ground-truth clean image.
The overall architecture of AODNet is summarized in the following diagram:
Source: Figure 4(a) from https://arxiv.org/pdf/1707.06543.pdf
Note that K(x) is simply t(x) and A folded into a single pixel-wise variable. It can be expressed as K(x) = \frac{\frac{1}{t(x)}(I(x) - A) + (A - b)}{I(x) - 1}, where b is a constant bias (the paper uses b = 1, which gives the formula above). The exact expression of K(x) is not important for us, however, since we train a deep convolutional neural network to approximate it.
💡
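For the curious, the expression for K(x) follows from equating the transformed formulation J(x) = K(x)I(x) - K(x) + b with the atmospheric scattering formulation of J(x) and solving for K(x):

K(x)\bigl(I(x) - 1\bigr) = \frac{1}{t(x)}\bigl(I(x) - A\bigr) + (A - b)

K(x) = \frac{\frac{1}{t(x)}\bigl(I(x) - A\bigr) + (A - b)}{I(x) - 1}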

Implementing the AODNet

As the architecture diagram above shows, the AODNet model essentially consists of two modules:
  1. A K-estimation module that estimates K(x) from the hazy image I(x). This module is responsible for estimating the depth and relative haze level.
  2. A clean image generation module that generates the clean image from K(x) using the transformed formulation of the atmospheric scattering model.

Implementation of the AODNet Architecture
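As a reference, here is a minimal Keras sketch of the architecture following Figure 4(a) of the paper: five convolutions with three filters each and kernel sizes 1, 3, 5, 7, and 3, joined by inter-layer concatenations, make up the K-estimation module, and the element-wise formula J(x) = K(x)I(x) - K(x) + 1 makes up the clean image generation module. The code in the panel above may differ in details:

```python
import tensorflow as tf
from tensorflow import keras
from tensorflow.keras import layers

def build_aodnet():
    """Builds AODNet: a K-estimation module followed by the
    clean image generation module (Figure 4(a) of the paper)."""
    inputs = keras.Input(shape=(None, None, 3))  # fully convolutional: any image size

    # K-estimation module: multi-scale convolutions with concatenations.
    conv1 = layers.Conv2D(3, 1, padding="same", activation="relu")(inputs)
    conv2 = layers.Conv2D(3, 3, padding="same", activation="relu")(conv1)
    concat1 = layers.Concatenate()([conv1, conv2])
    conv3 = layers.Conv2D(3, 5, padding="same", activation="relu")(concat1)
    concat2 = layers.Concatenate()([conv2, conv3])
    conv4 = layers.Conv2D(3, 7, padding="same", activation="relu")(concat2)
    concat3 = layers.Concatenate()([conv1, conv2, conv3, conv4])
    k = layers.Conv2D(3, 3, padding="same", activation="relu")(concat3)

    # Clean image generation module: J(x) = K(x) * I(x) - K(x) + 1.
    outputs = layers.ReLU()(k * inputs - k + 1.0)
    return keras.Model(inputs, outputs, name="aodnet")
```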




Training AODNet

Now that we have both the data input pipeline and the model implementation in place, let's train the model. We'll also explore the Weights & Biases callbacks for Keras, which let us log and track the results of our training experiments, as well as model checkpoints, on Weights & Biases with minimal effort.

The Loss, Optimizer and the Metrics
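The paper trains AODNet by minimizing the mean squared error between the restored image and the ground-truth clean image. A minimal compile step could look like the following sketch, where the Adam optimizer, learning rate, and PSNR/SSIM metrics are our assumptions rather than necessarily what the panel above uses:

```python
import tensorflow as tf
from tensorflow import keras

def psnr(y_true, y_pred):
    """Peak signal-to-noise ratio, assuming images scaled to [0, 1]."""
    return tf.image.psnr(y_true, y_pred, max_val=1.0)

def ssim(y_true, y_pred):
    """Structural similarity index, assuming images scaled to [0, 1]."""
    return tf.image.ssim(y_true, y_pred, max_val=1.0)

model = build_aodnet()
model.compile(
    optimizer=keras.optimizers.Adam(learning_rate=1e-4),  # assumed learning rate
    loss="mse",  # per-pixel reconstruction error, as in the paper
    metrics=[psnr, ssim],
)
```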


Experiment Tracking using Weights & Biases

Weights & Biases provides us with three callbacks for experiment tracking:
  • WandbMetricsLogger: This callback automatically tracks the training and validation logs (such as losses and metrics) and logs them on our Weights & Biases dashboard. It also tracks our system metrics, such as CPU and GPU utilization.
  • WandbModelCheckpoint: This callback periodically saves a Keras model or model weights and uploads it to Weights & Biases as an Artifact. This callback provides the following features:
    • Save the model that has achieved the best performance based on the monitor.
    • Save the model at the end of every epoch regardless of the performance.
    • Save the model at the end of the epoch or after a fixed number of training batches.
    • Save only model weights, or save the whole model.
    • Save the model either in SavedModel format or in h5 format.
  • WandbEvalCallback: This is an abstract base class for building Keras callbacks that visualize model predictions. We can build callbacks that visualize predictions on_epoch_end and pass them to model.fit() for different machine learning tasks such as classification, object detection, and segmentation. To use it, we create our own visualization callback that inherits from this base class and implements the add_ground_truth and add_model_predictions methods. The base WandbEvalCallback class will take care of the following:
    • Initialize data_table for logging the ground truth and pred_table for predictions.
    • The data uploaded to data_table is used as a reference for the pred_table. This is to reduce the memory footprint. The data_table_ref is a list that can be used to access the referenced data. We will demonstrate this with an example in the training notebook.
    • Log the tables to Weights & Biases as Artifacts.
    • Each new pred_table is logged as a new version with aliases.


Keras callbacks for logging into Weights & Biases
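Putting the three callbacks together, here's a hedged sketch of how they can be wired into model.fit(). The project name, epoch count, and the sample_hazy/sample_clean arrays (a small held-out batch of validation images) are illustrative assumptions:

```python
import wandb
from wandb.integration.keras import (
    WandbEvalCallback,
    WandbMetricsLogger,
    WandbModelCheckpoint,
)

class DehazingEvalCallback(WandbEvalCallback):
    """Logs hazy inputs, clean targets, and model predictions
    to W&B Tables at the end of every epoch."""

    def __init__(self, hazy_images, clean_images):
        super().__init__(
            data_table_columns=["index", "hazy", "clean"],
            pred_table_columns=["epoch", "index", "hazy", "clean", "predicted"],
        )
        self.hazy_images = hazy_images
        self.clean_images = clean_images

    def add_ground_truth(self, logs=None):
        # Populate the reference data_table once, before training starts.
        for idx, (hazy, clean) in enumerate(zip(self.hazy_images, self.clean_images)):
            self.data_table.add_data(idx, wandb.Image(hazy), wandb.Image(clean))

    def add_model_predictions(self, epoch, logs=None):
        # Reference rows of data_table instead of re-uploading the images.
        predictions = self.model.predict(self.hazy_images, verbose=0)
        for idx in self.data_table_ref.get_index():
            self.pred_table.add_data(
                epoch,
                self.data_table_ref.data[idx][0],
                self.data_table_ref.data[idx][1],
                self.data_table_ref.data[idx][2],
                wandb.Image(predictions[idx]),
            )

wandb.init(project="aodnet-dehazing")  # hypothetical project name
model.fit(
    train_dataset,
    validation_data=val_dataset,
    epochs=20,  # assumed epoch count
    callbacks=[
        WandbMetricsLogger(log_freq="epoch"),
        WandbModelCheckpoint(filepath="model.h5", monitor="val_loss", save_best_only=True),
        DehazingEvalCallback(sample_hazy, sample_clean),
    ],
)
```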

Now, let's take a look at the training logs:

Training Logs




Evaluation Results






Interactive Demo

We also created an interactive inference demo using Gradio that lets us easily run inference on hazy images of our choice and log the results to our Weights & Biases dashboard for reproducibility and future reference.
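A minimal sketch of such a demo, assuming model is the trained AODNet from above (the Weights & Biases logging step is omitted here for brevity):

```python
import gradio as gr
import numpy as np

def dehaze(image):
    """Dehazes a single RGB image supplied as a uint8 NumPy array."""
    inputs = np.expand_dims(image.astype("float32") / 255.0, axis=0)
    prediction = model.predict(inputs, verbose=0)[0]
    return (np.clip(prediction, 0.0, 1.0) * 255.0).astype("uint8")

demo = gr.Interface(fn=dehaze, inputs=gr.Image(), outputs=gr.Image(), title="AODNet Image Dehazing")
demo.launch()
```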



Demo




Conclusion

In this report, we walked through an end-to-end image dehazing pipeline: we explored the atmospheric scattering model and the AODNet architecture from An All-in-One Network for Dehazing and Beyond, built a tf.data input pipeline, implemented and trained the model using TensorFlow and Keras, and used the Weights & Biases callbacks for Keras to track experiments, checkpoints, and predictions. Finally, we wrapped the trained model in an interactive Gradio demo for quick inference on new hazy images.

