Skip to main content

KL_coeff = 0 and encoded dimension

Created on July 14|Last edited on July 21
  • I compared the performance in terms of reconstruction of the models with kl_coefficient = 0, varying the encoded dimension and testing the standard CNN architecture against a different architecture inspired to ResNet18 (both encoder and decoder), including skip connections (NB: to remain faithful to the original implementation this has more filters and more downsampling for now).
  • The results show a clear correlation between the encoded dimension and the reconstruction capabilities of the model, perhaps implying that more dimensions that we originally assumed are necessary for this dataset.
  • I notice a difference between in the reconstructions in that the CNN images tend to look more 'blurry' while the resnet outputs present more 'pixelated' images.
  • Although the reconstruction loss is higher for the resnet-inspired architecture, the generations seem to have at least capture some features (tonality)

CVAECVAE_resnetmodel100150200250300350400450500550encoded_dim00kl_coefficient5678910111213141516171819reconstruction_loss




Run set
15