Fast and Efficient Image Generation Using Variational Autoencoders and K-Nearest Neighbor OveRsampling Approach

Islam, Ashhadul; Belhaouari, Samir Brahim

doi:10.1109/access.2023.3259236

10.1109_access.2023.3259236.pdf (1.62 MB)

Fast and Efficient Image Generation Using Variational Autoencoders and K-Nearest Neighbor OveRsampling Approach

journal contribution

submitted on 2024-02-12, 08:48 and posted on 2024-02-12, 08:48 authored by Ashhadul Islam, Samir Brahim Belhaouari

Researchers gravitate towards Generative Adversarial Networks (GAN) to create artificial images. However, GANs suffer from convergence issues, mode collapse, and overall complexity in balancing the Nash Equilibrium. Images generated are often distorted, rendering them useless. We propose a combination of Variational Autoencoders (VAEs) and a statistical oversampling method called K-Nearest Neighbor OveRsampling (KNNOR) to create artificial images. This combination of VAE and KNNOR results in more life-like images with reduced distortion. We fine-tune several pre-trained networks on a separate set of real and fake face images to test images generated by our method against images generated by conventional Deep Convolutional GANs (DCGANs). We also compare the combination of VAEs and Synthetic Minority Oversampling Technique (SMOTE) to establish the efficacy of KNNOR against naive oversampling methods. Not only are our methods better able to convince the classifiers that the images generated are authentic, but the models are also half in size of DCGANs. The code is available at GitHub for public use.

Other Information

Published in: IEEE Access
License: http://creativecommons.org/licenses/by/4.0
See article on publisher's website: https://dx.doi.org/10.1109/access.2023.3259236

Funding

Open Access funding provided by the Qatar National Library.

History

Language

English

Publisher

IEEE

Publication Year

2023

License statement

This Item is licensed under the Creative Commons Attribution 4.0 International License

Institution affiliated with

Hamad Bin Khalifa University
College of Science and Engineering - HBKU

Usage metrics

Keywords

Generators Feature extraction Generative adversarial networks Decoding Training Nash equilibrium Image synthesis Face recognition variational autoencoders Image reconstruction artificial image creation

Licence

CC BY 4.0

Exports

RefWorks

BibTeX

Ref. manager

Endnote

DataCite

NLM

DC

Fast and Efficient Image Generation Using Variational Autoencoders and K-Nearest Neighbor OveRsampling Approach

Other Information

Funding

Open Access funding provided by the Qatar National Library.

History

Language

Publisher

Publication Year

License statement

Institution affiliated with

Usage metrics

Categories

Keywords

Licence

Exports