Advanced Architectures of Autoencoder in Image Processing
The increasing complexity of modern datasets and pervasive noise necessitate the development of robust techniques for data processing, reconstruction, and denoising. This study presents four novel autoencoder architectures tailored to address these challenges in image processing. The ResoFocus Autoencoder introduces innovative architecture complemented by a refined loss function, specifically targeting the optimization and denoising of images at both 2x and 4x magnifications. Empirical evaluations on diverse datasets, including CelebA and Autism Face data, demonstrate substantial enhancements in Peak Signal-to-Noise Ratio (PSNR), highlighting the efficacy of these architectures in handling intricate data complexities and mitigating noise artifacts.
Moreover, the FragmentumZoom Autoencoder strategically focuses on denoising tasks by processing small image segments. This architecture complements the ResoFocus Autoencoder and contributes to the overall enhancement in image quality. Empirical evaluations on CelebA and Autism Face data further validate its effectiveness in noise reduction tasks.Another architecture, the DualEncoder SplitPath Autoencoder (DESPAE), presents a pioneering architectural framework for image reconstruction. DESPAE efficiently partitions input images into two distinct splits, each processed by dedicated encoder networks, and integrates the resulting latent spaces to facilitate efficient reconstruction. Experimental evaluations on CelebA and X-ray datasets substantiate its performance, with metrics such as Mean Squared Error (MSE) and PSNR validating its efficacy. Comparative analysis against prominent autoencoder architectures underscores the superior performance of DESPAE, thereby contributing significantly to the advancement of image reconstruction methodologies.
Additionally, the Split Spectrum Autoencoder Architecture uses Fourier transformation, low pass filtering, high pass filtering, and latent space integration within its encoder-decoder model. This architecture effectively dissects images into distinct frequency components, capturing intricate frequency-specific features. Employing two dedicated encoders specialized for low and high-frequency components, it integrates separate latent spaces to yield a comprehensive representation harmonizing both low and high-frequency attributes. Subsequent decoding employs reverse transformation techniques and advanced upscaling methods to meticulously reconstruct the original image, seamlessly incorporating refined frequency details from both domains.
Furthermore, the introduction of novel loss functions enriches the capabilities of these autoencoder architectures. A specialized loss function in denoising architectures embeds spatial context to enhance denoising and reconstruction capabilities, while the utilization of a composite reconstruction error with Frobenius norm ensures accurate reconstruction of composite images by quantifying differences between reconstructed and original images.These autoencoder architectures represent significant strides in image processing techniques, particularly in contexts requiring heightened resolution, noise reduction, and precise reconstruction. By adeptly addressing challenges posed by diverse datasets and using innovative architectural designs, this research catalyzes advancements in image processing methodologies.
History
Language
- English
Publication Year
- 2024
License statement
© The author. The author has granted HBKU and Qatar Foundation a non-exclusive, worldwide, perpetual, irrevocable, royalty-free license to reproduce, display and distribute the manuscript in whole or in part in any form to be posted in digital or print format and made available to the public at no charge. Unless otherwise specified in the copyright statement or the metadata, all rights are reserved by the copyright holder. For permission to reuse content, please contact the author.Institution affiliated with
- Hamad Bin Khalifa University
- College of Science and Engineering - HBKU
Degree Date
- 2024
Degree Type
- Master's