Optica Publishing Group
Browse

Image-to-image machine translation enables computational defogging in real-world images

Version 2 2024-09-04, 14:20
Version 1 2024-09-04, 14:19
Posted on 2024-09-04 - 14:20
This paper addresses the challenge of computational defogging using image-to-image (I2I) machine learning models trained on real-world data. We introduce Stereofog, the largest and most diverse dataset to date, comprising 10, 067 paired clear-foggy images captured with a custom-built binocular camera setup. By training a pix2pix I2I model on this dataset, we achieve a Complex Wavelet Structural Similarity Index (CW-SSIM) of 0.76, Multi-scale Structural Similarity Index (MS-SSIM) of 0.7, and Pearson correlation coefficient of 0.4 for defogged images, demonstrating significant improvements in defogging efficacy compared to models trained on synthetic data. The model maintains high performance with a CW-SSIM of 0.95 for low fog density and 0.8 for real data, though it drops to 0.5 at very high fog densities. These results underscore the model’s ability to produce plausible reconstructions under varying fog conditions. This study advances the field by providing a robust, open-source dataset, and demonstrating the practical applicability of open-sourced I2I machine learning models for real-world computational defogging.

CITE THIS COLLECTION

DataCite
3 Biotech
3D Printing in Medicine
3D Research
3D-Printed Materials and Systems
4OR
AAPG Bulletin
AAPS Open
AAPS PharmSciTech
Abhandlungen aus dem Mathematischen Seminar der Universität Hamburg
ABI Technik (German)
Academic Medicine
Academic Pediatrics
Academic Psychiatry
Academic Questions
Academy of Management Discoveries
Academy of Management Journal
Academy of Management Learning and Education
Academy of Management Perspectives
Academy of Management Proceedings
Academy of Management Review
or
Select your citation style and then place your mouse over the citation text to select it.

SHARE

email
need help?