Enhancing image style transfer for real-time indoor geometric data using GAN
Abstract
The need for two-dimensional (2D) simulated environments, such as those seen in VR/AR and the multiverse, has grown substantially in the rapidly emerging modern era. To reduce the need for human interaction, the present studies have focused on the automated transfer of image processing styles inside a 2D virtual world through the application of computer vision. However, there are several limits to the current research on 2D environment-style transfers that rely on modelling. One thing to keep in mind is that there is a significant amount of style image data required for training a style transfer network specifically for 2D simulations. All this information must be combined with perspectives that are fairly accurate representations of the internet. The second issue was that the 2D structures were inconsistent. Most of the research relies on 2D input image attributes and ignores 2D scene geometric data. Lastly, changing the way something looks does not change the fact that every object has its own distinct qualities. To address these issues, we propose an enhanced Generative Adversarial Networks (GAN) approach for image style transfer in which oversampling and fine tuning is used to improve the data loss. Image Style transfer is an approach to image-to-image translation that supports the transfer of the style of an image to a real-time image and is mainly used to enhance the resolution and quality of an image. The performance of the proposed approach is analyzed using the MIT dataset, which is retrieved from Kaggle. It is an open-source dataset that contains 67 indoor categories and has more than 15,000 images. The proposed approach is implemented using MATLAB simulation tool. The results of proposed approach show that, content loss and style loss are lower than other deep learning models such as VGG16 and Alex Net.
Keywords
Full Text:
PDFReferences
1. Goodfellow IJ. NIPS 2016 Tutorial: Generative Adversarial Networks. arXiv: 1701.00160v4, April 2017.
2. Iglesias G, Talavera E, Díaz-Álvarez A. A survey on GANs for computer vision: Recent research, analysis and taxonomy. Computer Science Review. 2023, 48: 100553. doi: 10.1016/j.cosrev.2023.100553
3. Srikanth T. Transfer learning using VGG-16 with Deep Convolutional Neural Network for Classifying Images. International Journal of Scientific and Research Publications (IJSRP), 2019.
4. Singh I, Goyal G, Chandel A. AlexNet architecture based convolutional neural network for toxic comments classification. Journal of King Saud University - Computer and Information Sciences. 2022, 34(9): 7547-7558. doi: 10.1016/j.jksuci.2022.06.007
5. Sudha V, T. R. Ganeshbabu Dr. A Convolutional Neural Network Classifier VGG-19 Architecture for Lesion Detection and Grading in Diabetic Retinopathy Based on Deep Learning. Computers, Materials & Continua. 2020, 66(1): 827-842. doi: 10.32604/cmc.2020.012008
6. Cîrneanu AL, Popescu D, Iordache D. New Trends in Emotion Recognition Using Image Analysis by Neural Networks, a Systematic Review. Sensors. 2023, 23(16): 7092. doi: 10.3390/s23167092
7. Jorge V, Joao A, Carlos M, Sandra J. Developments in Image Processing Using Deep Learning and Reinforcement Learning. Journal of Imaging. September 2023.
8. Aggarwal A, Mittal M, Battineni G. Generative adversarial network: An overview of theory and applications. International Journal of Information Management Data Insights. 2021, 1(1): 100004. doi: 10.1016/j.jjimei.2020.100004
9. Isola P, Zhu Y, Zhou T, Efros A. Image-to-Image Translation: Methods and Applications. arXiv:2101.08629v2 [cs.CV], July 2021.
10. Han X, Wu Y, Wan R. A Method for Style Transfer from Artistic Images Based on Depth Extraction Generative Adversarial Network. Applied Sciences. 2023, 13(2): 867. doi: 10.3390/app13020867
11. Salvaris M, Dean D, Tok WH. Generative Adversarial Networks. Deep Learning with Azure. Published online 2018: 187-208. doi: 10.1007/978-1-4842-3679-6_8
12. Zhu JY, Park T, Isola P, et al. Unpaired Image-to-Image Translation Using Cycle-Consistent Adversarial Networks. 2017 IEEE International Conference on Computer Vision (ICCV). Published online October 2017. doi: 10.1109/iccv.2017.244
13. Gulrajani I, Ahmed F, Arjovsky M. Improved Training of Wasserstein GANs. arXiv:1704.00028v3. 25 Dec 2017.
14. Karras T, Laine S, Aittala M, et al. Analyzing and Improving the Image Quality of StyleGAN. 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Published online June 2020. doi: 10.1109/cvpr42600.2020.00813
15. Han X, Wu Y, Wan R. A Method for Style Transfer from Artistic Images Based on Depth Extraction Generative Adversarial Network. Applied Sciences. MDPI. 2023.
16. Ulyanov D, Vedaldi A, Lempitsky V. Improved Texture Networks: Maximizing Quality and Diversity in Feed-Forward Stylization and Texture Synthesis. 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). Published online July 2017. doi: 10.1109/cvpr.2017.437
17. Liu H, Michelini PN, Zhu D. Artsy-GAN: A style transfer system with improved quality, diversity and performance. 2018 24th International Conference on Pattern Recognition (ICPR). Published online August 2018. doi: 10.1109/icpr.2018.8546172
18. Xu W, Long C, Wang R, et al. DRB-GAN: A Dynamic ResBlock Generative Adversarial Network for Artistic Style Transfer. 2021 IEEE/CVF International Conference on Computer Vision (ICCV). Published online October 2021. doi: 10.1109/iccv48922.2021.00632
19. Zheng Z, Liu J. P2—GAN: Efficient style transfer using single style image. arXiv:2001.07466v2. 30 January 2020.
20. Bing Z, Xin P, Seppe VB, Jin X. A GAN-based hybrid sampling method for imbalanced customer classification. Information Sciences. 2022, 609: 1397-1411.
21. Jiaxing H, Kaiwen C, Dayan G, Aoran X, Fangneng Z, Shijian L, Shengcai L, Eric X. Masked Generative Adversarial Networks are Data-Efficient Generation Learners”, 36th Conference on Neural Information Processing Systems (NeurIPS 2022).
22. Alireza A, Modjtaba R. Fine-tuned Generative Adversarial Network-based Model for Medical Image Super-Resolution”, Computer Vision and Pattern Recognition. arXiv:2211.00577. 2023.
DOI: https://doi.org/10.32629/jai.v7i3.1301
Refbacks
- There are currently no refbacks.
Copyright (c) 2024 Sudesh Kumari, Karan Kumar
License URL: https://creativecommons.org/licenses/by-nc/4.0/