Proposal of an image generation model using cGANs for sketching faces

Authors

  • Nguyen Phat Huu, Hanoi University of Science and Technology (HUST)
  • Nguyet Giap Thi, Hanoi University of Science and Technology (HUST)

DOI:

https://doi.org/10.26555/jifo.v15i2.a20576

Keywords:

GANs, cGANs, CNN, Sketching faces, Image processing

Abstract

The transformation of sketches into realistic images of human faces has an important application in criminal investigation, where suspects must be identified from the depictions of witnesses. However, because a hand-drawn sketch and a real face photograph differ in both detail and color, converting one into the other is challenging and time-consuming. To solve this problem, we propose an image generation model that uses a conditional generative adversarial network with an autoencoder (cGANs-AE) to generate synthetic samples for variable-length, multi-feature sequence datasets. The goal of the model is to learn an encoding of the dataset that reduces its vector size; from this reduced-dimension vector, the autoencoder must recreate an image similar to the original. The autoencoder therefore aims to reproduce its input at its output while focusing only on the essential features. Passing raw sketches through the cGAN then produces realistic images, making the sketch-to-photo transformation quick and easy. The results show that the model achieves an accuracy of up to 75% and a PSNR of 25.5 dB with only 606 face images, making it potentially applicable in practice. The performance of the proposed architecture is compared with other solutions, and the results show that our proposal obtains competitive performance in terms of output quality (25.5 dB) and efficiency (above 75%).
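To make the two quantitative ingredients of the abstract concrete, the sketch below shows (a) a minimal autoencoder whose bottleneck layer plays the role of the reduced-dimension vector from which the image is reconstructed, and (b) the standard PSNR computation behind figures such as 25.5 dB. This is an illustrative sketch only, not the paper's architecture: the function names, the 128-unit latent size, the flattened 64×64 RGB input, and the 8-bit pixel range are all assumptions.

    import numpy as np
    import tensorflow as tf

    # Minimal autoencoder sketch (NOT the paper's architecture): the Dense
    # bottleneck is the reduced-dimension vector from which the decoder
    # must reconstruct an image close to the original input.
    def build_autoencoder(input_dim: int = 64 * 64 * 3,
                          latent_dim: int = 128) -> tf.keras.Model:
        inputs = tf.keras.Input(shape=(input_dim,))
        latent = tf.keras.layers.Dense(latent_dim, activation="relu")(inputs)    # encoder / bottleneck
        outputs = tf.keras.layers.Dense(input_dim, activation="sigmoid")(latent)  # decoder
        model = tf.keras.Model(inputs, outputs)
        model.compile(optimizer="adam", loss="mse")  # reconstruction objective: output ≈ input
        return model

    # Standard peak signal-to-noise ratio between a ground-truth photo and
    # a generated image, assuming 8-bit pixels (max value 255).
    def psnr(original: np.ndarray, generated: np.ndarray,
             max_val: float = 255.0) -> float:
        mse = np.mean((original.astype(np.float64) -
                       generated.astype(np.float64)) ** 2)
        if mse == 0:
            return float("inf")  # identical images
        return 10.0 * np.log10(max_val ** 2 / mse)

For scale, at an 8-bit pixel range a PSNR of 25.5 dB corresponds to a mean squared error of roughly 183 per pixel channel, which indicates how close the generated faces are to the ground-truth photographs.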

References

Y. Jo and J. Park, “SC-FEGAN: Face Editing Generative Adversarial Network With User’s Sketch and Color,” 2019 IEEE/CVF Int. Conf. Comput. Vis., pp. 1745–1753, Oct. 2019, doi: 10.1109/ICCV.2019.00183.

I. J. Goodfellow et al., “Generative Adversarial Nets,” Adv. Neural Inf. Process. Syst., pp. 2672–2680, Jun. 2014, doi: 10.5555/2969033.2969125.

C.-H. Lee, Z. Liu, L. Wu, and P. Luo, “MaskGAN: Towards Diverse and Interactive Facial Image Manipulation,” 2020 IEEE/CVF Conf. Comput. Vis. Pattern Recognit., pp. 5548–5557, Jun. 2020, doi: 10.1109/CVPR42600.2020.00559.

M. Wang et al., “Example-Guided Style-Consistent Image Synthesis From Semantic Labeling,” 2019 IEEE/CVF Conf. Comput. Vis. Pattern Recognit., pp. 1495–1504, Jun. 2019, doi: 10.1109/CVPR.2019.00159.

D. Pathak, P. Krahenbuhl, J. Donahue, T. Darrell, and A. A. Efros, “Context Encoders: Feature Learning by Inpainting,” 2016 IEEE Conf. Comput. Vis. Pattern Recognit., pp. 2536–2544, Jun. 2016, doi: 10.1109/CVPR.2016.278.

P. Isola, J.-Y. Zhu, T. Zhou, and A. A. Efros, “Image-to-Image Translation with Conditional Adversarial Networks,” 2017 IEEE Conf. Comput. Vis. Pattern Recognit., pp. 5967–5976, Jul. 2017, doi: 10.1109/CVPR.2017.632.

T.-C. Wang, M.-Y. Liu, J.-Y. Zhu, A. Tao, J. Kautz, and B. Catanzaro, “High-Resolution Image Synthesis and Semantic Manipulation with Conditional GANs,” 2018 IEEE/CVF Conf. Comput. Vis. Pattern Recognit., pp. 8798–8807, Jun. 2018, doi: 10.1109/CVPR.2018.00917.

J.-Y. Zhu, T. Park, P. Isola, and A. A. Efros, “Unpaired Image-to-Image Translation Using Cycle-Consistent Adversarial Networks,” 2017 IEEE Int. Conf. Comput. Vis., pp. 2242–2251, Oct. 2017, doi: 10.1109/ICCV.2017.244.

D. Wu and Q. Dai, “Sketch realizing: lifelike portrait synthesis from sketch,” Proc. 2009 Comput. Graph. Int. Conf., pp. 13–20, 2009, doi: 10.1145/1629739.1629741.

H. V. Dinh, “Building database of human to apply to the portrait of the criminal through descriptions of witnesses and victims,” Quang Ninh province police, 2017. [Online]. Available: http://cstc.cand.com.vn.

S. A. Israel et al., “Generative Adversarial Networks for Classification,” 2017 IEEE Appl. Imag. Pattern Recognit. Work., pp. 1–4, Oct. 2017, doi: 10.1109/AIPR.2017.8457952.

L. Gonog and Y. Zhou, “A Review: Generative Adversarial Networks,” 2019 14th IEEE Conf. Ind. Electron. Appl., pp. 505–510, Jun. 2019, doi: 10.1109/ICIEA.2019.8833686.

Y.-J. Cao et al., “Recent Advances of Generative Adversarial Networks in Computer Vision,” IEEE Access, vol. 7, pp. 14985–15006, 2019, doi: 10.1109/ACCESS.2018.2886814.

M. A. Souibgui and Y. Kessentini, “DE-GAN: A Conditional Generative Adversarial Network for Document Enhancement,” IEEE Trans. Pattern Anal. Mach. Intell., pp. 1–1, 2021, doi: 10.1109/TPAMI.2020.3022406.

J. Wang, X. Li, and J. Yang, “Stacked Conditional Generative Adversarial Networks for Jointly Learning Shadow Detection and Shadow Removal,” 2018 IEEE/CVF Conf. Comput. Vis. Pattern Recognit., pp. 1788–1797, Jun. 2018, doi: 10.1109/CVPR.2018.00192.

S. Kim and D. Y. Suh, “Recursive Conditional Generative Adversarial Networks for Video Transformation,” IEEE Access, vol. 7, pp. 37807–37821, 2019, doi: 10.1109/ACCESS.2019.2906472.

N. Hubens, “Deep inside: Autoencoders,” 2018. [Online]. Available: https://towardsdatascience.com/deep-inside-autoencoders-7e41f319999f.

Q. P. Nguyen, K. W. Lim, D. M. Divakaran, K. H. Low, and M. C. Chan, “GEE: A Gradient-based Explainable Variational Autoencoder for Network Anomaly Detection,” 2019 IEEE Conf. Commun. Netw. Secur., pp. 91–99, Jun. 2019, doi: 10.1109/CNS.2019.8802833.

J. Xue, P. P. K. Chan, and X. Hu, “Experimental study on stacked autoencoder on insufficient training samples,” 2017 Int. Conf. Wavelet Anal. Pattern Recognit., pp. 223–229, Jul. 2017, doi: 10.1109/ICWAPR.2017.8076693.

A. Deshpande, J. Lu, M.-C. Yeh, M. J. Chong, and D. Forsyth, “Learning Diverse Image Colorization,” 2017 IEEE Conf. Comput. Vis. Pattern Recognit., pp. 2877–2885, Jul. 2017, doi: 10.1109/CVPR.2017.307.

R. Tyleček, “The CMP Facade Database,” Res. Rep. CTU–CMP–2012–24, Czech Tech. Univ. Prague, pp. 1–8, 2013. [Online]. Available: https://cmp.felk.cvut.cz/~tylecr1/facade/CMP_facade_DB_2013.pdf.

A. Martinez and R. Benavente, “The AR Face Database,” CVC Tech. Rep. 24, 1998.

K. Messer, J. Matas, J. Kittler, J. Luettin, and G. Maitre, “XM2VTSDB: The extended M2VTS database,” Second Int. Conf. Audio Video-Based Biometric Pers. Authentication, vol. 964, pp. 965–966, 1999.

A. Radford, L. Metz, and S. Chintala, “Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks,” 4th Int. Conf. Learn. Represent. (ICLR 2016), San Juan, Puerto Rico, May 2016. [Online]. Available: http://arxiv.org/abs/1511.06434.

Q. Chen and V. Koltun, “Photographic Image Synthesis with Cascaded Refinement Networks,” 2017 IEEE Int. Conf. Comput. Vis., pp. 1520–1529, Oct. 2017, doi: 10.1109/ICCV.2017.168.

K. Shmelkov, C. Schmid, and K. Alahari, “How Good Is My GAN?,” Comput. Vis. – ECCV 2018, Lect. Notes Comput. Sci., vol. 11206, pp. 218–234, 2018, doi: 10.1007/978-3-030-01216-8_14.

S. Gu, J. Bao, H. Yang, D. Chen, F. Wen, and L. Yuan, “Mask-Guided Portrait Editing With Conditional GANs,” 2019 IEEE/CVF Conf. Comput. Vis. Pattern Recognit., pp. 3431–3440, Jun. 2019, doi: 10.1109/CVPR.2019.00355.

V. Carvalho, F. Soares, and R. Vasconcelos, “Artificial intelligence and image processing based techniques: A tool for yarns parameterization and fabrics prediction,” 2009 IEEE Conf. Emerg. Technol. Fact. Autom., pp. 1–4, Sep. 2009, doi: 10.1109/ETFA.2009.5347255.

P. N. Huu, T. Tran Van, and N. G. Thi, “Proposing distortion compensation algorithm for determining distance using two cameras,” 2019 6th NAFOSTED Conf. Inf. Comput. Sci., pp. 172–177, Dec. 2019, doi: 10.1109/NICS48868.2019.9023875.

P. N. Huu, V. Tran-Quang, and T. Miyoshi, “Energy threshold adaptation algorithms on image compression to prolong WSN lifetime,” 2010 7th Int. Symp. Wirel. Commun. Syst., pp. 834–838, Sep. 2010, doi: 10.1109/ISWCS.2010.5624318.

S. S. Kumar, F. Taheri, and M. R. Islam, “Artificial Intelligence and Image Processing Approaches in Damage Assessment and Material Evaluation,” Int. Conf. Comput. Intell. Model. Control Autom. and Int. Conf. Intell. Agents, Web Technol. Internet Commer., vol. 1, pp. 307–313, 2005, doi: 10.1109/CIMCA.2005.1631284.

S. Shukla, A. Lakhmani, and A. K. Agarwal, “Approaches of artificial intelligence in biomedical image processing: A leading tool between computer vision & biological vision,” 2016 Int. Conf. Adv. Comput. Commun. Autom., pp. 1–6, Apr. 2016, doi: 10.1109/ICACCA.2016.7578900.

A. K. Rathinam, Y. Lee, D. N. C. Ling, and R. Singh, “A review of image processing leading to artificial intelligence methods to detect instruments in ultrasound guided minimally invasive surgical procedures,” 2017 IEEE Int. Conf. Power, Control. Signals Instrum. Eng., pp. 3074–3079, Sep. 2017, doi: 10.1109/ICPCSI.2017.8392290.

X. Jia, “Image recognition method based on deep learning,” 2017 29th Chinese Control Decis. Conf., pp. 4730–4735, May 2017, doi: 10.1109/CCDC.2017.7979332.

J. Ruili, W. Haocong, W. Han, E. O’Connell, and S. McGrath, “Smart Parking System Using Image Processing and Artificial Intelligence,” 2018 12th Int. Conf. Sens. Technol., pp. 232–235, Dec. 2018, doi: 10.1109/ICSensT.2018.8603590.

W. Chao, L. Chang, X. Wang, J. Cheng, X. Deng, and F. Duan, “High-Fidelity Face Sketch-To-Photo Synthesis Using Generative Adversarial Network,” 2019 IEEE Int. Conf. Image Process., pp. 4699–4703, Sep. 2019, doi: 10.1109/ICIP.2019.8803549.

Published

2021-05-07

Section

Articles