• Journal of the China Computer Federation (CCF)
  • China Science and Technology Core Journal
  • Chinese Core Journal

Computer Engineering & Science ›› 2022, Vol. 44 ›› Issue (05): 855-861.

• Graphics and Images •

Discrimination-enhanced generative adversarial network in text-to-image generation

TAN Hong-chen1, HUANG Shi-hua2, XIAO He-wen3, YU Bing-bing3, LIU Xiu-ping3

  (1. School of Artificial Intelligence and Automation, Beijing University of Technology, Beijing 100124, China;
    2. Department of Computer Science, The Hong Kong Polytechnic University, Hong Kong 999077, China;
    3. School of Mathematical Sciences, Dalian University of Technology, Dalian 116024, China)
  • Received: 2021-11-11  Revised: 2022-01-07  Accepted: 2022-05-25  Online: 2022-05-25  Published: 2022-05-24

Abstract: Most current text-to-image generation algorithms based on Generative Adversarial Networks (GANs) focus on designing attention-based generation models to improve the characterization and expression of image details. However, they ignore the discriminator's perception of key local semantics, so the generation model can easily produce poor image details that "fool" the discriminator. This paper designs a vocabulary-image discriminative attention module inside the discriminator to enhance its ability to perceive and capture key semantics, and thereby to drive the generation model to produce high-quality image details. On this basis, a discrimination-enhanced generative adversarial network (DE-GAN) is proposed. Experimental results show that, on the CUB-Bird dataset, DE-GAN achieves an Inception Score (IS) of 4.70, 4.2% higher than the baseline model, demonstrating its high performance.
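The abstract does not give the module's exact formulation, but the general idea of word-image attention in a discriminator — letting each image region attend over the word embeddings of the caption and scoring how well the region matches its attended text context — can be sketched as follows. This is a minimal NumPy illustration under stated assumptions; the function names, feature dimensions, and the cosine-similarity match score are hypothetical, not DE-GAN's actual design:

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax along the given axis.
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def word_image_attention(word_feats, region_feats):
    """Hypothetical word-image discriminative attention sketch.

    word_feats:   (T, D) word embeddings of the caption.
    region_feats: (N, D) local image-region features from the discriminator.
    Returns a per-region text-image consistency score in [-1, 1].
    """
    # Each region attends over all words (dot-product attention).
    scores = region_feats @ word_feats.T          # (N, T)
    attn = softmax(scores, axis=1)                # rows sum to 1
    context = attn @ word_feats                   # (N, D) text context per region
    # Match score: cosine similarity between a region and its text context.
    num = (region_feats * context).sum(axis=1)
    den = (np.linalg.norm(region_feats, axis=1)
           * np.linalg.norm(context, axis=1) + 1e-8)
    return num / den

# Toy example: 5 caption words, 9 image regions (e.g. a 3x3 grid), D = 16.
rng = np.random.default_rng(0)
words = rng.standard_normal((5, 16))
regions = rng.standard_normal((9, 16))
match = word_image_attention(words, regions)      # shape (9,)
```

In a real discriminator these per-region scores would feed into the adversarial loss, penalizing image regions that are inconsistent with the key words of the caption — which is the "enhanced perception of key local semantics" the abstract describes.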

Key words: text-to-image generation, generative adversarial network, attention mechanism, discrimination model