Indexed by:
Abstract:
This article presents a new text-to-image (T2I) generation model, the distribution regularization generative adversarial network (DR-GAN), which generates images from text descriptions via improved distribution learning. DR-GAN introduces two novel modules: a semantic disentangling module (SDM) and a distribution normalization module (DNM). The SDM combines a spatial self-attention mechanism (SSAM) with a new semantic disentangling loss (SDL) to help the generator distill key semantic information for image generation. The DNM uses a variational auto-encoder (VAE) to normalize and denoise the latent image distribution, which helps the discriminator better distinguish synthesized images from real ones. The DNM also adopts a distribution adversarial loss (DAL) to guide the generator toward the normalized real-image distribution in the latent space. Extensive experiments on two public datasets demonstrate that DR-GAN achieves competitive performance on the T2I task. Code: https://github.com/Tan-H-C/DR-GAN-Distribution-Regularization-for-Text-to-Image-Generation.
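To make the abstract's two modules concrete, below is a minimal PyTorch sketch of the two building blocks it names: a spatial self-attention block (the kind of SSAM used inside the SDM) and a VAE-style encoder that maps features to a normalized latent distribution (the core of the DNM). This is an illustrative assumption on our part, not the authors' implementation; all class names, layer widths, and wiring here are hypothetical, and the real code is in the linked repository.

# Hypothetical sketch of the abstract's two components; not the authors' code.
import torch
import torch.nn as nn
import torch.nn.functional as F

class SpatialSelfAttention(nn.Module):
    """Non-local-style self-attention over the spatial positions of a feature map."""
    def __init__(self, channels: int):
        super().__init__()
        self.query = nn.Conv2d(channels, channels // 8, kernel_size=1)
        self.key = nn.Conv2d(channels, channels // 8, kernel_size=1)
        self.value = nn.Conv2d(channels, channels, kernel_size=1)
        self.gamma = nn.Parameter(torch.zeros(1))  # learned residual weight

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        b, c, h, w = x.shape
        q = self.query(x).flatten(2).transpose(1, 2)   # (B, HW, C//8)
        k = self.key(x).flatten(2)                     # (B, C//8, HW)
        attn = F.softmax(torch.bmm(q, k), dim=-1)      # (B, HW, HW) position affinities
        v = self.value(x).flatten(2)                   # (B, C, HW)
        out = torch.bmm(v, attn.transpose(1, 2)).view(b, c, h, w)
        return self.gamma * out + x                    # residual connection

class LatentNormalizer(nn.Module):
    """VAE-style encoder: maps image features to a Gaussian latent whose
    KL term regularizes (denoises) the latent distribution."""
    def __init__(self, feat_dim: int, latent_dim: int):
        super().__init__()
        self.mu = nn.Linear(feat_dim, latent_dim)
        self.logvar = nn.Linear(feat_dim, latent_dim)

    def forward(self, feat: torch.Tensor):
        mu, logvar = self.mu(feat), self.logvar(feat)
        z = mu + torch.randn_like(mu) * torch.exp(0.5 * logvar)  # reparameterization trick
        kl = -0.5 * torch.mean(1 + logvar - mu.pow(2) - logvar.exp())
        return z, kl

# Example usage with toy shapes:
# attn = SpatialSelfAttention(64); y = attn(torch.randn(2, 64, 16, 16))
# norm = LatentNormalizer(512, 128); z, kl = norm(torch.randn(2, 512))

In a DR-GAN-like setup, the normalized latent z would feed the discriminator, and an adversarial loss on z (the abstract's DAL) would push the generator's latent distribution toward the normalized real-image one.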
Keywords:
Corresponding author:
Source:
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS
ISSN: 2162-237X
Year: 2022
Issue: 12
Volume: 34
Pages: 10309-10323
Impact Factor: 10.4 (JCR@2022)
ESI Field: COMPUTER SCIENCE
ESI Highly Cited Threshold: 46
JCR Quartile: Q1
CAS Journal Ranking: Tier 1
Affiliated Department: