The detection of distributional discrepancy for language GANs

Xingyuan Chena Department of Computer Science and Technology, Nanjing University, Nanjing, People's Republic of ChinaView further author information

Peng Jinb School of Electronic Engineering and Artificial Intelligence, Leshan Normal University, Leshan, People's Republic of China

https://orcid.org/0000-0002-4835-0312 View further author information

Ping Caic School of Information Science and Technology, Southwest Jiaotong University, Chengdu, People's Republic of ChinaView further author information

Hongjun Wangc School of Information Science and Technology, Southwest Jiaotong University, Chengdu, People's Republic of ChinaView further author information

Xinyu Daia Department of Computer Science and Technology, Nanjing University, Nanjing, People's Republic of ChinaCorrespondence[email protected] [email protected]
View further author information

Jiajun Chena Department of Computer Science and Technology, Nanjing University, Nanjing, People's Republic of ChinaView further author information

ABSTRACT

A pre-trained neural language model (LM) is usually used to generate texts. Due to exposure bias, the generated text is not as good as real text. Many researchers claimed they employed the Generative Adversarial Nets (GAN) to alleviate this issue by feeding reward signals from a discriminator to update the LM (generator). However, some researchers argued that GAN did not work by evaluating the generated texts with a quality-diversity metric such as Bleu versus self-Bleu, and language model score versus reverse language model score. Unfortunately, these two-dimension metrics are not reliable. Furthermore, the existing methods only assessed the final generated texts, thus neglecting the dynamic evaluating the adversarial learning process. Different from the above-mentioned methods, we adopted the most recent metric functions, which measure the distributional discrepancy between real and generated text. Besides that, we design a comprehensive experiment to investigate the performance during the learning process. First, we evaluate a language model with two functions and identify a large discrepancy. Then, several methods with the detected discrepancy signal to improve the generator were tried. Experimenting with two language GANs on two benchmark datasets, we found that the distributional discrepancy increases with more adversarial learning rounds. Our research provides convicted evidence that the language GANs fail.

KEYWORDS:

Acknowledgments

We thank the anonymous reviewers for their valuable comments.

Disclosure statement

No potential conflict of interest was reported by the author(s).

Notes

1 An exception is RelGAN which does not need to pre-train D.

2 http://cocodataset.org/.

3 http://www.statmt.org/wmt17/.

4 According to Section 3, we sample generated instances as much as test instances.

Additional information

Funding

This work was supported by the National Natural Science Foundation of China [grant numbers 61936012, 61976114, 81373056] and the National Key Research and Development Program of China [grant number 2018YFB1005102].

The detection of distributional discrepancy for language GANs

Information for

Open access

Opportunities

Help and information

The detection of distributional discrepancy for language GANs

ABSTRACT

Acknowledgments

Disclosure statement

Notes

Additional information

Funding

Related research

To cite this article:

Download citation

Information for

Open access

Opportunities

Help and information

Keep up to date

Your download is now in progress and you may close this window

Login or register to access this feature