Research Article

MoCoUTRL: a momentum contrastive framework for unsupervised text representation learning

Article: 2221406 | Received 20 Feb 2023, Accepted 31 May 2023, Published online: 16 Jun 2023

Abstract

This paper presents MoCoUTRL: a Momentum Contrastive Framework for Unsupervised Text Representation Learning. The model improves on recently popular contrastive learning algorithms in natural language processing (NLP) in two respects. First, MoCoUTRL employs multi-granularity semantic contrastive learning objectives, enabling a more comprehensive understanding of the semantic features of samples. Second, MoCoUTRL uses a dynamic dictionary that serves as an approximate ground-truth representation for each token, providing pseudo labels for token-level contrastive learning. MoCoUTRL can turn pre-trained language models (PLMs) and even large language models (LLMs) into plug-and-play semantic feature extractors that can fuel multiple downstream tasks. Experimental results on several publicly available datasets, together with further theoretical analysis, validate the effectiveness and interpretability of the proposed method.
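The abstract gives no implementation details, but the framework's name suggests it builds on MoCo-style momentum contrast. As a rough illustrative sketch only (not the authors' code; all names, dimensions, and hyperparameters below are assumptions), the two core mechanics of such a framework are an exponential-moving-average (EMA) update of a key encoder and a queue-based InfoNCE loss, where the queue plays the role of a dynamic dictionary of negative keys:

    import torch
    import torch.nn.functional as F

    def momentum_update(query_encoder, key_encoder, m=0.999):
        """EMA update: the key encoder's weights slowly trail the
        query encoder's, keeping dictionary keys consistent."""
        for q_param, k_param in zip(query_encoder.parameters(),
                                    key_encoder.parameters()):
            k_param.data = m * k_param.data + (1.0 - m) * q_param.data

    def info_nce_loss(q, k, queue, temperature=0.07):
        """q, k: (batch, dim) embeddings of two views of the same texts;
        queue: (dim, K) dynamic dictionary of negative keys."""
        q = F.normalize(q, dim=1)
        k = F.normalize(k, dim=1)
        # Positive logits: similarity between matching query/key pairs.
        l_pos = torch.einsum("nd,nd->n", q, k).unsqueeze(-1)   # (batch, 1)
        # Negative logits: similarity against the dictionary entries.
        l_neg = torch.einsum("nd,dk->nk", q, queue)            # (batch, K)
        logits = torch.cat([l_pos, l_neg], dim=1) / temperature
        # The positive key sits at index 0 for every query.
        labels = torch.zeros(q.size(0), dtype=torch.long, device=q.device)
        return F.cross_entropy(logits, labels)

In this reading, the paper's token-level objective would apply a loss of this shape per token, with the dynamic dictionary supplying the pseudo labels; the sentence-level objective would apply it per sequence. The multi-granularity design described in the abstract presumably combines the two.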

Disclosure statement

No potential conflict of interest was reported by the author(s).

Additional information

Funding

This work was supported by Defense Industrial Technology Development Program: [Grant Number JCKY2020601B018].