MixCSE_AAAI2022

A PyTorch implementation for our paper "Unsupervised Sentence Representation via Contrastive Learning with Mixing Negatives".

You can download the paper from here.

Abstract

Unsupervised sentence representation learning is a fundamental problem in natural language processing. Recently, contrastive learning has made great success on this task. Existing constrastive learning based models usually apply random sampling to select negative examples for training. Previous work in computer vision has shown that hard negative examples help contrastive learning to achieve faster convergency and better optimization for representation learning. However, the importance of hard negatives in contrastive learning for sentence representation is yet to be explored. In this study, we prove that hard negatives are essential for main�taining strong gradient signals in the training process while random sampling negative examples is ineffective for sentence representation. Accordingly, we present a contrastive model, MixCSE, that extends the current state-of-the-art SimCSE by continually constructing hard negatives via mixing both positive and negative features. The superior performance of the proposed approach is demonstrated via empirical studies on Semantic Textual Similarity datasets and Transfer task datasets

Requirement

Python = 3.7
torch = 1.11.0
numpy = 1.17.2
transformers = 4.19.2

train

bash run_unsup_example.sh

evaluate

python evaluation.py \
    --model_name_or_path trained_model \
    --pooler cls \
    --task_set sts \
    --mode test

Citation

If this work is helpful, please cite as:

@article{zhang2022unsupervised,
  title={Unsupervised Sentence Representation via Contrastive Learning with Mixing Negatives},
  author={Zhang, Yanzhao and Zhang, Richong and Mensah, Samuel and Liu, Xudong and Mao, Yongyi},
  year={2022}
}

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
mixcse		mixcse
README.md		README.md
evaluation.py		evaluation.py
run_unsup_example.sh		run_unsup_example.sh
simcse_to_huggingface.py		simcse_to_huggingface.py
train.py		train.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

MixCSE_AAAI2022

Abstract

Requirement

train

evaluate

Citation

License

About

Releases

Packages

Languages

BDBC-KG-NLP/MixCSE_AAAI2022

Folders and files

Latest commit

History

Repository files navigation

MixCSE_AAAI2022

Abstract

Requirement

train

evaluate

Citation

License

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages