Parameter-Efficient Transfer Learning for NLP

This repository contains a paper implementation for "Parameter-Efficient Transfer Learning for NLP". The implementation uses pytorch-lightning and huggingface transformers. Currently the runs were performed for the 'CoLA' dataset from the GLUE benchmark. The authors of this paper have proposed an adapter module for tranfer learning which is parameter efficient as compared to full finetuning. For demonstration, the BERT model has been used. Here is the overview of adapter module:

All experiments run can be found here: wandb/adapter-bert

Setup

Clone repository

git clone https://github.com/cs-mshah/Adapter-Bert.git

Environment

conda env create -f environment.yml
conda activate adapter
cd adapter-bert

Structure

.
├── adapter-bert            
│   ├── cfgs                   # configurations to run
│   ├── config.py              # default configuration
│   ├── dataset.py             # LightningDataModule for GLUE tasks
│   ├── train.py              # main file for training
│   ├── model                  # model architectures
│       ├── bert.py            # modifications to huggingface/transformers/bert
│       ├── adapter.py         # adapter module
│       └── model.py           # LightningModule to train
├── assets                     # figures/outputs
├── environment.yml            # environment configuration
├── .gitignore                 # ignore files that cannot commit to Git
├── README.md                  # project description
├── LICENSE                    # Apache 2.0 License

Training

yacs is used as the configuration system and wandb for logging. You can change the configuration in config.py as it will get imported in train.py for training. For training run:

python train.py

# use saved config
python train.py --config cfgs/adapter.yaml

python train.py --help
usage: train.py [-h] [--config CONFIG]

Adapter-Bert

optional arguments:
  -h, --help       show this help message and exit
  --config CONFIG  path to yaml config

References

lightining examples: text transformers
krypticmouse/Adapter-BERT
huggingface/transformers

Citation

Official paper citation:

@inproceedings{houlsby2019parameter,
  title = {Parameter-Efficient Transfer Learning for {NLP}},
  author = {Houlsby, Neil and Giurgiu, Andrei and Jastrzebski, Stanislaw and Morrone, Bruna and De Laroussilhe, Quentin and Gesmundo, Andrea and Attariyan, Mona and Gelly, Sylvain},
  booktitle = {Proceedings of the 36th International Conference on Machine Learning},
  year = {2019},
}

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Parameter-Efficient Transfer Learning for NLP

Setup

Clone repository

Environment

Structure

Training

References

Citation

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
adapter-bert		adapter-bert
assets		assets
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
environment.yml		environment.yml

License

cs-mshah/Adapter-Bert

Folders and files

Latest commit

History

Repository files navigation

Parameter-Efficient Transfer Learning for NLP

Setup

Clone repository

Environment

Structure

Training

References

Citation

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages