Skip to content

[NeurIPS2023] Make Your Pre-trained Model Reversible: From Parameter to Memory Efficient Fine-Tuning

Notifications You must be signed in to change notification settings

Smu-Tan/meft_decoder_only

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

MEFT for Decoder-only Model

The repository of the Decoder-only Model Part for paper Make Your Pre-trained Model Reversible: From Parameter to Memory Efficient Fine-Tuning

Code for the other tasks such as Encoder-Only models can be found at here.

Features

  • OPT on Question-Answering

Installation

conda create -n mefts python=3.8
conda activate mefts
pip install torch==1.13.1+cu117 torchvision==0.14.1+cu117 torchaudio==0.13.1 --extra-index-url https://download.pytorch.org/whl/cu117
pip install -r requirements.txt

Fine-Tuning

Run Experiments with OPT

  • Edit the #TODO places in scripts/run.sh
  • Run as
    bash scripts/run.sh
  • openbookqa_test.sh is an example of how to run the openbook_qa task.

Citation

If you find our work or code useful, please cite as:

@misc{liao2023make,
    title={Make Your Pre-trained Model Reversible: From Parameter to Memory Efficient Fine-Tuning}, 
    author={Baohao Liao and Shaomu Tan and Christof Monz},
    year={2023},
    eprint={2306.00477},
    archivePrefix={arXiv},
    primaryClass={cs.CL}
}

About

[NeurIPS2023] Make Your Pre-trained Model Reversible: From Parameter to Memory Efficient Fine-Tuning

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published