Please cite the paper if this repository is useful for you.
@inproceedings{shen-etal-2024-pruning,
title = "Pruning Large Language Models to Intra-module Low-rank Architecture with Transitional Activations",
author = "Shen, Bowen and
Lin, Zheng and
Zha, Daren and
Liu, Wei and
Luan, Jian and
Wang, Bin and
Wang, Weiping",
booktitle = "Findings of the Association for Computational Linguistics ACL 2024",
year = "2024",
publisher = "Association for Computational Linguistics",
url = "https://aclanthology.org/2024.findings-acl.582",
doi = "10.18653/v1/2024.findings-acl.582",
pages = "9781--9793",
}
TransAct
This repository is the official implementation of the ACL 2024 paper Pruning Large Language Models to Intra-module Low-rank Architecture with Transitional Activations.
Training and Evaluation
transact.dockerfile.train_config.yamlanddeepspeed.json.run_trainer.sh, for example Runbash run_trainer.sh -hfor help.eval.sh, for example Runbash eval.sh -hfor help.Citations
Please cite the paper if this repository is useful for you.