Authors:

Luo, Dan | Zheng, Kangfeng | Wu, Chunhua | Wang, Xiujuan | Wang, Jvjie

Indexed in:

EI; Scopus; SCIE

Abstract:

Despite their potential, the industrial deployment of large language models (LLMs) is constrained by traditional fine-tuning procedures that are both resource-intensive and time-consuming. Low-Rank Adaptation (LoRA) has emerged as a pioneering methodology for addressing these challenges. By integrating low-rank decomposition matrices into network weights to reduce trainable parameters, LoRA effectively accelerates the adaptation process. While research on LoRA primarily focuses on adjusting the low-rank matrices, DyLoRA optimizes the rank-setting mechanism to avoid the extensive effort of training and searching over rank sizes. However, DyLoRA's rank-configuration mechanism has its own limitations. First, DyLoRA sets the same rank size for all low-rank adaptation layers at each time step. Given that layers at different depths contain distinct information, they should have varying rank values to accurately capture their unique characteristics. Second, the truncation phase that orders representations according to nested dropout regularization is only semi-dynamic, always dropping tail units and thereby limiting the information it can access. In this work, we propose a novel technique, enhanced range adaptation in time- and depth-aware dynamic LoRA (ERAT-DLoRA), to address these problems. ERAT-DLoRA introduces a dynamic range to the truncation phase, making it fully dynamic. Additionally, we design a time- and layer-aware dynamic rank to ensure appropriate adjustments at different time steps and layer depths. We evaluate our solution on natural language understanding and generation tasks, and extensive evaluation results demonstrate the effectiveness of the proposed method.
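
To make the mechanism in the abstract concrete, the following is a minimal PyTorch sketch, under assumptions, of the two ideas described above: a LoRA adapter whose truncation window can be an arbitrary (lo, hi) range of rank components (rather than nested dropout's tail-only prefix truncation), plus a toy layer-aware rank schedule. It is not the authors' implementation; the names DynamicLoRALinear, rank_range, and rank_for_layer are hypothetical.

    import torch
    import torch.nn as nn

    class DynamicLoRALinear(nn.Module):
        """Frozen linear layer plus a LoRA adapter (delta_W = B @ A) whose
        active rank components can be restricted to an arbitrary window."""

        def __init__(self, in_features, out_features, max_rank=8, alpha=16.0):
            super().__init__()
            # Pretrained weight stays frozen; only A and B are trainable.
            self.weight = nn.Parameter(torch.randn(out_features, in_features),
                                       requires_grad=False)
            self.lora_A = nn.Parameter(torch.randn(max_rank, in_features) * 0.01)
            self.lora_B = nn.Parameter(torch.zeros(out_features, max_rank))
            self.scaling = alpha / max_rank
            self.max_rank = max_rank

        def forward(self, x, rank_range=None):
            # Nested dropout keeps only a prefix (0, r), i.e. always drops tail
            # units; allowing any (lo, hi) window makes truncation fully dynamic.
            lo, hi = rank_range if rank_range is not None else (0, self.max_rank)
            delta_w = self.lora_B[:, lo:hi] @ self.lora_A[lo:hi, :]
            return x @ (self.weight + self.scaling * delta_w).T

    def rank_for_layer(layer_idx, num_layers, max_rank):
        # Toy depth-aware schedule: deeper layers get a larger rank budget.
        return max(1, round(max_rank * (layer_idx + 1) / num_layers))

    # Per training step: sample a rank window for each layer.
    layers = [DynamicLoRALinear(64, 64, max_rank=8) for _ in range(4)]
    x = torch.randn(2, 64)
    for i, layer in enumerate(layers):
        r = rank_for_layer(i, len(layers), max_rank=8)
        lo = int(torch.randint(0, 8 - r + 1, ()))  # random window start
        x = layer(x, rank_range=(lo, lo + r))
    print(x.shape)  # torch.Size([2, 64])

Sampling the window start at random each step is one plausible reading of a "fully dynamic" truncation phase; the paper's actual scheduling of ranks over time steps and layers may differ.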

Keywords:

LoRA; Fine-tuning; Parameter-efficient

Author affiliations:

  • [ 1 ] [Luo, Dan]Beijing Univ Posts & Telecommun, 10 Xitu Cheng Rd, Beijing 100876, Peoples R China
  • [ 2 ] [Zheng, Kangfeng]Beijing Univ Posts & Telecommun, 10 Xitu Cheng Rd, Beijing 100876, Peoples R China
  • [ 3 ] [Wu, Chunhua]Beijing Univ Posts & Telecommun, 10 Xitu Cheng Rd, Beijing 100876, Peoples R China
  • [ 4 ] [Wang, Jvjie]Beijing Univ Posts & Telecommun, 10 Xitu Cheng Rd, Beijing 100876, Peoples R China
  • [ 5 ] [Wang, Xiujuan]Beijing Univ Technol, 100 Pingleyuan, Beijing 100124, Peoples R China

Corresponding author:

  • [Zheng, Kangfeng]Beijing Univ Posts & Telecommun, 10 Xitu Cheng Rd, Beijing 100876, Peoples R China

Source:

NEUROCOMPUTING

ISSN: 0925-2312

Year: 2024

Volume: 614

Impact factor: 6.000 (JCR@2022)

Citation counts:

WoS Core Collection citations:

Scopus citations: 1

ESI highly cited papers: 0

Wanfang citations:

Chinese citations:

Views in last 30 days: 1

Affiliated department:
