New LLM-Quantization LoftQ outperforms QLoRA
4.6 هزار بار بازدید -
10 ماه پیش
-
New LLM Quantization method called
New LLM Quantization method called LoftQ (LoRA-Fine-Tuning-aware Quantization) by GeorgiaTech and Microsoft outperforms QLoRA.
Deep dive into the theory of the latest LLM Quantization combined with Low Rank Adaptations (LoRA) of high-precision weight tensors. LoftQ explained in simple terms.
All rights with authors:
https://arxiv.org/pdf/2310.08659.pdf
(please switch the the latest version, in my case v3)
#ai
#quantization
#memory
Deep dive into the theory of the latest LLM Quantization combined with Low Rank Adaptations (LoRA) of high-precision weight tensors. LoftQ explained in simple terms.
All rights with authors:
https://arxiv.org/pdf/2310.08659.pdf
(please switch the the latest version, in my case v3)
#ai
#quantization
#memory
10 ماه پیش
در تاریخ 1402/08/07 منتشر شده
است.
4,621
بـار بازدید شده