This is the repo for the Efficient Finetuning of Quantized LLMs
project, which aims to build and share tuning methods for instruction-following Chinese baichuan-7b/LLaMA/Pythia/GLM models.
The instruction-following models can be tuned on a single Nvidia RTX 2080 Ti, and the multi-round chatbot can be trained on a single Nvidia RTX 3090 with a context length of 2048.
For details, see: https://github.com/jianzhnie/Efficient-Tuning-LLMs/tree/main