diff --git a/README.md b/README.md
index 77f0868..ab0e705 100644
--- a/README.md
+++ b/README.md
@@ -7,14 +7,44 @@ Reconstruct fp16 matrix from 4bit data and call torch.matmul largely increased t
 Added install script for windows and linux.
+
 # Requirements
 gptq-for-llama: https://github.com/qwopqwop200/GPTQ-for-LLaMa
 peft: https://github.com/huggingface/peft.git
 
-# Install
-copy files from GPTQ-for-LLaMa into GPTQ-for-LLaMa path and re-compile cuda extension
-copy files from peft/tuners/lora.py to peft path, replace it
-
-# Finetuning
-The same finetune script from https://github.com/tloen/alpaca-lora can be used.
+# Install
+~~copy files from GPTQ-for-LLaMa into GPTQ-for-LLaMa path and re-compile cuda extension~~
+~~copy files from peft/tuners/lora.py to peft path, replace it~~
+
+Linux:
+
+```
+./install.sh
+```
+
+Windows:
+
+```
+./install.bat
+```
+
+# Finetune
+~~The same finetune script from https://github.com/tloen/alpaca-lora can be used.~~
+
+After installation, the following script can be used:
+
+```
+python finetune.py
+```
+
+# Inference
+
+After installation, the following script can be used:
+
+```
+python inference.py
+```
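
For reference, the struck-out manual steps that the new install scripts replace amount to roughly the shell commands below. This is only a sketch of what `./install.sh` presumably automates, reconstructed from the old instructions; the side-by-side GPTQ-for-LLaMa checkout, the `setup_cuda.py` build step, and the installed-`peft` path lookup are assumptions, not the scripts' actual contents.

```
# Sketch of the manual install the scripts automate (assumed layout: a
# GPTQ-for-LLaMa checkout next to this repo; all paths are assumptions).

# Copy the patched files shipped in this repo over the GPTQ-for-LLaMa sources,
# then re-compile the CUDA extension (GPTQ-for-LLaMa builds it with setup_cuda.py).
cp -r ./GPTQ-for-LLaMa/* ../GPTQ-for-LLaMa/
(cd ../GPTQ-for-LLaMa && python setup_cuda.py install)

# Replace the installed peft package's LoRA module with the 4-bit aware lora.py from this repo.
PEFT_DIR=$(python -c "import os, peft; print(os.path.dirname(peft.__file__))")
cp ./peft/tuners/lora.py "$PEFT_DIR/tuners/lora.py"
```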