Merge branch 'main' of https://github.com/johnsmith0031/alpaca_lora_4bit

2023-03-21 09:45:06 +00:00 · 2023-03-21 09:45:06 +00:00 · 467849d13a
parent ef0a326cec 3be75bb3db
commit 467849d13a
1 changed files with 2 additions and 0 deletions
--- a/README.md
+++ b/README.md
@ -3,6 +3,8 @@ Made some adjust for the code in peft and gptq for llama, and make it possible f
 <br>
 ~Still numerically unstable.~ Resolved.
 <br>
+Reconstruct fp16 matrix from 4bit data and call torch.matmul largely increased the inference speed.
+<br>
 # Requirements
 gptq-for-llama: https://github.com/qwopqwop200/GPTQ-for-LLaMa<br>
 peft: https://github.com/huggingface/peft.git<br>