Update README.md

John Smith 2023-03-21 16:48:17 +08:00 committed by GitHub
parent 3471be4e56
commit 8d198e0171
No known key found for this signature in database
GPG Key ID: 4AEE18F83AFDEB23
1 changed file with 2 additions and 0 deletions

@@ -3,6 +3,8 @@ Made some adjust for the code in peft and gptq for llama, and make it possible f
<br>
~Still numerically unstable.~ Resolved.
<br>
Reconstructing the fp16 matrix from the 4-bit data and then calling torch.matmul drastically increased the inference speed.
<br>
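A minimal sketch of the idea described above: unpack the 4-bit weights into a dense floating-point matrix, then pass it to `torch.matmul`. The packing layout (two 4-bit values per byte, one scale and zero point per output column) and the helper names `pack_4bit`, `unpack_4bit`, `dequantize`, and `quant_matmul` are assumptions made for illustration. They are not the actual layout or API of GPTQ-for-LLaMa or of this repository.

```python
import torch


def pack_4bit(q: torch.Tensor) -> torch.Tensor:
    """Pack a (rows, cols) uint8 tensor of values in [0, 15] into (rows // 2, cols) bytes.

    Hypothetical layout: even rows go to the low nibble, odd rows to the high nibble.
    """
    assert q.dtype == torch.uint8 and q.shape[0] % 2 == 0
    low, high = q[0::2], q[1::2]
    return low | (high << 4)


def unpack_4bit(packed: torch.Tensor) -> torch.Tensor:
    """Inverse of pack_4bit: recover the (rows, cols) tensor of 4-bit values."""
    rows, cols = packed.shape[0] * 2, packed.shape[1]
    out = torch.empty((rows, cols), dtype=torch.uint8, device=packed.device)
    out[0::2] = packed & 0x0F          # low nibble -> even rows
    out[1::2] = (packed >> 4) & 0x0F   # high nibble -> odd rows
    return out


def dequantize(packed, scales, zeros, dtype):
    """Rebuild a dense weight matrix: w = (q - zero) * scale, per output column."""
    q = unpack_4bit(packed).to(dtype)
    return (q - zeros.to(dtype)) * scales.to(dtype)


def quant_matmul(x, packed, scales, zeros):
    """Reconstruct the dense weights in x's dtype, then use the regular matmul kernel."""
    w = dequantize(packed, scales, zeros, x.dtype)
    return torch.matmul(x, w)
```

On a GPU, `x` would typically be a `torch.float16` tensor so that the reconstructed matrix is fp16 as well. The sketch takes its dtype from `x`, so it also runs in fp32 on CPU.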
# Requirements
gptq-for-llama: https://github.com/qwopqwop200/GPTQ-for-LLaMa<br>
peft: https://github.com/huggingface/peft.git<br>