Update README.md

This commit is contained in:
John Smith 2023-03-23 16:31:49 +08:00 committed by GitHub
parent 44978669cf
commit 619a177fbb
No known key found for this signature in database
GPG Key ID: 4AEE18F83AFDEB23
1 changed files with 5 additions and 1 deletions

View File

@ -6,12 +6,16 @@ Made some adjust for the code in peft and gptq for llama, and make it possible f
# Update Logs
* Resolved numerically unstable issue
<br>
* Reconstruct fp16 matrix from 4bit data and call torch.matmul largely increased the inference speed.
<br>
* Added install script for windows and linux.
<br>
* Added Gradient Checkpointing. Now It can finetune 30b model 4bit on a single GPU with 24G VRAM. (finetune.py updated)
* Added Gradient Checkpointing. Now It can finetune 30b model 4bit on a single GPU with 24G VRAM with Gradient Checkpointing enabled. (finetune.py updated) (but would reduce training speed, so if having enough VRAM this option is not needed)
<br>
* Added install manual by s4rduk4r
<br>