Update README.md

John Smith 2023-04-09 12:50:49 +08:00 committed by GitHub
parent 4c18a56fc0
commit f185b90c3e
GPG Key ID: 4AEE18F83AFDEB23
1 changed file with 1 addition and 0 deletions

@@ -33,6 +33,7 @@ It's fast on a 3070 Ti mobile. Uses 5-6 GB of GPU RAM.
* Added monkey patch for text generation webui for fixing initial eos token issue.
* Added Flash attention support. (Use --flash-attention)
* Added Triton backend to support model using groupsize and act-order. (Use --backend=triton)
* Added g_idx support in cuda backend (need recompile cuda kernel)
# Requirements
gptq-for-llama <br>