alpaca_lora_4bit/GPTQ-for-LLaMa at a955a1c2a5ea7b97aa806ef77bc33cefcdac3f73 - alpaca_lora_4bit - Telosama Gitea Server

ilotoki_thu/alpaca_lora_4bit

Files

History

John Smith a955a1c2a5 fix bug

2023-03-22 00:18:24 +08:00

..

autograd_4bit.py

fix bug

2023-03-22 00:18:24 +08:00

quant_cuda_kernel.cu

add fast_4bit_matmul and auto switch 2 methods according to bottleneck

2023-03-21 08:43:07 +00:00

quant_cuda.cpp

add fast_4bit_matmul and auto switch 2 methods according to bottleneck

2023-03-21 08:43:07 +00:00