alpaca_lora_4bit/GPTQ-for-LLaMa at ef0a326cec099e3065ea3009850b683c5ebbcc33 - alpaca_lora_4bit - Telosama Gitea Server

ilotoki_thu/alpaca_lora_4bit

John Smith ef0a326cec update autograd

2023-03-21 09:41:18 +00:00

File                  Last commit message                                                     Date
autograd_4bit.py      update autograd                                                         2023-03-21 09:41:18 +00:00
quant_cuda_kernel.cu  add fast_4bit_matmul and auto switch 2 methods according to bottleneck  2023-03-21 08:43:07 +00:00
quant_cuda.cpp        add fast_4bit_matmul and auto switch 2 methods according to bottleneck  2023-03-21 08:43:07 +00:00