alpaca_lora_4bit/GPTQ-for-LLaMa
John Smith ef0a326cec update autograd 2023-03-21 09:41:18 +00:00
..
autograd_4bit.py update autograd 2023-03-21 09:41:18 +00:00
quant_cuda.cpp add fast_4bit_matmul and auto switch 2 methods according to bottleneck 2023-03-21 08:43:07 +00:00
quant_cuda_kernel.cu add fast_4bit_matmul and auto switch 2 methods according to bottleneck 2023-03-21 08:43:07 +00:00