alpaca_lora_4bit/GPTQ-for-LLaMa
John Smith 5b64833390 add half support on cuda kernel 2023-03-20 09:19:05 +00:00
..
autograd_4bit.py reduced memory usage by a little 2023-03-20 00:51:52 +08:00
quant_cuda.cpp add half support on cuda kernel 2023-03-20 09:19:05 +00:00
quant_cuda_kernel.cu add half support on cuda kernel 2023-03-20 09:19:05 +00:00