| John Smith | dc036373b2 | add more scripts and adjust code for transformer branch | 2023-03-22 04:09:04 +00:00 |
| John Smith | a955a1c2a5 | fix bug | 2023-03-22 00:18:24 +08:00 |
| John Smith | ef0a326cec | update autograd | 2023-03-21 09:41:18 +00:00 |
| John Smith | 3471be4e56 | add fast_4bit_matmul and auto switch 2 methods according to bottleneck | 2023-03-21 08:43:07 +00:00 |
| John Smith | dd0d5a31f7 | add half support | 2023-03-20 09:37:51 +00:00 |
| John Smith | 5b64833390 | add half support on cuda kernel | 2023-03-20 09:19:05 +00:00 |
| John Smith | 04f5575a23 | reduced memory usage by a little | 2023-03-20 00:51:52 +08:00 |
| John Smith | 2b84b32fbe | Update autograd_4bit.py | 2023-03-18 22:13:11 +08:00 |
| John Smith | 6f4bbb40a9 | Update autograd_4bit.py | 2023-03-18 18:49:26 +08:00 |
| John Smith | 551f62a0e8 | add patch for gptq and peft | 2023-03-18 13:31:48 +08:00 |