Commit Graph

25 Commits

Author SHA1 Message Date
John Smith 9351f49542 merge pull request in new branch 2023-04-07 10:40:24 +08:00
John Smith 8020b3ec3b
Update README.md 2023-04-06 13:57:32 +08:00
Andrey Glushenkov f20570343f
GPTQv2 support
GPTQv2 support.
1. Adds dependency on `triton`
2. Refactors autograd_4bit to include both GPTQv1 and GPTQv2
3. Introduces new environment variable GPTQ_VERSION to select autograd_4bit version
4. Fixes triton kernels
5. Matrix multiplications are in fp16
2023-04-06 02:29:36 +03:00
John Smith 00bf0a1e1b
Update README.md 2023-03-31 14:17:35 +08:00
John Smith 8db4633d84
Update README.md 2023-03-30 11:24:25 +08:00
John Smith 5986649b37
Update README.md 2023-03-29 14:46:28 +08:00
John Smith ac07457473
Update README.md 2023-03-28 20:44:02 +08:00
John Smith 6c8c07e7ad
Update README.md 2023-03-27 18:03:28 +08:00
John Smith cf94d7af68
Update README.md 2023-03-27 17:52:35 +08:00
John Smith 1ca9b8abf8
Update README.md 2023-03-27 17:51:04 +08:00
Star Dorminey 399c3d124e Tested and should be ready! 2023-03-25 20:52:38 -07:00
John Smith 619a177fbb
Update README.md 2023-03-23 16:31:49 +08:00
John Smith 44978669cf Add gradient checkpointing 2023-03-23 08:25:29 +00:00
John Smith 9b04b8eec6 add monkey patch for webui 2023-03-22 07:58:51 +00:00
John Smith 45d2f22c14
Update README.md 2023-03-22 14:56:50 +08:00
John Smith cab067fef9
Update README.md 2023-03-22 14:55:24 +08:00
John Smith dc036373b2 add more scripts and adjust code for transformer branch 2023-03-22 04:09:04 +00:00
John Smith 3be75bb3db
Update README.md 2023-03-21 16:49:08 +08:00
John Smith 8d198e0171
Update README.md 2023-03-21 16:48:17 +08:00
John Smith 5c1411ff18
Update README.md 2023-03-20 15:04:18 +08:00
John Smith fecce0e1a5
Update README.md 2023-03-18 18:21:01 +08:00
John Smith ae04f88e57
Update README.md 2023-03-18 13:36:06 +08:00
John Smith bbaf1b1bf5
Update README.md 2023-03-18 13:35:36 +08:00
John Smith 326bc9214a
Update README.md 2023-03-18 13:26:04 +08:00
John Smith 42118e3267
Initial commit 2023-03-18 13:21:20 +08:00