Commit Graph

62 Commits

Author SHA1 Message Date
John Smith 1043ded7d9 Merge branch 'main' of github.com:johnsmith0031/alpaca_lora_4bit 2023-03-29 01:26:20 +08:00
John Smith d28ee06202 fix bug 2023-03-29 01:25:37 +08:00
John Smith b5e3dae573 Merge pull request #34 from winglian/v2-fixes 2023-03-28 23:49:56 +08:00
    fixes for most recent update
Wing Lian b47da33084 fixes for most recent update 2023-03-28 10:56:35 -04:00
John Smith 234004ceb5 fix bug 2023-03-28 22:05:18 +08:00
John Smith f26615fc0c fix bug 2023-03-28 21:47:22 +08:00
John Smith 0768d0fdff update finetune data format 2023-03-28 21:45:33 +08:00
John Smith 8a6c8661df Merge branch 'main' of github.com:johnsmith0031/alpaca_lora_4bit 2023-03-28 21:14:35 +08:00
    merged
John Smith 211af574b6 fix bug 2023-03-28 21:12:51 +08:00
John Smith ac07457473 Update README.md 2023-03-28 20:44:02 +08:00
John Smith bff039de95 add v2 model support 2023-03-28 20:33:55 +08:00
John Smith 667e43cb5b Merge pull request #30 from winglian/features/python-fixes 2023-03-28 09:34:50 +08:00
    backwards support for pre-py3.10, add datasets requirement used in train
Wing Lian 101d314bd9 add missing dependency to train with LlamaTokenizer 2023-03-27 16:13:46 -04:00
Wing Lian 62e54ac1c7 backwards support for pre-py3.10, add datasets requirement used in train 2023-03-27 16:08:20 -04:00
John Smith 6c8c07e7ad Update README.md 2023-03-27 18:03:28 +08:00
John Smith cf94d7af68 Update README.md 2023-03-27 17:52:35 +08:00
John Smith 1ca9b8abf8 Update README.md 2023-03-27 17:51:04 +08:00
John Smith 0b5b376de1 Merge pull request #23 from sterlind/star/repos 2023-03-27 17:47:39 +08:00
    Get dependencies straight from pip!
Star Dorminey 399c3d124e Tested and should be ready! 2023-03-25 20:52:38 -07:00
Star Dorminey a2a4c1d117 Remove gitmodules. 2023-03-25 20:23:46 -07:00
Star Dorminey 96440c8717 Removing submodules actually. 2023-03-25 20:20:38 -07:00
Star Dorminey 17f3da744c Fix repos. 2023-03-25 20:16:48 -07:00
Star Dorminey 8e705eddcb Add GPTQ and peft as submodules. 2023-03-25 20:02:19 -07:00
John Smith 82dd6dd13e Merge pull request #20 from kooshi/multi-gpu 2023-03-25 15:06:01 +08:00
    Enable model parallelism and distributed data parallelism for multi-gpu setups
kooshi 8e471516b8 distributed data parallelism with torchrun 2023-03-24 23:56:06 -05:00
kooshi 2bc64597aa model parallelism 2023-03-24 23:03:43 -05:00
John Smith cd1a299ba3 Merge pull request #18 from s4rduk4r/finetune-refactor 2023-03-25 10:29:36 +08:00
    Refactor finetune.py
John Smith 0879580006 Merge branch 'main' into finetune-refactor 2023-03-25 10:29:02 +08:00
John Smith e64bd6a214 Create LICENSE 2023-03-25 10:17:44 +08:00
Andrey Glushenkov 397f5041c3 Reflect last changes in main 2023-03-24 15:46:03 +03:00
    Reflect commits:
    4906961bf1
    60b227d0ba
Andrey Glushenkov 50dbb101e9 Refactor finetune.py 2023-03-24 14:15:07 +03:00
    1. Add command line arguments support
    2. Add Stanford Alpaca-like dataset support. Used code from - https://github.com/tloen/alpaca-lora
    3. Fix LoRA pre-train application
John Smith 4906961bf1 fix bug 2023-03-23 23:37:39 +08:00
John Smith 94851cec68 Merge pull request #14 from Ph0rk0z/AtomicAddHalf 2023-03-23 20:40:29 +08:00
    Fix cuda kernel for Pascal & Cuda 6/6.1
Forkoz 58998acc9f Fix cuda kernel for Pascal & Cuda 6/6.1 2023-03-23 07:33:57 -05:00
    When I left the other functions to use normal atomic add it seemed like a small speedup. 4.79 it/s vs 5.23 it/s
John Smith 60b227d0ba fix minor bug 2023-03-23 08:43:18 +00:00
John Smith 619a177fbb Update README.md 2023-03-23 16:31:49 +08:00
John Smith 44978669cf Add gradient checkpointing 2023-03-23 08:25:29 +00:00
John Smith eb8ce878d4 Merge pull request #6 from sterlind/patch-1 2023-03-22 16:07:25 +08:00
    Fix path to autograd_4bit.py in install.sh
John Smith 9b04b8eec6 add monkey patch for webui 2023-03-22 07:58:51 +00:00
sterlind 0aea8f45e3 Fix path to autograd_4bit.py in install.sh 2023-03-22 00:24:52 -07:00
John Smith 45d2f22c14 Update README.md 2023-03-22 14:56:50 +08:00
John Smith cab067fef9 Update README.md 2023-03-22 14:55:24 +08:00
John Smith 02bd0338f1 add data 2023-03-22 12:13:34 +08:00
John Smith dc036373b2 add more scripts and adjust code for transformer branch 2023-03-22 04:09:04 +00:00
John Smith a955a1c2a5 fix bug 2023-03-22 00:18:24 +08:00
John Smith 467849d13a Merge branch 'main' of https://github.com/johnsmith0031/alpaca_lora_4bit 2023-03-21 09:45:06 +00:00
John Smith ef0a326cec update autograd 2023-03-21 09:41:18 +00:00
John Smith 3be75bb3db Update README.md 2023-03-21 16:49:08 +08:00
John Smith 8d198e0171 Update README.md 2023-03-21 16:48:17 +08:00
John Smith 3471be4e56 add fast_4bit_matmul and auto switch 2 methods according to bottleneck 2023-03-21 08:43:07 +00:00