John Smith
|
1043ded7d9
|
Merge branch 'main' of github.com:johnsmith0031/alpaca_lora_4bit
|
2023-03-29 01:26:20 +08:00 |
John Smith
|
d28ee06202
|
fix bug
|
2023-03-29 01:25:37 +08:00 |
John Smith
|
b5e3dae573
|
Merge pull request #34 from winglian/v2-fixes
fixes for most recent update
|
2023-03-28 23:49:56 +08:00 |
Wing Lian
|
b47da33084
|
fixes for most recent update
|
2023-03-28 10:56:35 -04:00 |
John Smith
|
234004ceb5
|
fix bug
|
2023-03-28 22:05:18 +08:00 |
John Smith
|
f26615fc0c
|
fix bug
|
2023-03-28 21:47:22 +08:00 |
John Smith
|
0768d0fdff
|
update finetune data format
|
2023-03-28 21:45:33 +08:00 |
John Smith
|
8a6c8661df
|
Merge branch 'main' of github.com:johnsmith0031/alpaca_lora_4bit
merged
|
2023-03-28 21:14:35 +08:00 |
John Smith
|
211af574b6
|
fix bug
|
2023-03-28 21:12:51 +08:00 |
John Smith
|
ac07457473
|
Update README.md
|
2023-03-28 20:44:02 +08:00 |
John Smith
|
bff039de95
|
add v2 model support
|
2023-03-28 20:33:55 +08:00 |
John Smith
|
667e43cb5b
|
Merge pull request #30 from winglian/features/python-fixes
backwards support for pre-py3.10, add datasets requirement used in train
|
2023-03-28 09:34:50 +08:00 |
Wing Lian
|
101d314bd9
|
add missing dependency to train with LlamaTokenizer
|
2023-03-27 16:13:46 -04:00 |
Wing Lian
|
62e54ac1c7
|
backwards support for pre-py3.10, add datasets requirement used in train
|
2023-03-27 16:08:20 -04:00 |
John Smith
|
6c8c07e7ad
|
Update README.md
|
2023-03-27 18:03:28 +08:00 |
John Smith
|
cf94d7af68
|
Update README.md
|
2023-03-27 17:52:35 +08:00 |
John Smith
|
1ca9b8abf8
|
Update README.md
|
2023-03-27 17:51:04 +08:00 |
John Smith
|
0b5b376de1
|
Merge pull request #23 from sterlind/star/repos
Get dependencies straight from pip!
|
2023-03-27 17:47:39 +08:00 |
Star Dorminey
|
399c3d124e
|
Tested and should be ready!
|
2023-03-25 20:52:38 -07:00 |
Star Dorminey
|
a2a4c1d117
|
Remove gitmodules.
|
2023-03-25 20:23:46 -07:00 |
Star Dorminey
|
96440c8717
|
Removing submodules actually.
|
2023-03-25 20:20:38 -07:00 |
Star Dorminey
|
17f3da744c
|
Fix repos.
|
2023-03-25 20:16:48 -07:00 |
Star Dorminey
|
8e705eddcb
|
Add GPTQ and peft as submodules.
|
2023-03-25 20:02:19 -07:00 |
John Smith
|
82dd6dd13e
|
Merge pull request #20 from kooshi/multi-gpu
Enable model parallelism and distributed data parallelism for multi-gpu setups
|
2023-03-25 15:06:01 +08:00 |
kooshi
|
8e471516b8
|
distributed data parallelism with torchrun
|
2023-03-24 23:56:06 -05:00 |
kooshi
|
2bc64597aa
|
model parallelism
|
2023-03-24 23:03:43 -05:00 |
John Smith
|
cd1a299ba3
|
Merge pull request #18 from s4rduk4r/finetune-refactor
Refactor finetune.py
|
2023-03-25 10:29:36 +08:00 |
John Smith
|
0879580006
|
Merge branch 'main' into finetune-refactor
|
2023-03-25 10:29:02 +08:00 |
John Smith
|
e64bd6a214
|
Create LICENSE
|
2023-03-25 10:17:44 +08:00 |
Andrey Glushenkov
|
397f5041c3
|
Reflect last changes in main
Reflect commits:
4906961bf1
60b227d0ba
|
2023-03-24 15:46:03 +03:00 |
Andrey Glushenkov
|
50dbb101e9
|
Refactor finetune.py
1. Add command line arguments support
2. Add Stanford Alpaca-like dataset support. Used code from - https://github.com/tloen/alpaca-lora
3. Fix LoRA pre-train application
|
2023-03-24 14:15:07 +03:00 |
John Smith
|
4906961bf1
|
fix bug
|
2023-03-23 23:37:39 +08:00 |
John Smith
|
94851cec68
|
Merge pull request #14 from Ph0rk0z/AtomicAddHalf
Fix cuda kernel for Pascal & Cuda 6/6.1
|
2023-03-23 20:40:29 +08:00 |
Forkoz
|
58998acc9f
|
Fix cuda kernel for Pascal & Cuda 6/6.1
When I left the other functions to use normal atomic add it seemed like a small speedup. 4.79 it/s vs 5.23 it/s
|
2023-03-23 07:33:57 -05:00 |
John Smith
|
60b227d0ba
|
fix minor bug
|
2023-03-23 08:43:18 +00:00 |
John Smith
|
619a177fbb
|
Update README.md
|
2023-03-23 16:31:49 +08:00 |
John Smith
|
44978669cf
|
Add gradient checkpointing
|
2023-03-23 08:25:29 +00:00 |
John Smith
|
eb8ce878d4
|
Merge pull request #6 from sterlind/patch-1
Fix path to autograd_4bit.py in install.sh
|
2023-03-22 16:07:25 +08:00 |
John Smith
|
9b04b8eec6
|
add monkey patch for webui
|
2023-03-22 07:58:51 +00:00 |
sterlind
|
0aea8f45e3
|
Fix path to autograd_4bit.py in install.sh
|
2023-03-22 00:24:52 -07:00 |
John Smith
|
45d2f22c14
|
Update README.md
|
2023-03-22 14:56:50 +08:00 |
John Smith
|
cab067fef9
|
Update README.md
|
2023-03-22 14:55:24 +08:00 |
John Smith
|
02bd0338f1
|
add data
|
2023-03-22 12:13:34 +08:00 |
John Smith
|
dc036373b2
|
add more scripts and adjust code for transformer branch
|
2023-03-22 04:09:04 +00:00 |
John Smith
|
a955a1c2a5
|
fix bug
|
2023-03-22 00:18:24 +08:00 |
John Smith
|
467849d13a
|
Merge branch 'main' of https://github.com/johnsmith0031/alpaca_lora_4bit
|
2023-03-21 09:45:06 +00:00 |
John Smith
|
ef0a326cec
|
update autograd
|
2023-03-21 09:41:18 +00:00 |
John Smith
|
3be75bb3db
|
Update README.md
|
2023-03-21 16:49:08 +08:00 |
John Smith
|
8d198e0171
|
Update README.md
|
2023-03-21 16:48:17 +08:00 |
John Smith
|
3471be4e56
|
add fast_4bit_matmul and auto switch 2 methods according to bottleneck
|
2023-03-21 08:43:07 +00:00 |