alpaca_lora_4bit

Commit Graph

Author	SHA1	Message	Date
John Smith	1043ded7d9	Merge branch 'main' of github.com:johnsmith0031/alpaca_lora_4bit	2023-03-29 01:26:20 +08:00
John Smith	d28ee06202	fix bug	2023-03-29 01:25:37 +08:00
John Smith	b5e3dae573	Merge pull request #34 from winglian/v2-fixes fixes for most recent update	2023-03-28 23:49:56 +08:00
Wing Lian	b47da33084	fixes for most recent update	2023-03-28 10:56:35 -04:00
John Smith	234004ceb5	fix bug	2023-03-28 22:05:18 +08:00
John Smith	f26615fc0c	fix bug	2023-03-28 21:47:22 +08:00
John Smith	0768d0fdff	update finetune data format	2023-03-28 21:45:33 +08:00
John Smith	8a6c8661df	Merge branch 'main' of github.com:johnsmith0031/alpaca_lora_4bit merged	2023-03-28 21:14:35 +08:00
John Smith	211af574b6	fix bug	2023-03-28 21:12:51 +08:00
John Smith	ac07457473	Update README.md	2023-03-28 20:44:02 +08:00
John Smith	bff039de95	add v2 model support	2023-03-28 20:33:55 +08:00
John Smith	667e43cb5b	Merge pull request #30 from winglian/features/python-fixes backwards support for pre-py3.10, add datasets requirement used in train	2023-03-28 09:34:50 +08:00
Wing Lian	101d314bd9	add missing dependency to train with LlamaTokenizer	2023-03-27 16:13:46 -04:00
Wing Lian	62e54ac1c7	backwards support for pre-py3.10, add datasets requirement used in train	2023-03-27 16:08:20 -04:00
John Smith	6c8c07e7ad	Update README.md	2023-03-27 18:03:28 +08:00
John Smith	cf94d7af68	Update README.md	2023-03-27 17:52:35 +08:00
John Smith	1ca9b8abf8	Update README.md	2023-03-27 17:51:04 +08:00
John Smith	0b5b376de1	Merge pull request #23 from sterlind/star/repos Get dependencies straight from pip!	2023-03-27 17:47:39 +08:00
Star Dorminey	399c3d124e	Tested and should be ready!	2023-03-25 20:52:38 -07:00
Star Dorminey	a2a4c1d117	Remove gitmodules.	2023-03-25 20:23:46 -07:00
Star Dorminey	96440c8717	Removing submodules actually.	2023-03-25 20:20:38 -07:00
Star Dorminey	17f3da744c	Fix repos.	2023-03-25 20:16:48 -07:00
Star Dorminey	8e705eddcb	Add GPTQ and peft as submodules.	2023-03-25 20:02:19 -07:00
John Smith	82dd6dd13e	Merge pull request #20 from kooshi/multi-gpu Enable model parallelism and distributed data parallelism for multi-gpu setups	2023-03-25 15:06:01 +08:00
kooshi	8e471516b8	distributed data parallelism with torchrun	2023-03-24 23:56:06 -05:00
kooshi	2bc64597aa	model parallelism	2023-03-24 23:03:43 -05:00
John Smith	cd1a299ba3	Merge pull request #18 from s4rduk4r/finetune-refactor Refactor finetune.py	2023-03-25 10:29:36 +08:00
John Smith	0879580006	Merge branch 'main' into finetune-refactor	2023-03-25 10:29:02 +08:00
John Smith	e64bd6a214	Create LICENSE	2023-03-25 10:17:44 +08:00
Andrey Glushenkov	397f5041c3	Reflect last changes in main. Reflect commits: `4906961bf1` `60b227d0ba`	2023-03-24 15:46:03 +03:00
Andrey Glushenkov	50dbb101e9	Refactor finetune.py: (1) Add command line arguments support; (2) Add Stanford Alpaca-like dataset support, using code from https://github.com/tloen/alpaca-lora; (3) Fix LoRA pre-train application	2023-03-24 14:15:07 +03:00
John Smith	4906961bf1	fix bug	2023-03-23 23:37:39 +08:00
John Smith	94851cec68	Merge pull request #14 from Ph0rk0z/AtomicAddHalf Fix cuda kernel for Pascal & Cuda 6/6.1	2023-03-23 20:40:29 +08:00
Forkoz	58998acc9f	Fix cuda kernel for Pascal & Cuda 6/6.1. When I left the other functions to use normal atomic add it seemed like a small speedup: 4.79 it/s vs 5.23 it/s	2023-03-23 07:33:57 -05:00
John Smith	60b227d0ba	fix minor bug	2023-03-23 08:43:18 +00:00
John Smith	619a177fbb	Update README.md	2023-03-23 16:31:49 +08:00
John Smith	44978669cf	Add gradient checkpointing	2023-03-23 08:25:29 +00:00
John Smith	eb8ce878d4	Merge pull request #6 from sterlind/patch-1 Fix path to autograd_4bit.py in install.sh	2023-03-22 16:07:25 +08:00
John Smith	9b04b8eec6	add monkey patch for webui	2023-03-22 07:58:51 +00:00
sterlind	0aea8f45e3	Fix path to autograd_4bit.py in install.sh	2023-03-22 00:24:52 -07:00
John Smith	45d2f22c14	Update README.md	2023-03-22 14:56:50 +08:00
John Smith	cab067fef9	Update README.md	2023-03-22 14:55:24 +08:00
John Smith	02bd0338f1	add data	2023-03-22 12:13:34 +08:00
John Smith	dc036373b2	add more scripts and adjust code for transformer branch	2023-03-22 04:09:04 +00:00
John Smith	a955a1c2a5	fix bug	2023-03-22 00:18:24 +08:00
John Smith	467849d13a	Merge branch 'main' of https://github.com/johnsmith0031/alpaca_lora_4bit	2023-03-21 09:45:06 +00:00
John Smith	ef0a326cec	update autograd	2023-03-21 09:41:18 +00:00
John Smith	3be75bb3db	Update README.md	2023-03-21 16:49:08 +08:00
John Smith	8d198e0171	Update README.md	2023-03-21 16:48:17 +08:00
John Smith	3471be4e56	add fast_4bit_matmul and auto switch 2 methods according to bottleneck	2023-03-21 08:43:07 +00:00


62 Commits
