alpaca_lora_4bit

Commit Graph

Author	SHA1	Message	Date
Andrey Glushenkov	c991e2a091	Delete triton_test.py	2023-04-06 02:39:40 +03:00
Andrey Glushenkov	0d271d5d90	Add files via upload Fix triton kernels	2023-04-06 02:38:06 +03:00
Andrey Glushenkov	4a2d23aa29	Delete autograd_4bit.py File moved to autograd_4bit module	2023-04-06 02:31:06 +03:00
Andrey Glushenkov	f20570343f	GPTQv2 support GPTQv2 support. 1. Adds dependency on `triton` 2. Refactors autograd_4bit to include both GPTQv1 and GPTQv2 3. Introduces new environment variable GPTQ_VERSION to select autograd_4bit version 4. Fixes triton kernels 5. Matrix multiplications are in fp16	2023-04-06 02:29:36 +03:00
John Smith	86387a0a35	update multi gpu support in finetune.py	2023-04-03 23:55:58 +08:00
John Smith	5655f218ed	add g_idx buffer.\nadd triton matmul utils for future support.	2023-04-02 21:29:06 +08:00
John Smith	f3a25342e1	fix device_map bug when using lora_apply_dir	2023-03-31 19:44:36 +08:00
John Smith	00bf0a1e1b	Update README.md	2023-03-31 14:17:35 +08:00
John Smith	dd0efc721f	Merge pull request #47 from winglian/better-gpt4all fix gpt4all training to more closely match the released logic, other small fixes and optimizations	2023-03-31 11:20:03 +08:00
Wing Lian	8791eaee9a	fix gpt4all training to more closely match the released logic, other small fixes and optimizations	2023-03-30 22:40:40 -04:00
John Smith	878eada8dd	add amp_wrapper for autocast support.	2023-03-30 19:57:19 +08:00
John Smith	b3c91a5af5	Merge pull request #45 from winglian/fix-missing-bracket fix missing paren	2023-03-30 13:53:55 +08:00
Wing Lian	e744aec8bf	fix missing paren	2023-03-29 23:40:30 -04:00
John Smith	8db4633d84	Update README.md	2023-03-30 11:24:25 +08:00
John Smith	8a62560e6c	add offload support	2023-03-30 11:21:21 +08:00
John Smith	32976f91c4	Merge pull request #42 from winglian/multigpu-fix better multi-gpu support, support gpt4all training data	2023-03-30 00:03:27 +08:00
Wing Lian	b7361da58a	better multi-gpu support, support gpt4all training data	2023-03-29 11:21:47 -04:00
John Smith	0fdae9224c	optimized groupsize backward for performance	2023-03-29 17:44:51 +08:00
John Smith	5986649b37	Update README.md	2023-03-29 14:46:28 +08:00
John Smith	1c02d4262d	add resume checkpoint to continue a training	2023-03-29 14:35:39 +08:00
John Smith	2a1cb42966	add padding support as an option	2023-03-29 11:20:16 +08:00
John Smith	cff57ebfa4	Merge pull request #39 from winglian/fix-prompt-eos-token properly include the eos token so inference doesn't blabber on	2023-03-29 10:35:46 +08:00
Wing Lian	daad59f8ef	properly include the eos token so inference doesn't blabber on	2023-03-28 20:53:16 -04:00
John Smith	1719bd0ce3	fix bug	2023-03-29 08:09:40 +08:00
John Smith	1043ded7d9	Merge branch 'main' of github.com:johnsmith0031/alpaca_lora_4bit	2023-03-29 01:26:20 +08:00
John Smith	d28ee06202	fix bug	2023-03-29 01:25:37 +08:00
John Smith	b5e3dae573	Merge pull request #34 from winglian/v2-fixes fixes for most recent update	2023-03-28 23:49:56 +08:00
Wing Lian	b47da33084	fixes for most recent update	2023-03-28 10:56:35 -04:00
John Smith	234004ceb5	fix bug	2023-03-28 22:05:18 +08:00
John Smith	f26615fc0c	fix bug	2023-03-28 21:47:22 +08:00
John Smith	0768d0fdff	update finetune data format	2023-03-28 21:45:33 +08:00
John Smith	8a6c8661df	Merge branch 'main' of github.com:johnsmith0031/alpaca_lora_4bit merged	2023-03-28 21:14:35 +08:00
John Smith	211af574b6	fix bug	2023-03-28 21:12:51 +08:00
John Smith	ac07457473	Update README.md	2023-03-28 20:44:02 +08:00
John Smith	bff039de95	add v2 model support	2023-03-28 20:33:55 +08:00
John Smith	667e43cb5b	Merge pull request #30 from winglian/features/python-fixes backwards support for pre-py3.10, add datasets requirement used in train	2023-03-28 09:34:50 +08:00
Wing Lian	101d314bd9	add missing dependency to train with LlamaTokenizer	2023-03-27 16:13:46 -04:00
Wing Lian	62e54ac1c7	backwards support for pre-py3.10, add datasets requirement used in train	2023-03-27 16:08:20 -04:00
John Smith	6c8c07e7ad	Update README.md	2023-03-27 18:03:28 +08:00
John Smith	cf94d7af68	Update README.md	2023-03-27 17:52:35 +08:00
John Smith	1ca9b8abf8	Update README.md	2023-03-27 17:51:04 +08:00
John Smith	0b5b376de1	Merge pull request #23 from sterlind/star/repos Get dependencies straight from pip!	2023-03-27 17:47:39 +08:00
Star Dorminey	399c3d124e	Tested and should be ready!	2023-03-25 20:52:38 -07:00
Star Dorminey	a2a4c1d117	Remove gitmodules.	2023-03-25 20:23:46 -07:00
Star Dorminey	96440c8717	Removing submodules actually.	2023-03-25 20:20:38 -07:00
Star Dorminey	17f3da744c	Fix repos.	2023-03-25 20:16:48 -07:00
Star Dorminey	8e705eddcb	Add GPTQ and peft as submodules.	2023-03-25 20:02:19 -07:00
John Smith	82dd6dd13e	Merge pull request #20 from kooshi/multi-gpu Enable model parallelism and distributed data parallelism for multi-gpu setups	2023-03-25 15:06:01 +08:00
kooshi	8e471516b8	distributed data parallelism with torchrun	2023-03-24 23:56:06 -05:00
kooshi	2bc64597aa	model parallelism	2023-03-24 23:03:43 -05:00

1 2

86 Commits All Branches Search

86 Commits

All Branches