John Smith
97804534b9
fix reference
2023-04-26 17:29:29 +08:00
John Smith
8e5cf08479
fix dependency
2023-04-26 17:17:59 +08:00
John Smith
42ef3484a9
fix _SentinelTokenStoppingCriteria
2023-04-26 17:13:56 +08:00
John Smith
1abdc99675
add server
2023-04-26 17:13:00 +08:00
John Smith
6aab31bd73
update reference
2023-04-13 10:35:10 +08:00
Daniel Nouri
ee7d94a1f3
Bugfix in custom_monkey_patch for v1 models
...
Previously generation would fail with:
File "/alpaca_lora_4bit/text-generation-webui/matmul_utils_4bit.py", line 79, in _matmul4bit_v1_recons
quant_cuda.vecquant4recons_v1(qweight, buffer, scales, zeros)
RuntimeError: expected scalar type Half but found Float
See #71
2023-04-10 12:41:16 +02:00
John Smith
5d3267d80d
add v1 model as default in custom monkey patch
2023-04-10 09:33:41 +08:00
John Smith
9a02a88fb8
add patch for encode function to remove eos token at the beginning of left side
2023-04-06 12:56:27 +08:00
John Smith
bff039de95
add v2 model support
2023-03-28 20:33:55 +08:00
Star Dorminey
399c3d124e
Tested and should be ready!
2023-03-25 20:52:38 -07:00
John Smith
9b04b8eec6
add monkey patch for webui
2023-03-22 07:58:51 +00:00