Commit Graph

11 Commits

Author SHA1 Message Date
John Smith 97804534b9 fix reference 2023-04-26 17:29:29 +08:00
John Smith 8e5cf08479 fix dependency 2023-04-26 17:17:59 +08:00
John Smith 42ef3484a9 fix _SentinelTokenStoppingCriteria 2023-04-26 17:13:56 +08:00
John Smith 1abdc99675 add server 2023-04-26 17:13:00 +08:00
John Smith 6aab31bd73 update reference 2023-04-13 10:35:10 +08:00
Daniel Nouri ee7d94a1f3 Bugfix in custom_monkey_patch for v1 models
Previously generation would fail with:

    File "/alpaca_lora_4bit/text-generation-webui/matmul_utils_4bit.py", line 79, in _matmul4bit_v1_recons
      quant_cuda.vecquant4recons_v1(qweight, buffer, scales, zeros)
    RuntimeError: expected scalar type Half but found Float

See #71
2023-04-10 12:41:16 +02:00
John Smith 5d3267d80d add v1 model as default in custom monkey patch 2023-04-10 09:33:41 +08:00
John Smith 9a02a88fb8 add patch for encode function to remove eos token at the beginning of left side 2023-04-06 12:56:27 +08:00
John Smith bff039de95 add v2 model support 2023-03-28 20:33:55 +08:00
Star Dorminey 399c3d124e Tested and should be ready! 2023-03-25 20:52:38 -07:00
John Smith 9b04b8eec6 add monkey patch for webui 2023-03-22 07:58:51 +00:00