alpaca_lora_4bit

History

Daniel Nouri ee7d94a1f3 Bugfix in custom_monkey_patch for v1 models Previously generation would fail with: File "/alpaca_lora_4bit/text-generation-webui/matmul_utils_4bit.py", line 79, in _matmul4bit_v1_recons quant_cuda.vecquant4recons_v1(qweight, buffer, scales, zeros) RuntimeError: expected scalar type Half but found Float See #71	2023-04-10 12:41:16 +02:00
..
custom_monkey_patch.py	Bugfix in custom_monkey_patch for v1 models	2023-04-10 12:41:16 +02:00

Daniel Nouri ee7d94a1f3 Bugfix in custom_monkey_patch for v1 models

Previously generation would fail with:

    File "/alpaca_lora_4bit/text-generation-webui/matmul_utils_4bit.py", line 79, in _matmul4bit_v1_recons
      quant_cuda.vecquant4recons_v1(qweight, buffer, scales, zeros)
  RuntimeError: expected scalar type Half but found Float

See #71

2023-04-10 12:41:16 +02:00

custom_monkey_patch.py

Bugfix in custom_monkey_patch for v1 models

2023-04-10 12:41:16 +02:00