How to train this model with text-generation-webui?

by wmr - opened May 9, 2023

wmr

May 9, 2023

I tried adding an alpaca style dataset with text-generation-webui and it gave the following error in the training:

(...) training.py", line 247, in tokenize
    result = shared.tokenizer(prompt, truncation=True, max_length=cutoff_len + 1, padding="max_length")
             ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
TypeError: 'LlamaCppModel' object is not callable

I'm using q5_1.

TheBloke

Owner May 10, 2023

You can't train GGML models I'm afraid.

You'll need a model for GPU inference, either an unquantised HF model, or (I think) a GPTQ 4bit model can work. I've never tried training in text-gen-ui myself so not sure of the specifics. But it definitely can't work on a GGML model with llama.cpp (not yet anyway - maybe llama.cpp will add that in the future!)

wmr

May 10, 2023

How to add new knowledge to this model then, without a full retraining? Thanks.

TheBloke

Owner May 10, 2023

•

edited May 10, 2023

You would do fine tuning on the HF model, available here: https://huggingface.co./TheBloke/wizardLM-7B-HF

You'll need a GPU with enough VRAM, though. Which means you'll need at least 16GB VRAM. If you have less, you could investigate doing fine tuning in 4bit, eg check out these repos:
https://github.com/johnsmith0031/alpaca_lora_4bit
https://github.com/stochasticai/xturing/tree/main/examples/int4_finetuning

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment