Check the model's maximum input value
#9, opened by minhdang
I want to check the maximum input length (context window) of the K8_0 model, but I can't find any instructions for it.
Here is a sample loading statement: model = AutoModelForCausalLM.from_pretrained('TheBloke/Llama-2-7B-Chat-GGML', model_file='llama-2-7b-chat.ggmlv3.q4_K_S.bin', context_length=1024). Based on this, I think the maximum input context window is 1024 tokens.
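For reference, here is a rough sketch of how one might check whether a prompt fits in the configured window. It assumes the loading statement above comes from the ctransformers library, and that the loaded model exposes a `context_length` property and a `tokenize` method (worth verifying against the installed version). The helper `fits_in_context` is a hypothetical name, not part of any library.

```python
from typing import Callable, Sequence


def fits_in_context(
    prompt: str,
    tokenize: Callable[[str], Sequence],
    context_length: int,
    max_new_tokens: int = 0,
) -> bool:
    """Return True if the prompt tokens plus the tokens to generate fit in the window.

    Hypothetical helper: `tokenize` is any callable that maps text to a
    sequence of tokens, so the check does not depend on a specific library.
    """
    n_prompt_tokens = len(tokenize(prompt))
    return n_prompt_tokens + max_new_tokens <= context_length


# Assumed usage with ctransformers (downloads the model, so it is left commented out):
# from ctransformers import AutoModelForCausalLM
# model = AutoModelForCausalLM.from_pretrained(
#     'TheBloke/Llama-2-7B-Chat-GGML',
#     model_file='llama-2-7b-chat.ggmlv3.q4_K_S.bin',
#     context_length=1024,
# )
# print(model.context_length)  # assumed property reporting the configured window
# print(fits_in_context("Hello", model.tokenize, model.context_length, max_new_tokens=256))
```

Note that the window is shared between the prompt and the generated tokens, so the usable input length is smaller than `context_length` whenever the model is asked to generate output.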