runtime error

Exit code: 1. Reason: 7MB/s]
model.safetensors: 100%|█████████▉| 2.47G/2.47G [00:06<00:00, 370MB/s]
generation_config.json: 0%| | 0.00/189 [00:00<?, ?B/s]
generation_config.json: 100%|██████████| 189/189 [00:00<00:00, 1.48MB/s]
llama_new_context_with_model: n_ctx_per_seq (2048) < n_ctx_train (131072) -- the full capacity of the model will not be utilized
Starting generation test...
Running comparison...
Traceback (most recent call last):
  File "/home/user/app/app.py", line 160, in <module>
    test_generation()
  File "/home/user/app/app.py", line 155, in test_generation
    strategies_results, results_df = compare_strategies(llama_model, llama_tokenizer, prm_model, sample_prompt, 1)
  File "/home/user/app/app.py", line 129, in compare_strategies
    "Majority Voting": majority_voting(model, tokenizer, prompt, num_samples),
TypeError: majority_voting() takes from 1 to 2 positional arguments but 4 were given
Exception ignored in: <function Llama.__del__ at 0x7f605a6e7a30>
Traceback (most recent call last):
  File "/usr/local/lib/python3.10/site-packages/llama_cpp/llama.py", line 2201, in __del__
  File "/usr/local/lib/python3.10/site-packages/llama_cpp/llama.py", line 2198, in close
  File "/usr/local/lib/python3.10/contextlib.py", line 584, in close
  File "/usr/local/lib/python3.10/contextlib.py", line 576, in __exit__
  File "/usr/local/lib/python3.10/contextlib.py", line 561, in __exit__
  File "/usr/local/lib/python3.10/contextlib.py", line 340, in __exit__
  File "/usr/local/lib/python3.10/site-packages/llama_cpp/_internals.py", line 69, in close
  File "/usr/local/lib/python3.10/contextlib.py", line 584, in close
  File "/usr/local/lib/python3.10/contextlib.py", line 576, in __exit__
  File "/usr/local/lib/python3.10/contextlib.py", line 561, in __exit__
  File "/usr/local/lib/python3.10/contextlib.py", line 449, in _exit_wrapper
  File "/usr/local/lib/python3.10/site-packages/llama_cpp/_internals.py", line 63, in free_model
TypeError: 'NoneType' object is not callable
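
The first TypeError in the log is the one that stops the app: compare_strategies passes four positional arguments to majority_voting, which accepts at most two. The real definition of majority_voting in app.py does not appear in the log, so the sketch below uses a hypothetical stand-in with one required and one optional parameter (both names are assumptions) only to show how that kind of signature mismatch produces this message. The second TypeError is reported as "Exception ignored in" Llama.__del__, so it is raised during cleanup and is probably a side effect of the interpreter shutting down rather than a separate bug.

```python
from collections import Counter


# Hypothetical stand-in: the real majority_voting in app.py is not shown in the log.
# One required plus one optional parameter matches "takes from 1 to 2 positional arguments".
def majority_voting(candidates, default=None):
    """Return the most common candidate, or `default` when there are none."""
    if not candidates:
        return default
    return Counter(candidates).most_common(1)[0][0]


def call_with_four_args():
    """Call the function the way line 129 of the traceback does and capture the error text."""
    try:
        # Placeholder strings stand in for model, tokenizer, prompt; 1 is num_samples.
        majority_voting("model", "tokenizer", "prompt", 1)
    except TypeError as exc:
        return str(exc)
    return None


error_message = call_with_four_args()
print(error_message)
```

Whichever side is wrong in the actual app, the fix is to make the call at line 129 and the function definition agree on the same parameter list.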
