2024-07-03 09:12:28 | INFO | model_worker | args: Namespace(awq_ckpt=None, awq_groupsize=-1, awq_wbits=16, controller_address='http://localhost:21001', conv_template=None, cpu_offloading=False, debug=False, device='cuda', dtype=None, embed_in_truncate=False, enable_exllama=False, enable_xft=False, exllama_cache_8bit=False, exllama_gpu_split=None, exllama_max_seq_len=4096, gptq_act_order=False, gptq_ckpt=None, gptq_groupsize=-1, gptq_wbits=16, gpus=None, host='127.0.0.1', limit_worker_concurrency=5, load_8bit=False, max_gpu_memory=None, model_names=None, model_path='lmsys/vicuna-7b-v1.5', no_register=False, num_gpus=1, port=31000, revision='main', seed=None, ssl=False, stream_interval=2, worker_address='http://localhost:31000', xft_dtype=None, xft_max_seq_len=4096) 2024-07-03 09:12:28 | INFO | model_worker | Loading the model ['vicuna-7b-v1.5'] on worker 8276027a ... 2024-07-03 09:12:29 | ERROR | stderr | /usr/local/lib/python3.8/dist-packages/torch/storage.py:315: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 2024-07-03 09:12:29 | ERROR | stderr | warnings.warn(message, UserWarning) 2024-07-03 09:12:29 | ERROR | stderr | Loading checkpoint shards: 0%| | 0/2 [00:00: Failed to establish a new connection: [Errno 111] Connection refused 2024-07-03 09:13:01 | ERROR | stderr | 2024-07-03 09:13:01 | ERROR | stderr | The above exception was the direct cause of the following exception: 2024-07-03 09:13:01 | ERROR | stderr | 2024-07-03 09:13:01 | ERROR | stderr | Traceback (most recent call last): 2024-07-03 09:13:01 | ERROR | stderr | File "/usr/local/lib/python3.8/dist-packages/requests/adapters.py", line 667, in send 2024-07-03 09:13:01 | ERROR | stderr | resp = conn.urlopen( 2024-07-03 09:13:01 | ERROR | stderr | File "/usr/local/lib/python3.8/dist-packages/urllib3/connectionpool.py", line 843, in urlopen 2024-07-03 09:13:01 | ERROR | stderr | retries = retries.increment( 2024-07-03 09:13:01 | ERROR | stderr | File "/usr/local/lib/python3.8/dist-packages/urllib3/util/retry.py", line 519, in increment 2024-07-03 09:13:01 | ERROR | stderr | raise MaxRetryError(_pool, url, reason) from reason # type: ignore[arg-type] 2024-07-03 09:13:01 | ERROR | stderr | urllib3.exceptions.MaxRetryError: HTTPConnectionPool(host='localhost', port=21001): Max retries exceeded with url: /register_worker (Caused by NewConnectionError(': Failed to establish a new connection: [Errno 111] Connection refused')) 2024-07-03 09:13:01 | ERROR | stderr | 2024-07-03 09:13:01 | ERROR | stderr | During handling of the above exception, another exception occurred: 2024-07-03 09:13:01 | ERROR | stderr | 2024-07-03 09:13:01 | ERROR | stderr | Traceback (most recent call last): 2024-07-03 09:13:01 | ERROR | stderr | File "/usr/lib/python3.8/runpy.py", line 194, in _run_module_as_main 2024-07-03 09:13:01 | ERROR | stderr | return _run_code(code, main_globals, None, 2024-07-03 09:13:01 | ERROR | stderr | File "/usr/lib/python3.8/runpy.py", line 87, in _run_code 2024-07-03 09:13:01 | ERROR | stderr | exec(code, run_globals) 2024-07-03 09:13:01 | ERROR | stderr | File "/LLM_32T/evelyn/FastChat/fastchat/serve/model_worker.py", line 414, in 2024-07-03 09:13:01 | ERROR | stderr | args, worker = create_model_worker() 2024-07-03 09:13:01 | ERROR | stderr | File "/LLM_32T/evelyn/FastChat/fastchat/serve/model_worker.py", line 385, in create_model_worker 2024-07-03 09:13:01 | ERROR | stderr | worker = ModelWorker( 2024-07-03 09:13:01 | ERROR | stderr | File "/LLM_32T/evelyn/FastChat/fastchat/serve/model_worker.py", line 102, in __init__ 2024-07-03 09:13:01 | ERROR | stderr | self.init_heart_beat() 2024-07-03 09:13:01 | ERROR | stderr | File "/LLM_32T/evelyn/FastChat/fastchat/serve/base_model_worker.py", line 81, in init_heart_beat 2024-07-03 09:13:01 | ERROR | stderr | self.register_to_controller() 2024-07-03 09:13:01 | ERROR | stderr | File "/LLM_32T/evelyn/FastChat/fastchat/serve/base_model_worker.py", line 99, in register_to_controller 2024-07-03 09:13:01 | ERROR | stderr | r = requests.post(url, json=data) 2024-07-03 09:13:01 | ERROR | stderr | File "/usr/local/lib/python3.8/dist-packages/requests/api.py", line 115, in post 2024-07-03 09:13:01 | ERROR | stderr | return request("post", url, data=data, json=json, **kwargs) 2024-07-03 09:13:01 | ERROR | stderr | File "/usr/local/lib/python3.8/dist-packages/requests/api.py", line 59, in request 2024-07-03 09:13:01 | ERROR | stderr | return session.request(method=method, url=url, **kwargs) 2024-07-03 09:13:01 | ERROR | stderr | File "/usr/local/lib/python3.8/dist-packages/requests/sessions.py", line 589, in request 2024-07-03 09:13:01 | ERROR | stderr | resp = self.send(prep, **send_kwargs) 2024-07-03 09:13:01 | ERROR | stderr | File "/usr/local/lib/python3.8/dist-packages/requests/sessions.py", line 703, in send 2024-07-03 09:13:01 | ERROR | stderr | r = adapter.send(request, **kwargs) 2024-07-03 09:13:01 | ERROR | stderr | File "/usr/local/lib/python3.8/dist-packages/requests/adapters.py", line 700, in send 2024-07-03 09:13:01 | ERROR | stderr | raise ConnectionError(e, request=request) 2024-07-03 09:13:01 | ERROR | stderr | requests.exceptions.ConnectionError: HTTPConnectionPool(host='localhost', port=21001): Max retries exceeded with url: /register_worker (Caused by NewConnectionError(': Failed to establish a new connection: [Errno 111] Connection refused'))