runtime error
compiled kernel found. Compiling kernels : /home/user/.cache/huggingface/modules/transformers_modules/THUDM/chatglm-6b-int8/22906aeb32fd7952ce323dc9d25e01693b270da6/quantization_kernels_parallel.c Compiling gcc -O3 -fPIC -pthread -fopenmp -std=c99 /home/user/.cache/huggingface/modules/transformers_modules/THUDM/chatglm-6b-int8/22906aeb32fd7952ce323dc9d25e01693b270da6/quantization_kernels_parallel.c -shared -o /home/user/.cache/huggingface/modules/transformers_modules/THUDM/chatglm-6b-int8/22906aeb32fd7952ce323dc9d25e01693b270da6/quantization_kernels_parallel.so Load kernel : /home/user/.cache/huggingface/modules/transformers_modules/THUDM/chatglm-6b-int8/22906aeb32fd7952ce323dc9d25e01693b270da6/quantization_kernels_parallel.so Setting CPU quantization kernel threads to 8 Using quantization cache Applying quantization to glm layers Traceback (most recent call last): File "/home/user/app/app.py", line 6, in <module> model = AutoModel.from_pretrained("THUDM/chatglm-6b-int8", trust_remote_code=True, ignore_mismatched_sizes=True).half().cuda() File "/home/user/.local/lib/python3.10/site-packages/torch/nn/modules/module.py", line 905, in cuda return self._apply(lambda t: t.cuda(device)) File "/home/user/.local/lib/python3.10/site-packages/torch/nn/modules/module.py", line 797, in _apply module._apply(fn) File "/home/user/.local/lib/python3.10/site-packages/torch/nn/modules/module.py", line 797, in _apply module._apply(fn) File "/home/user/.local/lib/python3.10/site-packages/torch/nn/modules/module.py", line 820, in _apply param_applied = fn(param) File "/home/user/.local/lib/python3.10/site-packages/torch/nn/modules/module.py", line 905, in <lambda> return self._apply(lambda t: t.cuda(device)) File "/home/user/.local/lib/python3.10/site-packages/torch/cuda/__init__.py", line 247, in _lazy_init torch._C._cuda_init() RuntimeError: Found no NVIDIA driver on your system. Please check that you have an NVIDIA GPU and installed a driver from http://www.nvidia.com/Download/index.aspx
Container logs:
Fetching error logs...