"RuntimeError: probability tensor contains either `inf`, `nan` or element < 0" when running in multi-gpu

#53
by greeksharifa - opened

If I run the code like...

(...)
outputs = model.generate(**inputs, max_new_tokens=30)

Then this erorr occurs:

RuntimeError: probability tensor contains either `inf`, `nan` or element < 0

Environments:

# python 3.10
# 6 x A6000 GPUs
transformers==4.45.2
torch==2.4.1
torchaudio==2.4.1
torchvision==0.19.1
accelerate==1.0.0

Question) What is the recommended CUDA version? I used CUDA 12.2 or 11.8.

seeing the same issue. Did you figure it out?

Sign up or log in to comment