streamlit transformers torch bitsandbytes optimum accelerate auto-gptq