sentence-transformers
openai
bitsandbytes
transformers
peft
accelerate
llama-cpp-python
flagai
bminf
auto-gptq
einops