bitsandbytes transformers peft accelerate trl datasets==2.16.0 wandb torch torchvision