Running 1.79k 1.79k The Ultra-Scale Playbook π The ultimate guide to training LLM on large GPU Clusters
view article Article A Gentle Introduction to 8-bit Matrix Multiplication for transformers at scale using transformers, accelerate and bitsandbytes Aug 17, 2022 β’ 75