Can any of these quantizations run on GPUs? Specifically A100s: I have access to 18, spread across 9 nodes of 2 each, and can create a Ray cluster.