Just wondering: which is faster?
Does anyone know? I'd really appreciate it.
You can use vLLM, but performance will not be as good as with Infery-LLM. (DeciLM is documented in the vLLM README.)
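If you want to check the difference on your own hardware, here is a rough sketch of a throughput measurement. The helper names (`measure_throughput`, `make_vllm_generate`) are made up for illustration, and the model ID `Deci/DeciLM-7B` and the `max_tokens` value are assumptions, since the thread does not say which DeciLM checkpoint is meant. Only the vLLM side is shown. An Infery-LLM adapter would need to return per-prompt token counts in the same way, but its API is not described here, so it is left out.

```python
import time
from typing import Callable, List, Sequence


def measure_throughput(generate: Callable[[Sequence[str]], List[int]],
                       prompts: Sequence[str]) -> float:
    """Return generated tokens per second for one batched call.

    `generate` is a hypothetical adapter: it takes the prompts and returns
    the number of tokens generated for each prompt.
    """
    start = time.perf_counter()
    token_counts = generate(prompts)
    elapsed = time.perf_counter() - start
    return sum(token_counts) / elapsed if elapsed > 0 else float("inf")


def make_vllm_generate(model_name: str = "Deci/DeciLM-7B",  # assumed model ID
                       max_tokens: int = 128):              # assumed length cap
    """Build an adapter around vLLM (needs vLLM installed and a GPU)."""
    # Imported lazily so the timing helper works without vLLM installed.
    from vllm import LLM, SamplingParams

    llm = LLM(model=model_name, trust_remote_code=True)
    params = SamplingParams(temperature=0.0, max_tokens=max_tokens)

    def generate(prompts: Sequence[str]) -> List[int]:
        outputs = llm.generate(list(prompts), params)
        # Count the tokens of the first completion for each prompt.
        return [len(out.outputs[0].token_ids) for out in outputs]

    return generate
```

Usage would look something like `measure_throughput(make_vllm_generate(), ["Hello, my name is"] * 32)`. Run the same prompts and the same `max_tokens` through each backend, and do one warm-up call first so model loading is not counted in the timing.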