How about using the vLLM framework instead of Infery-LLM?

#13
by youaaa - opened

Just wondering which one is faster.

Does anyone know? I'd really appreciate it.

Deci AI org

You can use vLLM, but performance will not be as good as with Infery-LLM. (DeciLM is documented in the vLLM README.)
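If you want to check the speed on your own hardware, here is a minimal sketch of a throughput measurement, not an official benchmark. The helper names (`tokens_per_second`, `make_vllm_generate`) and the model id `Deci/DeciLM-7B` are assumptions for illustration; the vLLM part uses the standard `LLM` / `SamplingParams` offline API and assumes your installed vLLM version supports the model.

```python
import time
from typing import Callable, List, Sequence


def tokens_per_second(
    generate: Callable[[Sequence[str]], List[int]],
    prompts: Sequence[str],
) -> float:
    """Time one call to `generate` and return generated tokens per second.

    `generate` is any callable that runs a batch of prompts and returns
    the number of generated tokens for each prompt.
    """
    start = time.perf_counter()
    counts = generate(prompts)
    elapsed = time.perf_counter() - start
    return sum(counts) / elapsed if elapsed > 0 else float("inf")


def make_vllm_generate(model_id: str = "Deci/DeciLM-7B", max_tokens: int = 128):
    """Build a `generate` callable backed by vLLM (hypothetical helper).

    The model id is an assumption; replace it with the checkpoint you use.
    """
    # Imported lazily so the timing helper works without vLLM installed.
    from vllm import LLM, SamplingParams

    llm = LLM(model=model_id, trust_remote_code=True)
    params = SamplingParams(temperature=0.0, max_tokens=max_tokens)

    def generate(prompts: Sequence[str]) -> List[int]:
        outputs = llm.generate(list(prompts), params)
        # Count only the generated tokens of the first completion per prompt.
        return [len(o.outputs[0].token_ids) for o in outputs]

    return generate
```

To use it, call `tokens_per_second(make_vllm_generate(), prompts)` with a list of your own prompts, then time another backend by wrapping it in a callable with the same shape. Run a warm-up batch first, since the first call usually includes one-time setup cost.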
