I have made an api using the wizardcoder llm for evaluating codes. I am facing an issue as the model is taking more than 2mins generate a response.
Does anyone have a solution to reduce the time for generating a response?
· Sign up or log in to comment