Compute Instance Requirement

#28
by iammano - opened

Hi there,

I was trying to build agent agent-based application by using llama3.1 models and it is on AWS EC2. I need suggestions of which instance should I opt for which will be capable of running the models cost-effectively.

I explored the GPU requirement of the model from the hugging face blog, here
https://huggingface.co./blog/llama31#whats-new-with-llama-31

But I'm still sceptical about choosing which instance type should I go for.

Thanks for your idea and for taking the time to reply to this topic.

Sign up or log in to comment