Llama-2-7b-coreml / generation-cache-processor.mlmodelc

Commit History

Update Sonoma model with faster 8x8 conv and split einsum attention
dba673f

smpanaro commited on

Add model
a76a14d

smpanaro commited on