Llama-2-7b-coreml / sequoia /Llama-2-7b-hf_chunk12.mlmodelc

Commit History

Update sequoia mode with transposed value cache and 4:508 input:cache length
722eedf
verified

smpanaro commited on

Upload Sequoia model
f554427
verified

smpanaro commited on