optimum-neuron-cache / inference-cache-config

Commit History

Add Mistral-v2
20e585f
verified

dacorvo HF staff committed on

Create stable-diffusion.json (#43)
32561fe
verified

philschmid HF staff and Jingya HF staff committed on

Remove SalesForce embedding model
1cd13f9
verified

dacorvo HF staff committed on

Add Zephyr to mistral variants
9164704
verified

dacorvo HF staff committed on

Remove variants from main mistral config
ef07aca
verified

dacorvo HF staff committed on

Add mistral most popular variants
d3983e8
verified

dacorvo HF staff committed on

Add most popular llama variants
594abb2
verified

dacorvo HF staff committed on

Added teknium/OpenHermes-2.5-Mistral-7B
1518247
verified

dacorvo HF staff committed on

Added Llama-70b batch_size 4 to inference cache
593822e
verified

dacorvo HF staff committed on

Create mistral.json
b5d0afd
verified

philschmid HF staff committed on

Create gpt2.json
3bdb891
verified

philschmid HF staff committed on

Create inference-cache-config/llama.json
1960ccb
verified

philschmid HF staff committed on