aws-neuron/optimum-neuron-cache
AWS Inferentia and Trainium
License: apache-2.0
f74222d
optimum-neuron-cache/inference-cache-config
Commit History
Remove SalesForce embedding model
1cd13f9
verified
dacorvo
HF staff
committed on
Mar 25
Add Zephyr to mistral variants
9164704
verified
dacorvo
HF staff
committed on
Mar 21
Remove variants from main mistral config
ef07aca
verified
dacorvo
HF staff
committed on
Mar 21
Add mistral most popular variants
d3983e8
verified
dacorvo
HF staff
committed on
Mar 21
Add most popular llama variants
594abb2
verified
dacorvo
HF staff
committed on
Mar 21
Added teknium/OpenHermes-2.5-Mistral-7B
1518247
verified
dacorvo
HF staff
committed on
Mar 8
Added Llama-70b batch_size 4 to inference cache
593822e
verified
dacorvo
HF staff
committed on
Mar 8
Create mistral.json
b5d0afd
verified
philschmid
HF staff
committed on
Mar 5
Create gpt2.json
3bdb891
verified
philschmid
HF staff
committed on
Mar 5
Create inference-cache-config/llama.json
1960ccb
verified
philschmid
HF staff
committed on
Mar 5