Update Transformers.js config to use fp16 kv cache for q4f16 model
#3 opened 12 days ago by Xenova

Shape incompatibility when `past_key_values` are made persistent for context retention
#2 opened 8 months ago by bekatan