Update Transformers.js config to use fp16 kv cache for q4f16 model
#3 opened about 16 hours ago by Xenova
Shape incompatibility when `past_key_values` are made persistent for context retention
#2 opened 8 months ago by bekatan