updating n_kv_heads in config.json

by fwtan - opened Nov 30, 2023

Nov 30, 2023

•

edited Nov 30, 2023

In config.json, it seems n_kv_heads was set to be the default value (32), which does not align with the actual setting (i.e. 24). The same case for other Sheared Llama models.
This may lead to an error when using transformers with a version > 4.28.

SinanAkkoyun

Dec 4, 2023

I just saw, shouldn't it be 16 like in the base shear?

fwtan

Dec 7, 2023

I just saw, shouldn't it be 16 like in the base shear?

Yes, you're correct! It should be 16 for this model. Transformers <= 4.28 uses n_kv_heads == n_heads by default.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment