NeMo
Nemotron-4-340B-Instruct / model_weights /model.decoder.layers.self_attention.linear_qkv._extra_state
okuchaiev's picture
Add files using large-upload tool
44eaec7 verified