NeMo
nvidia
Nemotron-4-340B-Base / model_weights / model.decoder.layers.self_attention.linear_qkv._extra_state
jiaqiz
Add files using large-upload tool
91d516f verified