Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
nvidia
/
Llama-3_1-Nemotron-51B-Instruct
like
202
Follow
NVIDIA
8.59k
Text Generation
Transformers
Safetensors
PyTorch
English
nemotron-nas
nvidia
llama-3
conversational
custom_code
arxiv:
4 papers
License:
nvidia-open-model-license
Model card
Files
Files and versions
Community
24
Train
Use this model
Patching hf bug that creates wrong cache length if only inputs_embeds are passed to the model
#18
by
tomer-nv
- opened
Oct 13, 2024
base:
refs/heads/main
←
from:
refs/pr/18
Discussion
Files changed
+19
-0
tomer-nv
NVIDIA org
Oct 13, 2024
No description provided.
Patching hf bug that creates wrong cache length if only inputs_embeds are passed to the model
d71a214b
tomer-nv
changed pull request status to
closed
Oct 13, 2024
Edit
Preview
Upload images, audio, and videos by dragging in the text input, pasting, or
clicking here
.
Tap or paste here to upload images
Comment
·
Sign up
or
log in
to comment