Add transformers install instructions
README.md
CHANGED
@@ -48,6 +48,23 @@ Llama-3.1-Minitron-4B-Width-Base uses a model embedding size of 4096, 32 attenti
## Usage

Pull requests to support this model in Hugging Face Transformers are currently under review ([#32495](https://github.com/huggingface/transformers/pull/32495) and [#32502](https://github.com/huggingface/transformers/pull/32502)) and are expected to be merged soon. In the meantime, please follow the installation instructions below:

```shell
# Fetch PR 32502
$ git clone -b suhara/llama-kv-channels --single-branch https://github.com/suhara/transformers.git && cd transformers

# Fetch changes from PR 32495
$ git fetch https://github.com/suiyoubi/transformers.git aot/head_dim_rope && git cherry-pick FETCH_HEAD --strategy-option theirs

# Install transformers
$ pip install -e .
```

We can now run inference on this model:

```python
import torch
```
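The inference snippet in this excerpt is cut off after `import torch`. As a minimal sketch of what a Transformers generation call typically looks like (the repo id `nvidia/Llama-3.1-Minitron-4B-Width-Base` is assumed from the model name and is not shown in this excerpt; the prompt and `generate` helper are illustrative, not from the README):

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

# Assumed repo id, inferred from the model name in this README.
MODEL_ID = "nvidia/Llama-3.1-Minitron-4B-Width-Base"

def generate(prompt: str, max_new_tokens: int = 64) -> str:
    """Tokenize a prompt and return the decoded model continuation."""
    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    device = "cuda" if torch.cuda.is_available() else "cpu"
    model = AutoModelForCausalLM.from_pretrained(
        MODEL_ID, torch_dtype=torch.bfloat16
    ).to(device)
    inputs = tokenizer(prompt, return_tensors="pt").to(device)
    with torch.no_grad():
        output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)

if __name__ == "__main__":
    print(generate("The capital of France is"))
```

Note that this only works after installing the patched `transformers` build from the steps above, since the stock release does not yet recognize this model's config.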