Update usage instructions
README.md
CHANGED
````diff
@@ -48,21 +48,9 @@ Llama-3.1-Minitron-4B-Width-Base uses a model embedding size of 3072, 32 attenti
 
 
 ## Usage
-Pull requests
-to support this model in Hugging Face Transformers are currently under review
-([#32495](https://github.com/huggingface/transformers/pull/32495) and [#32502](https://github.com/huggingface/transformers/pull/32502))
-and are expected to be merged soon. In the meantime,
-please follow the installation instructions below:
-
+Support for this model will be added in the upcoming `transformers` release. In the meantime, please install the library from source:
 ```
-
-$ git clone -b suhara/llama-kv-channels --single-branch https://github.com/suhara/transformers.git && cd transformers
-
-# Fetch changes from PR 32495
-$ git fetch https://github.com/suiyoubi/transformers.git aot/head_dim_rope && git cherry-pick FETCH_HEAD --strategy-option theirs
-
-# Install transformers
-$ pip install -e .
+pip install git+https://github.com/huggingface/transformers
 ```
 We can now run inference on this model:
 
````
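Since `pip install git+…` builds from the current `main` branch rather than a tagged release, a quick sanity check that the development build is active can help; this is a sketch, and the exact version string will vary (source builds typically carry a `.dev0` suffix):

```python
import transformers

# Builds installed from source typically report a ".dev0" version suffix,
# e.g. "4.45.0.dev0", rather than a plain release version.
print(transformers.__version__)
```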
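The inference snippet itself sits outside this hunk. A minimal sketch of what running the model with the standard `transformers` text-generation API looks like; the repository id prefix (`nvidia/`), the dtype, and the generation settings below are assumptions for illustration, not taken from the card:

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

# Assumed repository id: the card is for Llama-3.1-Minitron-4B-Width-Base,
# and the "nvidia/" org prefix is an assumption here.
model_id = "nvidia/Llama-3.1-Minitron-4B-Width-Base"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,  # assumed dtype; adjust to your hardware
    device_map="auto",           # requires the `accelerate` package
)

# Illustrative prompt and generation settings.
prompt = "Complete the paragraph: our solar system is"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=64)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```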