Upload README.md with huggingface_hub
README.md CHANGED
@@ -50,23 +50,6 @@ from dual_attention.hf import DualAttnTransformerLM_HFHub
 DualAttnTransformerLM_HFHub.from_pretrained('awni00/DAT-sa8-ra8-nr32-ns1024-sh8-nkvh4-343M')
 ```
 
-Alternatively, you can download the pytorch checkpoint containing the state dict.
-
-To download the PyTorch checkpoint, run:
-```wget https://huggingface.co/awni00/DAT-sa8-ra8-nr32-ns1024-sh8-nkvh4-343M/resolve/main/pytorch_checkpoint.pt```
-
-Then, you can load model weights via:
-```
-from dual_attention.language_models import DualAttnTransformerLM
-
-ckpt = torch.load(ckpt_path)
-model_config = ckpt['config']
-model_state_dict = ckpt['model']
-
-model = DualAttnTransformerLM(**model_config)
-model.load_state_dict(model_state_dict)
-```
-
 ## Training Details
 
 The model was trained using the following setup:
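The lines removed by this commit described a common PyTorch checkpoint layout: a dict with a `'config'` key (constructor kwargs) and a `'model'` key (the state dict). A minimal self-contained sketch of that pattern, using a hypothetical stand-in module rather than the repo's `DualAttnTransformerLM`:

```python
import torch
import torch.nn as nn

# Stand-in model; the actual repo uses DualAttnTransformerLM from
# dual_attention.language_models, whose constructor takes the saved config.
class TinyLM(nn.Module):
    def __init__(self, vocab_size, d_model):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, d_model)
        self.head = nn.Linear(d_model, vocab_size)

    def forward(self, x):
        return self.head(self.embed(x))

# Save a checkpoint in the {'config': ..., 'model': ...} layout from the diff.
config = {'vocab_size': 100, 'd_model': 16}
model = TinyLM(**config)
torch.save({'config': config, 'model': model.state_dict()}, 'ckpt.pt')

# Reload: rebuild the model from its saved config, then load the state dict.
ckpt = torch.load('ckpt.pt')
restored = TinyLM(**ckpt['config'])
restored.load_state_dict(ckpt['model'])
```

Storing the config alongside the weights lets a loader reconstruct the architecture without hard-coding hyperparameters, which is why the removed snippet read the config out of the checkpoint before instantiating the model.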