dylanAtHum
commited on
Commit
•
9bbc2d0
1
Parent(s):
e33bf52
Update Replication Instructions to Use Script to Load Pretrained Model
Browse files- Replication.txt +1 -1
- load_mosaic.py +23 -0
Replication.txt
CHANGED
@@ -22,7 +22,7 @@ In order to run the training process with our specific model, we need to make a
|
|
22 |
|
23 |
To alter the sentence-transformers library, clone the repository from https://github.com/UKPLab/sentence-transformers locally and replace the SentenceTransformer.py and Transformer.py files located within the sentence-transformers/sentence_transformers/ and sentence-transformers/sentence_transformers/models/ directories of the cloned repository, respectively, with those located inside dev/ folder. (This has already been done in this notebook instance, but this will have to be completed if training on another system.)
|
24 |
|
25 |
-
Before conducting actual training, we also need to clone the mosaic-bert-base-seqlen-2048 model locally and make a few small changes to its config.json file. Running
|
26 |
|
27 |
Training
|
28 |
|
|
|
22 |
|
23 |
To alter the sentence-transformers library, clone the repository from https://github.com/UKPLab/sentence-transformers locally and replace the SentenceTransformer.py and Transformer.py files located within the sentence-transformers/sentence_transformers/ and sentence-transformers/sentence_transformers/models/ directories of the cloned repository, respectively, with those located inside dev/ folder. (This has already been done in this notebook instance, but this will have to be completed if training on another system.)
|
24 |
|
25 |
+
Before conducting actual training, we also need to clone the mosaic-bert-base-seqlen-2048 model locally and make a few small changes to its config.json file. Running load_mosiac.py will execute this process and get the model ready to begin training. (Again, this has already been done in this notebook instance, but this will have to be completed if training on another system.)
|
26 |
|
27 |
Training
|
28 |
|
load_mosaic.py
ADDED
@@ -0,0 +1,23 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
from huggingface_hub import snapshot_download
|
2 |
+
import json
|
3 |
+
import os
|
4 |
+
|
5 |
+
REPO_ID = "mosaicml/mosaic-bert-base-seqlen-2048"
|
6 |
+
MODEL_DIRECTORY = "mosaic-bert-base-seqlen-2048"
|
7 |
+
|
8 |
+
|
9 |
+
def main():
|
10 |
+
snapshot_download(repo_id=REPO_ID, local_dir=MODEL_DIRECTORY)
|
11 |
+
|
12 |
+
# modify the model's config.json file to satisfy our requirements
|
13 |
+
config_file_path = os.path.join(MODEL_DIRECTORY, 'config.json')
|
14 |
+
contents = json.load(open(config_file_path))
|
15 |
+
contents['architectures'] = ['BertModel']
|
16 |
+
contents['auto_map']['AutoModel'] = 'bert_layers.BertModel'
|
17 |
+
contents['torch_dtype'] = 'bfloat16'
|
18 |
+
contents['transformers_version'] = '4.28.1'
|
19 |
+
contents['_name_or_path'] = 'mosaic-bert-base-seqlen-2048'
|
20 |
+
json.dump(contents, open(config_file_path, 'w'), ensure_ascii=True)
|
21 |
+
|
22 |
+
if __name__ == '__main__':
|
23 |
+
main()
|