Commit · cda4b61
1 Parent(s): b51dd50
Update README.md
README.md
CHANGED
@@ -41,6 +41,11 @@ https://github.com/minhtriphan/LongFinBERT-base/tree/main
41 | * The masking probability is 15%;
42 | * Details about the training configuration are given in the log file named `train_v1a_0803_1144_seed_1.log`;
43 |
44 | # Instruction to load the pre-trained model
45 | * Clone the git repo
46 | ```

41 | * The masking probability is 15%;
42 | * Details about the training configuration are given in the log file named `train_v1a_0803_1144_seed_1.log`;
43 |
44 | + # Versions
45 | + There are two versions of the pre-trained model:
46 | + * v1 - Random Masking: We randomly choose tokens to mask in the MLM task;
47 | + * v2 - Selective Masking: Because we want the model to learn more about the financial context, we selectively choose tokens to mask in the MLM task. We rely on the Loughran-McDonald dictionary to choose the important tokens to mask.
48 | +
49 | # Instruction to load the pre-trained model
50 | * Clone the git repo
51 | ```
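
The difference between the two versions can be illustrated with a minimal sketch. It is not the repository's training code: the README does not say how selective masking is implemented, so the function names, the masking budget, the dictionary-first selection rule, and the tiny word set below are all assumptions made for illustration. The sketch also skips the usual BERT detail of sometimes substituting a random or unchanged token instead of `[MASK]`.

```python
import random

MASK = "[MASK]"

def random_mask(tokens, prob=0.15, rng=None):
    """v1-style sketch: mask each token independently with probability `prob`."""
    rng = rng or random.Random()
    return [MASK if rng.random() < prob else tok for tok in tokens]

def selective_mask(tokens, important, prob=0.15, rng=None):
    """v2-style sketch (assumed rule): spend the masking budget on
    dictionary words first, then fill any remainder with random tokens."""
    rng = rng or random.Random()
    # Assumed budget: about `prob` of the sequence, at least one token.
    budget = max(1, round(prob * len(tokens)))
    hits = [i for i, t in enumerate(tokens) if t.lower() in important]
    others = [i for i, t in enumerate(tokens) if t.lower() not in important]
    rng.shuffle(hits)
    rng.shuffle(others)
    chosen = set((hits + others)[:budget])
    return [MASK if i in chosen else t for i, t in enumerate(tokens)]

# Hypothetical stand-in for the Loughran-McDonald word lists.
important_words = {"loss", "litigation", "impairment"}
sentence = "the firm reported a net loss this quarter".split()
print(selective_mask(sentence, important_words))
```

With eight tokens the assumed budget is one token, so the single dictionary word in the sentence is the one that gets masked, whereas `random_mask` could pick any token or none.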