minhtriphan commited on
Commit
cda4b61
·
1 Parent(s): b51dd50

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +5 -0
README.md CHANGED
@@ -41,6 +41,11 @@ https://github.com/minhtriphan/LongFinBERT-base/tree/main
41
  * The masking probability is 15%;
42
  * Details about the training configuration are given in the log file named `train_v1a_0803_1144_seed_1.log`;
43
 
 
 
 
 
 
44
  # Instruction to load the pre-trained model
45
  * Clone the git repo
46
  ```
 
41
  * The masking probability is 15%;
42
  * Details about the training configuration are given in the log file named `train_v1a_0803_1144_seed_1.log`;
43
 
44
+ # Versions
45
+ There are 2 versions of the pre-trained model,
46
+ * v1 - Random Masking: We randomly choose tokens to mask in the MLM task;
47
+ * v2 - Selective Masking: As we want the model to learn more about the financial context, we selectively choose tokens to mask in the MLM task. We rely on the Loughran-McDonald dictionary to choose the important tokens to masked.
48
+
49
  # Instruction to load the pre-trained model
50
  * Clone the git repo
51
  ```