minhtriphan
/

LongFinBERT-base


minhtriphan committed on Sep 8, 2023

Commit cda4b61 · 1 Parent(s): b51dd50

Update README.md

Files changed (1)

README.md +5 -0


@@ -41,6 +41,11 @@ https://github.com/minhtriphan/LongFinBERT-base/tree/main
 * The masking probability is 15%;
 * Details about the training configuration are given in the log file named `train_v1a_0803_1144_seed_1.log`;
 # Instruction to load the pre-trained model
 * Clone the git repo
 ```

 * The masking probability is 15%;
 * Details about the training configuration are given in the log file named `train_v1a_0803_1144_seed_1.log`;
+# Versions
+There are two versions of the pre-trained model:
+* v1 - Random Masking: the tokens to mask in the MLM task are chosen at random;
+* v2 - Selective Masking: because we want the model to learn more about the financial context, the tokens to mask in the MLM task are chosen selectively. We rely on the Loughran-McDonald dictionary to identify the important tokens to mask.
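The difference between the two versions can be illustrated with a minimal sketch. It is not the training code from the repository: the names `IMPORTANT_WORDS`, `mask_budget`, `random_masking`, and `selective_masking` are hypothetical, the word set is a tiny stand-in for the Loughran-McDonald dictionary, and the rule of masking dictionary tokens first and filling any remaining budget at random is an assumption, since the README does not state how the 15% budget is allocated.

```python
import random

MASK = "[MASK]"
# Hypothetical stand-in for the Loughran-McDonald word lists (illustrative only).
IMPORTANT_WORDS = {"loss", "impairment", "litigation", "restated"}


def mask_budget(n_tokens, prob=0.15):
    """Number of tokens to mask: about `prob` of the sequence, at least one if non-empty."""
    return min(n_tokens, max(1, round(prob * n_tokens)))


def random_masking(tokens, prob=0.15, rng=None):
    """v1-style sketch: positions to mask are drawn uniformly at random."""
    rng = rng or random.Random()
    chosen = set(rng.sample(range(len(tokens)), mask_budget(len(tokens), prob)))
    return [MASK if i in chosen else tok for i, tok in enumerate(tokens)]


def selective_masking(tokens, important=IMPORTANT_WORDS, prob=0.15, rng=None):
    """v2-style sketch: dictionary tokens are masked first (assumed allocation rule)."""
    rng = rng or random.Random()
    budget = mask_budget(len(tokens), prob)
    hits = [i for i, tok in enumerate(tokens) if tok.lower() in important]
    rest = [i for i, tok in enumerate(tokens) if tok.lower() not in important]
    rng.shuffle(hits)
    rng.shuffle(rest)
    # Dictionary positions come first, so they fill the budget before any other token.
    chosen = set((hits + rest)[:budget])
    return [MASK if i in chosen else tok for i, tok in enumerate(tokens)]


tokens = "the company reported a loss and an impairment charge this quarter".split()
print(random_masking(tokens, rng=random.Random(0)))
print(selective_masking(tokens, rng=random.Random(0)))
```

In this toy sentence the budget is two tokens, so the selective variant masks exactly `loss` and `impairment`, while the random variant may mask any two positions.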
 # Instruction to load the pre-trained model
 * Clone the git repo
 ```