Thomas Lemberger commited on
Commit
fb6b7e9
1 Parent(s): 10b1efd

card updated

Browse files
Files changed (1) hide show
  1. README.md +2 -2
README.md CHANGED
@@ -51,9 +51,9 @@ The training was run on a NVIDIA DGX Station with 4XTesla V100 GPUs.
51
 
52
  Training code is available at https://github.com/source-data/soda-roberta
53
 
54
- - Command: `python -m tokcl.train /data/json/sd_panels NER --num_train_epochs=3.5`
55
  - Tokenizer vocab size: 50265
56
- - Training data: EMBO/biolang MLM
57
  - Training with 31410 examples.
58
  - Evaluating on 8861 examples.
59
  - Training on 15 features: O, I-SMALL_MOLECULE, B-SMALL_MOLECULE, I-GENEPROD, B-GENEPROD, I-SUBCELLULAR, B-SUBCELLULAR, I-CELL, B-CELL, I-TISSUE, B-TISSUE, I-ORGANISM, B-ORGANISM, I-EXP_ASSAY, B-EXP_ASSAY
51
 
52
  Training code is available at https://github.com/source-data/soda-roberta
53
 
54
+ - Command: `python -m tokcl.train NER --num_train_epochs=3.5`
55
  - Tokenizer vocab size: 50265
56
+ - Training data: EMBO/sd-nlp NER
57
  - Training with 31410 examples.
58
  - Evaluating on 8861 examples.
59
  - Training on 15 features: O, I-SMALL_MOLECULE, B-SMALL_MOLECULE, I-GENEPROD, B-GENEPROD, I-SUBCELLULAR, B-SUBCELLULAR, I-CELL, B-CELL, I-TISSUE, B-TISSUE, I-ORGANISM, B-ORGANISM, I-EXP_ASSAY, B-EXP_ASSAY