justin13barrett commited on
Commit
97e7fe6
1 Parent(s): 983f956

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +13 -10
README.md CHANGED
@@ -8,25 +8,28 @@ model-index:
8
  results: []
9
  ---
10
 
11
- <!-- This model card has been generated automatically according to the information Keras had access to. You should
12
- probably proofread and complete it, then remove this comment. -->
13
 
14
  # bert-base-multilingual-cased-finetuned-openalex-topic-classification-title-abstract
15
 
16
- This model is a fine-tuned version of [bert-base-multilingual-cased](https://huggingface.co/bert-base-multilingual-cased) on an unknown dataset.
17
- It achieves the following results on the evaluation set:
18
- - Train Loss: 2.4942
19
- - Validation Loss: 3.0737
20
- - Validation Accuracy: 0.4846
21
- - Epoch: 7
22
 
23
  ## Model description
24
 
25
- More information needed
 
 
 
 
26
 
27
  ## Intended uses & limitations
28
 
29
- More information needed
 
 
30
 
31
  ## Training and evaluation data
32
 
 
8
  results: []
9
  ---
10
 
 
 
11
 
12
  # bert-base-multilingual-cased-finetuned-openalex-topic-classification-title-abstract
13
 
14
+ This model is a fine-tuned version of [bert-base-multilingual-cased](https://huggingface.co/bert-base-multilingual-cased) on a labeled dataset provided by CWTS:
15
+ [CWTS Labeled Data].
16
+
17
+ This is NOT the full model being used to tag OpenAlex works with a topic. For that, check out the following github repo:
18
+ [OpenAlex Topic Classification](https://github.com/ourresearch/openalex-topic-classification)
 
19
 
20
  ## Model description
21
 
22
+ The input data is expected to be in the following format:
23
+
24
+ "<TITLE> {insert-processed-title-here}\n<ABSTRACT> {insert-processed-abstract-here}"
25
+
26
+ Since this was train
27
 
28
  ## Intended uses & limitations
29
 
30
+ The model is intended to be used as part of a larger model that also incorporates journal information and citation features. However, this model is good if you want to use it for quickly generating a topic based only on a title/abstract.
31
+
32
+ Since this model was fine-tuned on a BERT model, all of the biases seen in that model will most likely show up in this model as well.
33
 
34
  ## Training and evaluation data
35