z-dickson committed on
Commit
cdb5e0a
1 Parent(s): f947c93
Files changed (3)
  1. README.md +41 -18
  2. config.json +1 -2
  3. tf_model.h5 +3 -0
README.md CHANGED
@@ -1,34 +1,57 @@
  ---
- license: afl-3.0
-
- widget:
- - text: "as evidence mounts showing that covid leaked from a lab, it’s become clear fauci was funding coronavirus research in wuhan. and he lied about it. he’s lied about lockdowns, he’s lied about masks, and now we know he’s lied about the origins of the virus. https://nypost.com/2021/09/08/covering-up-faucis-role-in-covid-leak-devine/"
- example_title: "example 1"
- - text: "kansas small businesses are getting hit hard by the covid-19 outbreak and response. if you or someone you know needs guidance and resources, please use this link: https://www.sba.gov/page/coronavirus-covid-19-small-business-guidance-loan-resources #ksleg"
- example_title: "example 2"
-
  ---

- This model is trained to classify US legislators' tweets about COVID-19 as skeptical of preventative measures or not. The model is fine-tuned on 10k hand coded tweets using vinai/bertweet-covid19-base-uncased (https://github.com/VinAIResearch/BERTweet).

- The model is uncased (all text is lowercase). Model was trained with full tweets, including URLs and @USER mentions



- sparse_categorical_accuracy: 0.9618


- Sklearn validation metrics:

- Precision: 0.927190139237617

- Recall: 0.969193742478941

- F1 score: 0.9465885064563841


- Epoch 1/3 - loss: 0.4504 - sparse_categorical_accuracy: 0.8577 - val_loss: 0.3473 - val_sparse_categorical_accuracy: 0.8706

- Epoch 2/3 - loss: 0.3309 - sparse_categorical_accuracy: 0.8593 - val_loss: 0.2395 - val_sparse_categorical_accuracy: 0.8744

- Epoch 3/3 - loss: 0.1862 - sparse_categorical_accuracy: 0.9439 - val_loss: 0.1348 - val_sparse_categorical_accuracy: 0.9618
 
  ---
+ tags:
+ - generated_from_keras_callback
+ model-index:
+ - name: US_politicians_covid_skepticism
+ results: []
  ---

+ <!-- This model card has been generated automatically according to the information Keras had access to. You should
+ probably proofread and complete it, then remove this comment. -->

+ # US_politicians_covid_skepticism

+ This model is a fine-tuned version of [vinai/bertweet-covid19-base-uncased](https://huggingface.co/vinai/bertweet-covid19-base-uncased) on an unknown dataset.
+ It achieves the following results on the evaluation set:
+ - Train Loss: 0.1745
+ - Train Sparse Categorical Accuracy: 0.9680
+ - Validation Loss: 0.1170
+ - Validation Sparse Categorical Accuracy: 0.9814
+ - Epoch: 3

+ ## Model description

+ More information needed

+ ## Intended uses & limitations

+ More information needed

+ ## Training and evaluation data

+ More information needed

+ ## Training procedure

+ ### Training hyperparameters

+ The following hyperparameters were used during training:
+ - optimizer: {'name': 'Adam', 'learning_rate': 5e-07, 'decay': 0.0, 'beta_1': 0.9, 'beta_2': 0.999, 'epsilon': 1e-07, 'amsgrad': False}
+ - training_precision: float32

+ ### Training results

+ | Train Loss | Train Sparse Categorical Accuracy | Validation Loss | Validation Sparse Categorical Accuracy | Epoch |
+ |:----------:|:---------------------------------:|:---------------:|:--------------------------------------:|:-----:|
+ | 0.6242 | 0.6733 | 0.5466 | 0.6911 | 0 |
+ | 0.4520 | 0.8283 | 0.2888 | 0.9628 | 1 |
+ | 0.2561 | 0.9587 | 0.1626 | 0.9783 | 2 |
+ | 0.1745 | 0.9680 | 0.1170 | 0.9814 | 3 |
+
+
+ ### Framework versions
+
+ - Transformers 4.20.1
+ - TensorFlow 2.8.2
+ - Datasets 2.3.2
+ - Tokenizers 0.12.1
config.json CHANGED
@@ -1,5 +1,5 @@
  {
- "_name_or_path": "/content/drive/MyDrive/NLP_models/covid_skepticism_tweets-bert-uncased.t5",
  "architectures": [
  "RobertaForSequenceClassification"
  ],
@@ -21,7 +21,6 @@
  "pad_token_id": 1,
  "position_embedding_type": "absolute",
  "tokenizer_class": "BertweetTokenizer",
- "torch_dtype": "float32",
  "transformers_version": "4.20.1",
  "type_vocab_size": 1,
  "use_cache": true,
 
  {
+ "_name_or_path": "vinai/bertweet-covid19-base-uncased",
  "architectures": [
  "RobertaForSequenceClassification"
  ],

  "pad_token_id": 1,
  "position_embedding_type": "absolute",
  "tokenizer_class": "BertweetTokenizer",
  "transformers_version": "4.20.1",
  "type_vocab_size": 1,
  "use_cache": true,
tf_model.h5 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:9e7285a5bc51ebef61b437ca087df4065c1e1ebb7f4387865f4201981d839c57
+ size 539895680
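
The three lines added for tf_model.h5 are a Git LFS pointer, not the weights themselves: they record the spec version, the SHA-256 object ID, and the size in bytes of the real file. As a minimal sketch, the pointer can be read with the Python standard library alone. The helper name `parse_lfs_pointer` and the output field names are hypothetical, chosen for illustration and not part of any library.

```python
# Pointer contents copied from the tf_model.h5 diff above.
POINTER_TEXT = """\
version https://git-lfs.github.com/spec/v1
oid sha256:9e7285a5bc51ebef61b437ca087df4065c1e1ebb7f4387865f4201981d839c57
size 539895680
"""


def parse_lfs_pointer(text: str) -> dict:
    """Hypothetical helper: split each 'key value' line of an LFS pointer."""
    fields = {}
    for line in text.splitlines():
        if not line.strip():
            continue  # skip blank lines
        key, _, value = line.partition(" ")
        fields[key] = value
    # The oid is written as '<hash method>:<hex digest>'.
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": algo,
        "digest": digest,
        "size_bytes": int(fields["size"]),
    }


pointer = parse_lfs_pointer(POINTER_TEXT)
print(pointer["hash_algo"], pointer["size_bytes"])
print(f"{pointer['size_bytes'] / 2**20:.1f} MiB")
```

The size field shows that the actual TensorFlow weights file is roughly 515 MiB, which is why only the pointer appears in the diff.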