indiejoseph committed on
Commit c4d49d9
1 Parent(s): fc82bfc

Training in progress, step 500

Files changed (3)
  1. README.md +3 -9
  2. pytorch_model.bin +1 -1
  3. training_args.bin +1 -1
README.md CHANGED
@@ -2,15 +2,9 @@
  base_model: /notebooks/cantonese/bert-base-cantonese
  tags:
  - generated_from_trainer
- - Cantonese
- - bert
  model-index:
  - name: bert-base-cantonese
  results: []
- license: cc-by-4.0
- language:
- - yue
- pipeline_tag: fill-mask
  ---
 
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -18,11 +12,11 @@ should probably proofread and complete it, then remove this comment. -->
 
  # bert-base-cantonese
 
- This model is a continue pre-train version of [indiejoseph/cantonese/bert-base-cantonese](https://huggingface.co//notebooks/cantonese/bert-base-cantonese) on [indiejoseph/wikipedia-zh-yue-filtered](https://huggingface.co/datasets/indiejoseph/wikipedia-zh-yue-filtered).
+ This model is a fine-tuned version of [/notebooks/cantonese/bert-base-cantonese](https://huggingface.co//notebooks/cantonese/bert-base-cantonese) on an unknown dataset.
 
  ## Model description
 
- This model has extended 500 more Chinese characters which very common in Cantonese, such as `冧`, `噉`, `麪`, `笪`, `冚`, `乸` etc, and continue pre-trained with [indiejoseph/wikipedia-zh-yue-filtered](https://huggingface.co/datasets/indiejoseph/wikipedia-zh-yue-filtered)
+ More information needed
 
  ## Intended uses & limitations
 
@@ -56,4 +50,4 @@ The following hyperparameters were used during training:
  - Transformers 4.34.0.dev0
  - Pytorch 2.0.1+cu118
  - Datasets 2.14.5
- - Tokenizers 0.14.0
+ - Tokenizers 0.14.0
pytorch_model.bin CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:2b855ecef316f5c2d8ff1333c9950d277252996423e95c50c31e770e37289e97
+ oid sha256:7e4705d76f02e1507ede29f729f8fb0af0cfe3d6f317a00275f9d611583b9750
  size 410768181
training_args.bin CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:2f1239a0f6d2b529a0df86cc5f54fb2e5bc3bafc4050c71059aaa75c1d1c59e6
+ oid sha256:89a8cf157dbdebd450f7bd93a32210167bee67f38f72531bf287d84dd08eab53
  size 4091
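
Note on the two `.bin` diffs: both files are stored as Git LFS pointers, so the commit only changes the `oid` line while `size` stays the same. A minimal sketch of how a downloaded file could be checked against such a pointer follows. The helper names `parse_lfs_pointer` and `matches_pointer` are hypothetical and not part of any Git LFS tooling, and the sketch assumes the pointer has exactly the three fields shown in this commit.

```python
import hashlib


def parse_lfs_pointer(text: str) -> dict:
    """Parse the key/value lines of a Git LFS pointer file (hypothetical helper)."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    # The oid field has the form "<algorithm>:<hex digest>".
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(data: bytes, pointer: dict) -> bool:
    """Return True if data has the size and SHA-256 digest recorded in the pointer."""
    if pointer["algo"] != "sha256":
        raise ValueError("this sketch only handles sha256 oids")
    return (
        len(data) == pointer["size"]
        and hashlib.sha256(data).hexdigest() == pointer["digest"]
    )


# New pointer for training_args.bin, copied from the diff in this commit.
new_pointer = parse_lfs_pointer("""
version https://git-lfs.github.com/spec/v1
oid sha256:89a8cf157dbdebd450f7bd93a32210167bee67f38f72531bf287d84dd08eab53
size 4091
""")
```

With the real file on disk, `matches_pointer(open("training_args.bin", "rb").read(), new_pointer)` would be expected to return `True` for this commit's version and `False` for the parent's.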