indiejoseph
/

bert-base-cantonese

Generated from Trainer

Inference Endpoints

Model card Files Files and versions Metrics Training metrics Community

indiejoseph commited on Oct 13, 2023

Commit

c4d49d9

•

1 Parent(s): fc82bfc

Training in progress, step 500

Files changed (3) hide show

README.md +3 -9
pytorch_model.bin +1 -1
training_args.bin +1 -1

README.md CHANGED Viewed

@@ -2,15 +2,9 @@
 base_model: /notebooks/cantonese/bert-base-cantonese
 tags:
 - generated_from_trainer
-- Cantonese
-- bert
 model-index:
 - name: bert-base-cantonese
   results: []
-license: cc-by-4.0
-language:
-- yue
-pipeline_tag: fill-mask
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -18,11 +12,11 @@ should probably proofread and complete it, then remove this comment. -->
 # bert-base-cantonese
-This model is a continue pre-train version of [indiejoseph/cantonese/bert-base-cantonese](https://huggingface.co//notebooks/cantonese/bert-base-cantonese) on [indiejoseph/wikipedia-zh-yue-filtered](https://huggingface.co/datasets/indiejoseph/wikipedia-zh-yue-filtered).
 ## Model description
-This model has extended 500 more Chinese characters which very common in Cantonese, such as `冧`, `噉`, `麪`, `笪`, `冚`, `乸` etc, and continue pre-trained with [indiejoseph/wikipedia-zh-yue-filtered](https://huggingface.co/datasets/indiejoseph/wikipedia-zh-yue-filtered)
 ## Intended uses & limitations
@@ -56,4 +50,4 @@ The following hyperparameters were used during training:
 - Transformers 4.34.0.dev0
 - Pytorch 2.0.1+cu118
 - Datasets 2.14.5
-- Tokenizers 0.14.0

 base_model: /notebooks/cantonese/bert-base-cantonese
 tags:
 - generated_from_trainer
 model-index:
 - name: bert-base-cantonese
   results: []
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 # bert-base-cantonese
+This model is a fine-tuned version of [/notebooks/cantonese/bert-base-cantonese](https://huggingface.co//notebooks/cantonese/bert-base-cantonese) on an unknown dataset.
 ## Model description
+More information needed
 ## Intended uses & limitations
 - Transformers 4.34.0.dev0
 - Pytorch 2.0.1+cu118
 - Datasets 2.14.5
+- Tokenizers 0.14.0

pytorch_model.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:2b855ecef316f5c2d8ff1333c9950d277252996423e95c50c31e770e37289e97
 size 410768181

 version https://git-lfs.github.com/spec/v1
+oid sha256:7e4705d76f02e1507ede29f729f8fb0af0cfe3d6f317a00275f9d611583b9750
 size 410768181

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:2f1239a0f6d2b529a0df86cc5f54fb2e5bc3bafc4050c71059aaa75c1d1c59e6
 size 4091

 version https://git-lfs.github.com/spec/v1
+oid sha256:89a8cf157dbdebd450f7bd93a32210167bee67f38f72531bf287d84dd08eab53
 size 4091