hhou435 committed
Commit cf37f5a
1 Parent(s): bf05a6e
.gitattributes DELETED
@@ -1,8 +0,0 @@
- *.bin.* filter=lfs diff=lfs merge=lfs -text
- *.lfs.* filter=lfs diff=lfs merge=lfs -text
- *.bin filter=lfs diff=lfs merge=lfs -text
- *.h5 filter=lfs diff=lfs merge=lfs -text
- *.tflite filter=lfs diff=lfs merge=lfs -text
- *.tar.gz filter=lfs diff=lfs merge=lfs -text
- *.ot filter=lfs diff=lfs merge=lfs -text
- *.onnx filter=lfs diff=lfs merge=lfs -text
 
README.md DELETED
@@ -1,70 +0,0 @@
- ---
- language: zh
- widget: text: "中国的首都是[MASK]京。"
- thumbnail:
- tags:
- license:
- datasets:
- metrics:
- ---
-
- # MyModelName
-
- ## Model description
-
- ## Intended uses & limitations
-
- #### How to use
-
- You can use this model directly with a pipeline for masked language modeling:
-
- ```python
- >>> from transformers import pipeline
- >>> unmasker = pipeline('fill-mask', model='hhou435/chinese_roberta_L-2_H-128')
- >>> unmasker("中国的首都是[MASK]京。")
- [
-     {'sequence': '[CLS] 中 国 的 首 都 是 北 京 。 [SEP]',
-      'score': 0.9427323937416077,
-      'token': 1266,
-      'token_str': '北'},
-     {'sequence': '[CLS] 中 国 的 首 都 是 南 京 。 [SEP]',
-      'score': 0.029202355071902275,
-      'token': 1298,
-      'token_str': '南'},
-     {'sequence': '[CLS] 中 国 的 首 都 是 东 京 。 [SEP]',
-      'score': 0.00977553054690361,
-      'token': 691,
-      'token_str': '东'},
-     {'sequence': '[CLS] 中 国 的 首 都 是 葡 京 。 [SEP]',
-      'score': 0.00489805219694972,
-      'token': 5868,
-      'token_str': '葡'},
-     {'sequence': '[CLS] 中 国 的 首 都 是 新 京 。 [SEP]',
-      'score': 0.0027360401581972837,
-      'token': 3173,
-      'token_str': '新'}
- ]
-
- ```
-
- Here is how to use this model to get the features of a given text in PyTorch:
-
- ```python
- from transformers import BertTokenizer, BertModel
- tokenizer = BertTokenizer.from_pretrained('hhou435/chinese_roberta_L-2_H-128')
- model = BertModel.from_pretrained("hhou435/chinese_roberta_L-2_H-128")
- text = "用你喜欢的任何文本替换我。"
- encoded_input = tokenizer(text, return_tensors='pt')
- output = model(**encoded_input)
- ```
-
- and in TensorFlow:
-
- ```python
- from transformers import BertTokenizer, TFBertModel
- tokenizer = BertTokenizer.from_pretrained('hhou435/chinese_roberta_L-2_H-128')
- model = TFBertModel.from_pretrained("hhou435/chinese_roberta_L-2_H-128")
- text = "用你喜欢的任何文本替换我。"
- encoded_input = tokenizer(text, return_tensors='tf')
- output = model(encoded_input)
- ```
 
config.json DELETED
@@ -1,20 +0,0 @@
- {
-   "architectures": [
-     "BertForMaskedLM"
-   ],
-   "attention_probs_dropout_prob": 0.1,
-   "gradient_checkpointing": false,
-   "hidden_act": "gelu",
-   "hidden_dropout_prob": 0.1,
-   "hidden_size": 128,
-   "initializer_range": 0.02,
-   "intermediate_size": 512,
-   "layer_norm_eps": 1e-12,
-   "max_position_embeddings": 512,
-   "model_type": "bert",
-   "num_attention_heads": 2,
-   "num_hidden_layers": 2,
-   "pad_token_id": 0,
-   "type_vocab_size": 2,
-   "vocab_size": 21128
- }
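
The deleted config.json describes a very small BERT variant: 2 transformer layers, hidden size 128, 2 attention heads, a 512-unit feed-forward layer, and a 21,128-token vocabulary. As a minimal sketch (not part of this commit), the same shape can be reproduced locally with `transformers.BertConfig`:

```python
# Sketch only: mirror the deleted config.json with transformers.BertConfig and
# instantiate an *untrained* model of the same shape to inspect its size.
from transformers import BertConfig, BertForMaskedLM

config = BertConfig(
    vocab_size=21128,
    hidden_size=128,
    num_hidden_layers=2,
    num_attention_heads=2,
    intermediate_size=512,
    max_position_embeddings=512,
    type_vocab_size=2,
    hidden_act="gelu",
    hidden_dropout_prob=0.1,
    attention_probs_dropout_prob=0.1,
    initializer_range=0.02,
    layer_norm_eps=1e-12,
    pad_token_id=0,
)

model = BertForMaskedLM(config)  # random weights, same architecture
print(f"parameters: {sum(p.numel() for p in model.parameters()):,}")
```

This only mirrors the configuration values; the trained weights themselves live in the LFS-tracked files below.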
 
pytorch_model.bin DELETED
@@ -1,3 +0,0 @@
- version https://git-lfs.github.com/spec/v1
- oid sha256:5bc50a2bc8ac3bc07c8c5865c6893c813150b1d9b8bd3332af4940b347c850bf
- size 12840967
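
pytorch_model.bin is stored through Git LFS, so the repository records only a pointer: the spec version, the object's sha256, and its size in bytes. A minimal sketch, assuming a hypothetical local copy of the weights file, that checks a download against the pointer above:

```python
# Sketch only: check a local file against the Git LFS pointer recorded above.
# "pytorch_model.bin" is a hypothetical local path, not part of this commit.
import hashlib
from pathlib import Path

EXPECTED_OID = "5bc50a2bc8ac3bc07c8c5865c6893c813150b1d9b8bd3332af4940b347c850bf"
EXPECTED_SIZE = 12840967

data = Path("pytorch_model.bin").read_bytes()
assert len(data) == EXPECTED_SIZE, "size differs from the LFS pointer"
assert hashlib.sha256(data).hexdigest() == EXPECTED_OID, "sha256 differs from the LFS pointer"
print("file matches the LFS pointer")
```

The same check applies to tf_model.h5 further down, using its own oid and size.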
 
 
 
special_tokens_map.json DELETED
@@ -1 +0,0 @@
- {"unk_token": "[UNK]", "sep_token": "[SEP]", "pad_token": "[PAD]", "cls_token": "[CLS]", "mask_token": "[MASK]"}
 
tf_model.h5 DELETED
@@ -1,3 +0,0 @@
- version https://git-lfs.github.com/spec/v1
- oid sha256:88841e2f97913b66ed4ce2aa5be275352edc000b7b36a1daa3232aae29d9bd99
- size 24044192
 
 
 
tokenizer_config.json DELETED
@@ -1 +0,0 @@
- {"do_lower_case": false, "do_basic_tokenize": true, "never_split": null, "unk_token": "[UNK]", "sep_token": "[SEP]", "pad_token": "[PAD]", "cls_token": "[CLS]", "mask_token": "[MASK]", "tokenize_chinese_chars": true, "strip_accents": null, "model_max_length": 512}
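
Together with special_tokens_map.json above, this file defines a standard Chinese BERT tokenizer: no lowercasing, per-character splitting of Chinese text, the usual [UNK]/[SEP]/[PAD]/[CLS]/[MASK] special tokens, and a 512-token model limit. A minimal sketch, assuming a local copy of the vocab.txt that this commit also deletes, that instantiates the same tokenizer directly rather than via `from_pretrained`:

```python
# Sketch only: rebuild the tokenizer from the settings recorded in the deleted
# tokenizer_config.json and special_tokens_map.json. "vocab.txt" stands for a
# local copy of the vocabulary file that this commit also removes.
from transformers import BertTokenizer

tokenizer = BertTokenizer(
    vocab_file="vocab.txt",      # hypothetical local path
    do_lower_case=False,
    do_basic_tokenize=True,
    never_split=None,
    unk_token="[UNK]",
    sep_token="[SEP]",
    pad_token="[PAD]",
    cls_token="[CLS]",
    mask_token="[MASK]",
    tokenize_chinese_chars=True,
    strip_accents=None,
    model_max_length=512,
)

# Chinese characters come out one per token and [MASK] is kept intact.
print(tokenizer.tokenize("中国的首都是[MASK]京。"))
```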
 
vocab.txt DELETED
The diff for this file is too large to render. See raw diff