hoodiexxx
/

Bert_Chinese_Text_Classification_Model

Text Classification

Model card Files Files and versions Community

hoodiexxx commited on Feb 27

Commit

190cc68

•

1 Parent(s): ef00e45

Update README.md

Files changed (1) hide show

README.md +30 -39

README.md CHANGED Viewed

@@ -6,48 +6,39 @@ pipeline_tag: text-classification
 ---
 # Bert Chinese Text Classification Model
 this a Bert Model that train for customer service of logistics companies
 ## Word Label(word, index, number of occurences)
 ```sh
-我 1 18719
-个 2 12236
-快 3 8152
-一 4 8097
-递 5 7295
-那 6 7118
-了 7 6923
-的 8 6684
-是 9 6632
-到 10 6434
-你 11 5144
-没 12 4989
-有 13 4664
-下 14 4433
-这 15 4219
-在 16 4219
-么 17 4010
-查 18 3964
-就 19 3570
-好 20 3524
 ```
 ## Tokenizer

 ---
 # Bert Chinese Text Classification Model
 this a Bert Model that train for customer service of logistics companies
+### data(with noise since it from ASR text)
+train: 10878 rows
+dev:2720 rows
+total: 13598 rows
+### param
+embed_dim: 128
+batch size: 64
+contextsize: 20
+n_head: 2
+epoches: 100
 ## Word Label(word, index, number of occurences)
 ```sh
+我 1 18719
+个 2 12236
+快 3 8152
+一 4 8097
+递 5 7295
+那 6 7118
+了 7 6923
+的 8 6684
+是 9 6632
+到 10 6434
+你 11 5144
+没 12 4989
+有 13 4664
+下 14 4433
+这 15 4219
+在 16 4219
+么 17 4010
+查 18 3964
+就 19 3570
+好 20 3524
 ```
 ## Tokenizer