--- language: - tr pipeline_tag: token-classification tags: - ner widget: - text: "Lütfen yardım Piyalepasa mahallesi Rüzgar sokak Meltem apartmanı no: 22 Hatay akrabalarım göçük altında #dummy" --- ## Address NER - **Language**: Turkish - **PLM**: dbmdz/bert-base-turkish-128k-cased - **Macro-F1 Score**: 84% - **Dataset**: [NER v2 dataset](https://huggingface.co/datasets/deprem-private/ner_v12) - **Hyperparameters**: per_device_train_batch_size = 16, per_device_eval_batch_size = 32, num_train_epochs = 5, weight_decay = 0.1, warmup_ratio = 0.1, learning_rate = 5e-5 ### Model Comparison | | Macro-F1 | |----------------------------------------------------|----------| | dbmdz/bert-base-turkish-128k-cased | 0.84 | | dbmdz/bert-base-turkish-cased | 0.83 | | bert-base-multilingual-cased | 0.79 | | dbmdz/electra-base-turkish-mc4-cased-discriminator | 0.76 | | xlm-roberta-base | 0.75 | | dbmdz/convbert-base-turkish-cased | 0.70 | ### Class Performance | | support | precision | recall | f1 | |:----------|----------:|------------:|---------:|-----:| | overall | 957 | 0.84 | 0.88 | 0.86 | | bina | 66 | 0.66 | 0.74 | 0.7 | | bulvar | 13 | 0.92 | 0.92 | 0.92 | | cadde | 57 | 0.77 | 0.84 | 0.81 | | diskapino | 70 | 0.69 | 0.73 | 0.71 | | ilce | 117 | 0.89 | 0.96 | 0.92 | | isim | 113 | 0.86 | 0.9 | 0.88 | | mahalle | 120 | 0.77 | 0.82 | 0.79 | | sehir | 146 | 0.98 | 0.97 | 0.97 | | site | 18 | 0.79 | 0.61 | 0.69 | | sokak | 62 | 0.72 | 0.74 | 0.73 | | soyisim | 98 | 0.94 | 0.95 | 0.94 | | telefonno | 77 | 0.99 | 1 | 0.99 |