finer_ner_finetuning_0130

This model is a fine-tuned version of distilbert-base-uncased on a subset of the nlpaueb/finer-139 dataset.

It is only finetuned on the following labels:

  • 'O'
  • 'B-DebtInstrumentBasisSpreadOnVariableRate1',
  • 'B-DebtInstrumentFaceAmount',
  • 'B-DebtInstrumentInterestRateStatedPercentage',
  • 'B-LineOfCreditFacilityMaximumBorrowingCapacity'

It achieves the following results on the evaluation set:

  • Loss: 0.0024
  • Accuracy: 0.9995
  • Precision: 0.7342
  • Recall: 0.9159
  • F1: 0.8150
  • Classification Report: {'DebtInstrumentBasisSpreadOnVariableRate1': {'precision': 0.791095890410959, 'recall': 0.9602137767220903, 'f1-score': 0.8674892703862661, 'support': 1684}, 'DebtInstrumentFaceAmount': {'precision': 0.6338946224877784, 'recall': 0.8670133729569094, 'f1-score': 0.7323501725760904, 'support': 1346}, 'DebtInstrumentInterestRateStatedPercentage': {'precision': 0.774822695035461, 'recall': 0.9494839760999457, 'f1-score': 0.8533072980229436, 'support': 1841}, 'LineOfCreditFacilityMaximumBorrowingCapacity': {'precision': 0.7227383863080684, 'recall': 0.8740390301596689, 'f1-score': 0.791220556745182, 'support': 1691}, 'micro avg': {'precision': 0.7341803078426582, 'recall': 0.9158793050899117, 'f1-score': 0.8150257662055873, 'support': 6562}, 'macro avg': {'precision': 0.7306378985605667, 'recall': 0.9126875389846535, 'f1-score': 0.8110918244326205, 'support': 6562}, 'weighted avg': {'precision': 0.7366697400377676, 'recall': 0.9158793050899117, 'f1-score': 0.8161365377528546, 'support': 6562}}

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

Subset of nlpaueb/finer-139 train split and and full validation split. Given the original proportion of 'O' labels (approx. 80%), we reduced the train split records to match the original majority class proportion given the reduced set of labels. Original train split size: 900384 records Subset train split size: 142513 records (~16% of original amount of records)

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 5e-05
  • train_batch_size: 192
  • eval_batch_size: 192
  • seed: 42
  • optimizer: Use adamw_torch with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
  • lr_scheduler_type: linear
  • lr_scheduler_warmup_steps: 500
  • num_epochs: 20
  • mixed_precision_training: Native AMP

Training results

Training Loss Epoch Step Validation Loss Accuracy Precision Recall F1 Classification Report
0.0063 0.6729 500 0.0035 0.9987 0.4732 0.9122 0.6232 {'DebtInstrumentBasisSpreadOnVariableRate1': {'precision': 0.6499202551834131, 'recall': 0.9679334916864608, 'f1-score': 0.7776717557251909, 'support': 1684}, 'DebtInstrumentFaceAmount': {'precision': 0.3665031534688157, 'recall': 0.7771173848439822, 'f1-score': 0.49809523809523815, 'support': 1346}, 'DebtInstrumentInterestRateStatedPercentage': {'precision': 0.5214669051878354, 'recall': 0.9500271591526345, 'f1-score': 0.673339749759384, 'support': 1841}, 'LineOfCreditFacilityMaximumBorrowingCapacity': {'precision': 0.3968980422069667, 'recall': 0.9231224127735068, 'f1-score': 0.5551209103840683, 'support': 1691}, 'micro avg': {'precision': 0.4732389912246027, 'recall': 0.9122218835720817, 'f1-score': 0.62318463380355, 'support': 6562}, 'macro avg': {'precision': 0.4836970890117577, 'recall': 0.904550112114146, 'f1-score': 0.6260569134909704, 'support': 6562}, 'weighted avg': {'precision': 0.49054466871695807, 'recall': 0.9122218835720817, 'f1-score': 0.6337036522224775, 'support': 6562}}
0.0044 1.3459 1000 0.0019 0.9993 0.6299 0.8856 0.7361 {'DebtInstrumentBasisSpreadOnVariableRate1': {'precision': 0.7140974967061924, 'recall': 0.9655581947743468, 'f1-score': 0.8210047967684928, 'support': 1684}, 'DebtInstrumentFaceAmount': {'precision': 0.5513164965072541, 'recall': 0.7622585438335809, 'f1-score': 0.6398503274087932, 'support': 1346}, 'DebtInstrumentInterestRateStatedPercentage': {'precision': 0.6613592233009709, 'recall': 0.9250407387289517, 'f1-score': 0.771286231884058, 'support': 1841}, 'LineOfCreditFacilityMaximumBorrowingCapacity': {'precision': 0.5793871866295265, 'recall': 0.8610289769367239, 'f1-score': 0.6926736441484301, 'support': 1691}, 'micro avg': {'precision': 0.6298504227184045, 'recall': 0.8855531850045718, 'f1-score': 0.7361287053458322, 'support': 6562}, 'macro avg': {'precision': 0.626540100785986, 'recall': 0.8784716135684008, 'f1-score': 0.7312037500524435, 'support': 6562}, 'weighted avg': {'precision': 0.6311975390794893, 'recall': 0.8855531850045718, 'f1-score': 0.7368271416647247, 'support': 6562}}
0.0038 2.0188 1500 0.0018 0.9994 0.6685 0.8735 0.7573 {'DebtInstrumentBasisSpreadOnVariableRate1': {'precision': 0.8146964856230032, 'recall': 0.9085510688836105, 'f1-score': 0.8590679393599102, 'support': 1684}, 'DebtInstrumentFaceAmount': {'precision': 0.5430597771023303, 'recall': 0.7964338781575037, 'f1-score': 0.6457831325301205, 'support': 1346}, 'DebtInstrumentInterestRateStatedPercentage': {'precision': 0.6825586015097338, 'recall': 0.933188484519283, 'f1-score': 0.7884350619550253, 'support': 1841}, 'LineOfCreditFacilityMaximumBorrowingCapacity': {'precision': 0.6400725294650952, 'recall': 0.8350088704908338, 'f1-score': 0.7246599948678469, 'support': 1691}, 'micro avg': {'precision': 0.6684548104956268, 'recall': 0.8735141725083816, 'f1-score': 0.7573495408601439, 'support': 6562}, 'macro avg': {'precision': 0.6700968484250407, 'recall': 0.8682955755128078, 'f1-score': 0.7544865321782257, 'support': 6562}, 'weighted avg': {'precision': 0.6769064880331865, 'recall': 0.8735141725083816, 'f1-score': 0.7608661241463521, 'support': 6562}}
0.003 2.6918 2000 0.0017 0.9994 0.6856 0.9012 0.7788 {'DebtInstrumentBasisSpreadOnVariableRate1': {'precision': 0.7413636363636363, 'recall': 0.9685273159144893, 'f1-score': 0.8398558187435634, 'support': 1684}, 'DebtInstrumentFaceAmount': {'precision': 0.598568281938326, 'recall': 0.8075780089153046, 'f1-score': 0.6875395319418091, 'support': 1346}, 'DebtInstrumentInterestRateStatedPercentage': {'precision': 0.7389801210025929, 'recall': 0.928843020097773, 'f1-score': 0.8231046931407943, 'support': 1841}, 'LineOfCreditFacilityMaximumBorrowingCapacity': {'precision': 0.6472125435540069, 'recall': 0.8787699586043761, 'f1-score': 0.7454226235264609, 'support': 1691}, 'micro avg': {'precision': 0.685601669371667, 'recall': 0.9012496190185919, 'f1-score': 0.7787727153015538, 'support': 6562}, 'macro avg': {'precision': 0.6815311457146406, 'recall': 0.8959295758829857, 'f1-score': 0.773980666838157, 'support': 6562}, 'weighted avg': {'precision': 0.6871423476136771, 'recall': 0.9012496190185919, 'f1-score': 0.7795779953083334, 'support': 6562}}
0.0022 3.3647 2500 0.0018 0.9994 0.6925 0.8973 0.7817 {'DebtInstrumentBasisSpreadOnVariableRate1': {'precision': 0.7884519661523146, 'recall': 0.9406175771971497, 'f1-score': 0.8578391551584078, 'support': 1684}, 'DebtInstrumentFaceAmount': {'precision': 0.6066779852857951, 'recall': 0.7964338781575037, 'f1-score': 0.6887247028589785, 'support': 1346}, 'DebtInstrumentInterestRateStatedPercentage': {'precision': 0.729933110367893, 'recall': 0.9483976099945681, 'f1-score': 0.8249468462083628, 'support': 1841}, 'LineOfCreditFacilityMaximumBorrowingCapacity': {'precision': 0.6364025695931478, 'recall': 0.8787699586043761, 'f1-score': 0.7382016890213613, 'support': 1691}, 'micro avg': {'precision': 0.6924614841820534, 'recall': 0.8972874123742761, 'f1-score': 0.7816793893129771, 'support': 6562}, 'macro avg': {'precision': 0.6903664078497876, 'recall': 0.8910547559883993, 'f1-score': 0.7774280983117776, 'support': 6562}, 'weighted avg': {'precision': 0.695566181128388, 'recall': 0.8972874123742761, 'f1-score': 0.7830921650929078, 'support': 6562}}
0.0022 4.0377 3000 0.0019 0.9994 0.6973 0.9098 0.7895 {'DebtInstrumentBasisSpreadOnVariableRate1': {'precision': 0.764594209776934, 'recall': 0.9566508313539193, 'f1-score': 0.8499076760749142, 'support': 1684}, 'DebtInstrumentFaceAmount': {'precision': 0.6087877183695076, 'recall': 0.8543833580980683, 'f1-score': 0.7109737248840804, 'support': 1346}, 'DebtInstrumentInterestRateStatedPercentage': {'precision': 0.7853860294117647, 'recall': 0.9282998370450842, 'f1-score': 0.8508837440876277, 'support': 1841}, 'LineOfCreditFacilityMaximumBorrowingCapacity': {'precision': 0.6276150627615062, 'recall': 0.8870490833826138, 'f1-score': 0.7351139426611124, 'support': 1691}, 'micro avg': {'precision': 0.6972669936930623, 'recall': 0.9097836025601951, 'f1-score': 0.7894736842105262, 'support': 6562}, 'macro avg': {'precision': 0.6965957550799281, 'recall': 0.9065957774699215, 'f1-score': 0.7867197719269337, 'support': 6562}, 'weighted avg': {'precision': 0.7031694101594759, 'recall': 0.9097836025601951, 'f1-score': 0.7921014645092033, 'support': 6562}}
0.0017 4.7106 3500 0.0018 0.9995 0.7319 0.8888 0.8028 {'DebtInstrumentBasisSpreadOnVariableRate1': {'precision': 0.8151781104801239, 'recall': 0.9376484560570071, 'f1-score': 0.872134769400718, 'support': 1684}, 'DebtInstrumentFaceAmount': {'precision': 0.6239267315397825, 'recall': 0.8098068350668648, 'f1-score': 0.704817329453605, 'support': 1346}, 'DebtInstrumentInterestRateStatedPercentage': {'precision': 0.7620508326029798, 'recall': 0.9445953286257469, 'f1-score': 0.8435605141886975, 'support': 1841}, 'LineOfCreditFacilityMaximumBorrowingCapacity': {'precision': 0.7112887112887113, 'recall': 0.8421052631578947, 'f1-score': 0.7711887354454373, 'support': 1691}, 'micro avg': {'precision': 0.7319277108433735, 'recall': 0.8887534288326729, 'f1-score': 0.8027529249827942, 'support': 6562}, 'macro avg': {'precision': 0.7281110964778994, 'recall': 0.8835389707268784, 'f1-score': 0.7979253371221144, 'support': 6562}, 'weighted avg': {'precision': 0.7342715806632691, 'recall': 0.8887534288326729, 'f1-score': 0.8037845375457159, 'support': 6562}}
0.0013 5.3836 4000 0.0020 0.9995 0.7302 0.9075 0.8093 {'DebtInstrumentBasisSpreadOnVariableRate1': {'precision': 0.7983991995997999, 'recall': 0.9477434679334917, 'f1-score': 0.8666847678522944, 'support': 1684}, 'DebtInstrumentFaceAmount': {'precision': 0.6646489104116223, 'recall': 0.8157503714710252, 'f1-score': 0.7324883255503669, 'support': 1346}, 'DebtInstrumentInterestRateStatedPercentage': {'precision': 0.7903669724770642, 'recall': 0.9359043997827268, 'f1-score': 0.8570007460830639, 'support': 1841}, 'LineOfCreditFacilityMaximumBorrowingCapacity': {'precision': 0.6617900172117039, 'recall': 0.9095209934949734, 'f1-score': 0.7661270236612702, 'support': 1691}, 'micro avg': {'precision': 0.730226854690374, 'recall': 0.9074977141115513, 'f1-score': 0.809268193245906, 'support': 6562}, 'macro avg': {'precision': 0.7288012749250476, 'recall': 0.9022298081705543, 'f1-score': 0.8055752157867488, 'support': 6562}, 'weighted avg': {'precision': 0.7335071930776247, 'recall': 0.9074977141115513, 'f1-score': 0.8105281325516894, 'support': 6562}}
0.0012 6.0565 4500 0.0018 0.9996 0.7783 0.8785 0.8254 {'DebtInstrumentBasisSpreadOnVariableRate1': {'precision': 0.798810703666997, 'recall': 0.9572446555819477, 'f1-score': 0.8708806050783361, 'support': 1684}, 'DebtInstrumentFaceAmount': {'precision': 0.72812291249165, 'recall': 0.8098068350668648, 'f1-score': 0.7667956384101302, 'support': 1346}, 'DebtInstrumentInterestRateStatedPercentage': {'precision': 0.8257650542941757, 'recall': 0.908745247148289, 'f1-score': 0.865270235324541, 'support': 1841}, 'LineOfCreditFacilityMaximumBorrowingCapacity': {'precision': 0.744908896034298, 'recall': 0.8219988172678888, 'f1-score': 0.7815574922687659, 'support': 1691}, 'micro avg': {'precision': 0.7783178074794114, 'recall': 0.8785431270953977, 'f1-score': 0.8253990980027203, 'support': 6562}, 'macro avg': {'precision': 0.7744018916217801, 'recall': 0.8744488887662476, 'f1-score': 0.8211259927704433, 'support': 6562}, 'weighted avg': {'precision': 0.777983095601731, 'recall': 0.8785431270953977, 'f1-score': 0.8249384472585973, 'support': 6562}}
0.0011 6.7295 5000 0.0022 0.9995 0.7162 0.9163 0.8040 {'DebtInstrumentBasisSpreadOnVariableRate1': {'precision': 0.7986980470706059, 'recall': 0.9471496437054632, 'f1-score': 0.8666123336049986, 'support': 1684}, 'DebtInstrumentFaceAmount': {'precision': 0.6109375, 'recall': 0.8714710252600297, 'f1-score': 0.7183098591549296, 'support': 1346}, 'DebtInstrumentInterestRateStatedPercentage': {'precision': 0.7634455618714473, 'recall': 0.9483976099945681, 'f1-score': 0.8459302325581395, 'support': 1841}, 'LineOfCreditFacilityMaximumBorrowingCapacity': {'precision': 0.6838503649635036, 'recall': 0.8864577173270254, 'f1-score': 0.7720834406386814, 'support': 1691}, 'micro avg': {'precision': 0.7161743687470223, 'recall': 0.9163364827796403, 'f1-score': 0.8039844899050675, 'support': 6562}, 'macro avg': {'precision': 0.7142328684763892, 'recall': 0.9133689990717716, 'f1-score': 0.8007339664891873, 'support': 6562}, 'weighted avg': {'precision': 0.7206985115552452, 'recall': 0.9163364827796403, 'f1-score': 0.8060303103433248, 'support': 6562}}
0.0007 7.4024 5500 0.0021 0.9995 0.7513 0.9060 0.8214 {'DebtInstrumentBasisSpreadOnVariableRate1': {'precision': 0.8039314516129032, 'recall': 0.9471496437054632, 'f1-score': 0.8696837513631407, 'support': 1684}, 'DebtInstrumentFaceAmount': {'precision': 0.6957605985037406, 'recall': 0.8291233283803864, 'f1-score': 0.7566101694915255, 'support': 1346}, 'DebtInstrumentInterestRateStatedPercentage': {'precision': 0.7822182308037718, 'recall': 0.9462248777838131, 'f1-score': 0.8564405113077679, 'support': 1841}, 'LineOfCreditFacilityMaximumBorrowingCapacity': {'precision': 0.7111534795042898, 'recall': 0.8823181549379066, 'f1-score': 0.7875428873053576, 'support': 1691}, 'micro avg': {'precision': 0.7512953367875648, 'recall': 0.9059737884791222, 'f1-score': 0.8214162348877374, 'support': 6562}, 'macro avg': {'precision': 0.7482659401061764, 'recall': 0.9012040012018924, 'f1-score': 0.817569329866948, 'support': 6562}, 'weighted avg': {'precision': 0.7517431616662088, 'recall': 0.9059737884791222, 'f1-score': 0.8216072430938864, 'support': 6562}}
0.0007 8.0754 6000 0.0024 0.9995 0.7342 0.9159 0.8150 {'DebtInstrumentBasisSpreadOnVariableRate1': {'precision': 0.791095890410959, 'recall': 0.9602137767220903, 'f1-score': 0.8674892703862661, 'support': 1684}, 'DebtInstrumentFaceAmount': {'precision': 0.6338946224877784, 'recall': 0.8670133729569094, 'f1-score': 0.7323501725760904, 'support': 1346}, 'DebtInstrumentInterestRateStatedPercentage': {'precision': 0.774822695035461, 'recall': 0.9494839760999457, 'f1-score': 0.8533072980229436, 'support': 1841}, 'LineOfCreditFacilityMaximumBorrowingCapacity': {'precision': 0.7227383863080684, 'recall': 0.8740390301596689, 'f1-score': 0.791220556745182, 'support': 1691}, 'micro avg': {'precision': 0.7341803078426582, 'recall': 0.9158793050899117, 'f1-score': 0.8150257662055873, 'support': 6562}, 'macro avg': {'precision': 0.7306378985605667, 'recall': 0.9126875389846535, 'f1-score': 0.8110918244326205, 'support': 6562}, 'weighted avg': {'precision': 0.7366697400377676, 'recall': 0.9158793050899117, 'f1-score': 0.8161365377528546, 'support': 6562}}

Framework versions

  • Transformers 4.46.3
  • Pytorch 2.5.1+cu124
  • Datasets 3.2.0
  • Tokenizers 0.20.3

How to use this model

Use a pipeline as a high-level helper

from transformers import pipeline

pipe = pipeline("token-classification", model="martindevoto/finer_ner_finetuning_0130")

Load model directly

from transformers import AutoTokenizer, AutoModelForTokenClassification

tokenizer = AutoTokenizer.from_pretrained("martindevoto/finer_ner_finetuning_0130") model = AutoModelForTokenClassification.from_pretrained("martindevoto/finer_ner_finetuning_0130")

Downloads last month
3
Safetensors
Model size
66.4M params
Tensor type
F32
·
Inference Providers NEW
This model is not currently available via any of the supported third-party Inference Providers, and the model is not deployed on the HF Inference API.

Model tree for martindevoto/finer_ner_finetuning_0130

Finetuned
(7321)
this model

Dataset used to train martindevoto/finer_ner_finetuning_0130