language:
- en
license: apache-2.0
tags:
- generated_from_trainer
- finance
- intent-classification
datasets:
- banking77
metrics:
- accuracy
pipeline_tag: text-classification
base_model: distilbert-base-uncased
model-index:
- name: banking-intent-distilbert-classifier
results: []
banking-intent-distilbert-classifier
This model is a fine-tuned version of distilbert-base-uncased on the banking77 dataset. It achieves the following results on the evaluation set:
- eval_loss: 0.2885
- eval_accuracy: 0.9244
- eval_runtime: 1.9357
- eval_samples_per_second: 1591.148
- eval_steps_per_second: 99.705
- epoch: 10.0
- step: 3130
Note: This is just a simple example of fine-tuning a DistilBERT model for multi-class classification task to see how much it costs to train this model on Google Cloud (using a T4 GPU). It costs me about 1.07 SGD and takes less than 20 mins to complete the training. Although my intention was just to test it out on Google Cloud, the model has been appropriately trained and is now ready to be used. Hopefully, it is what you're looking for.
Inference example
from transformers import AutoTokenizer, AutoModelForSequenceClassification
tokenizer = AutoTokenizer.from_pretrained("lxyuan/banking-intent-distilbert-classifier")
model = AutoModelForSequenceClassification.from_pretrained("lxyuan/banking-intent-distilbert-classifier")
banking_intend_classifier = TextClassificationPipeline(
model=model,
tokenizer=tokenizer,
device=0
)
banking_intend_classifier("How to report lost card?")
>>> [{'label': 'lost_or_stolen_card', 'score': 0.9518502950668335}]
Training and evaluation data
The BANKING77 dataset consists of online banking queries labeled with their corresponding intents, offering a comprehensive collection of 77 finely categorized intents within the banking domain. With a total of 13,083 customer service queries, it specifically emphasizes precise intent detection within a single domain.
Training procedure
- To reproduce the result, please refer to this notebook
- To run the evaluation, please refer to this evaluation notebook
Evaluation
Evaluation result
Classification Report: precision recall f1-score support
activate_my_card 1.0000 0.9750 0.9873 40
age_limit 0.9756 1.0000 0.9877 40
apple_pay_or_google_pay 1.0000 1.0000 1.0000 40
atm_support 0.9750 0.9750 0.9750 40
automatic_top_up 1.0000 0.9000 0.9474 40
balance_not_updated_after_bank_transfer 0.8205 0.8000 0.8101 40
balance_not_updated_after_cheque_or_cash_deposit 1.0000 0.9750 0.9873 40
beneficiary_not_allowed 0.9250 0.9250 0.9250 40
cancel_transfer 1.0000 0.9750 0.9873 40
card_about_to_expire 0.9756 1.0000 0.9877 40
card_acceptance 0.9189 0.8500 0.8831 40
card_arrival 0.9459 0.8750 0.9091 40
card_delivery_estimate 0.8605 0.9250 0.8916 40
card_linking 0.9302 1.0000 0.9639 40
card_not_working 0.8478 0.9750 0.9070 40
card_payment_fee_charged 0.7917 0.9500 0.8636 40
card_payment_not_recognised 0.9231 0.9000 0.9114 40
card_payment_wrong_exchange_rate 0.9048 0.9500 0.9268 40
card_swallowed 1.0000 0.8750 0.9333 40
cash_withdrawal_charge 0.9744 0.9500 0.9620 40
cash_withdrawal_not_recognised 0.8667 0.9750 0.9176 40
change_pin 0.9302 1.0000 0.9639 40
compromised_card 0.8889 0.8000 0.8421 40
contactless_not_working 1.0000 0.9000 0.9474 40
country_support 0.9512 0.9750 0.9630 40
declined_card_payment 0.8125 0.9750 0.8864 40
declined_cash_withdrawal 0.7843 1.0000 0.8791 40
declined_transfer 0.9667 0.7250 0.8286 40
direct_debit_payment_not_recognised 0.9444 0.8500 0.8947 40
disposable_card_limits 0.8974 0.8750 0.8861 40
edit_personal_details 0.9302 1.0000 0.9639 40
exchange_charge 0.9722 0.8750 0.9211 40
exchange_rate 0.9091 1.0000 0.9524 40
exchange_via_app 0.8085 0.9500 0.8736 40
extra_charge_on_statement 1.0000 0.9500 0.9744 40
failed_transfer 0.8333 0.8750 0.8537 40
fiat_currency_support 0.8718 0.8500 0.8608 40
get_disposable_virtual_card 0.9722 0.8750 0.9211 40
get_physical_card 0.9756 1.0000 0.9877 40
getting_spare_card 0.9500 0.9500 0.9500 40
getting_virtual_card 0.8667 0.9750 0.9176 40
lost_or_stolen_card 0.8261 0.9500 0.8837 40
lost_or_stolen_phone 0.9750 0.9750 0.9750 40
order_physical_card 0.9231 0.9000 0.9114 40
passcode_forgotten 1.0000 1.0000 1.0000 40
pending_card_payment 0.9500 0.9500 0.9500 40
pending_cash_withdrawal 1.0000 0.9500 0.9744 40
pending_top_up 0.9268 0.9500 0.9383 40
pending_transfer 0.8611 0.7750 0.8158 40
pin_blocked 0.9714 0.8500 0.9067 40
receiving_money 1.0000 0.9250 0.9610 40
Refund_not_showing_up 1.0000 0.9250 0.9610 40
request_refund 0.9512 0.9750 0.9630 40
reverted_card_payment? 0.9286 0.9750 0.9512 40
supported_cards_and_currencies 0.9744 0.9500 0.9620 40
terminate_account 0.9302 1.0000 0.9639 40
top_up_by_bank_transfer_charge 1.0000 0.8250 0.9041 40
top_up_by_card_charge 0.9286 0.9750 0.9512 40
top_up_by_cash_or_cheque 0.8810 0.9250 0.9024 40
top_up_failed 0.9024 0.9250 0.9136 40
top_up_limits 0.9286 0.9750 0.9512 40
top_up_reverted 0.9706 0.8250 0.8919 40
topping_up_by_card 0.8421 0.8000 0.8205 40
transaction_charged_twice 0.9302 1.0000 0.9639 40
transfer_fee_charged 0.9024 0.9250 0.9136 40
transfer_into_account 0.9167 0.8250 0.8684 40
transfer_not_received_by_recipient 0.7778 0.8750 0.8235 40
transfer_timing 0.8372 0.9000 0.8675 40
unable_to_verify_identity 0.9250 0.9250 0.9250 40
verify_my_identity 0.7955 0.8750 0.8333 40
verify_source_of_funds 0.9524 1.0000 0.9756 40
verify_top_up 1.0000 1.0000 1.0000 40
virtual_card_not_working 1.0000 0.9250 0.9610 40
visa_or_mastercard 0.9737 0.9250 0.9487 40
why_verify_identity 0.9118 0.7750 0.8378 40
wrong_amount_of_cash_received 1.0000 0.8750 0.9333 40
wrong_exchange_rate_for_cash_withdrawal 0.9730 0.9000 0.9351 40
accuracy 0.9244 3080
macro avg 0.9282 0.9244 0.9243 3080
weighted avg 0.9282 0.9244 0.9243 3080
Training hyperparameters
The following hyperparameters were used during training:
- learning_rate: 2e-05
- train_batch_size: 16
- eval_batch_size: 16
- seed: 42
- gradient_accumulation_steps: 2
- total_train_batch_size: 32
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
- lr_scheduler_type: linear
- num_epochs: 10
- mixed_precision_training: Native AMP
Framework versions
- Transformers 4.29.2
- Pytorch 1.9.0+cu111
- Datasets 2.12.0
- Tokenizers 0.13.3