tmp_trainer

This model is a fine-tuned version of facebook/opt-350m on the addressWithContext dataset.

Model description

Make sure to set max_new_tokens = 20; otherwise, the model will generate one token at a time.

nlp = pipeline("text-generation",
                model="piazzola/tmp_trainer",
                max_new_tokens=20)
                
nlp("I live at 15 Firstfield Road.")

Note that if you would like to try longer sentences using the Hosted inference API on the right hand side on this website, you might need to click "Compute" more than one time to get the address.

Intended uses & limitations

The model is intended to detect addresses that occur in a sentence.

Training and evaluation data

This model is trained on piazzola/addressWithContext.

Training hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 5e-05
  • train_batch_size: 8
  • eval_batch_size: 8
  • seed: 42
  • optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  • lr_scheduler_type: linear
  • num_epochs: 3.0

Framework versions

  • Transformers 4.34.0
  • Pytorch 2.0.1+cu118
  • Datasets 2.14.5
  • Tokenizers 0.14.1
Downloads last month
37
Inference Examples
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.

Model tree for piazzola/address-detection-model

Base model

facebook/opt-350m
Finetuned
(108)
this model