|
--- |
|
language: en |
|
license: cc-by-4.0 |
|
pipeline_tag: summarization |
|
widget: |
|
- text: "Example question body." |
|
--- |
|
|
|
# Titlewave: t5-base |
|
|
|
## Model description |
|
|
|
Titlewave is a Chrome extension that helps you choose better titles for your Stack Overflow questions. See https://github.com/tennessejoyce/TitleWave for more information. |
|
This is one of two NLP models used in the Titlewave project, and its purpose is to suggests a new title based on on the body of the question. The companion model (https://huggingface.co/tennessejoyce/titlewave-bert-base-uncased) classifies whether question will be answered or not just based on the title |
|
|
|
## Intended use |
|
|
|
Try out different titles for your Stack Overflow post, and see which one gives you the best chance of recieving an answer. |
|
This model can be used in your browser as a Chrome extension by following the installation instructions at https://github.com/tennessejoyce/TitleWave. |
|
Or load it in Python like this (which will automatically download the model to your machine): |
|
|
|
```python |
|
>>> from transformers import pipeline |
|
>>> classifier = pipeline('summarization', model='tennessejoyce/titlewave-t5-base') |
|
>>> body = """"Example question body.""" |
|
>>> classifier(body) |
|
|
|
[{'summary_text': 'Example title suggestion?'}] |
|
``` |
|
|
|
|
|
## Training data |
|
|
|
The weights were initialized from the BERT base model (https://huggingface.co/bert-base-uncased), which was trained on BookCorpus and English Wikipedia. |
|
Then the model was fine-tuned on the dataset of previous Stack Overflow post titles (https://archive.org/details/stackexchange). |
|
Specifically I used three years of posts from 2017-2019, filtered out posts which were closed, and selected 25% of the remaining posts at random to use in |
|
the training set. In order to improve the quality of the titles generated, the model was trained only on questions with an accepted answer. |
|
|
|
## Evaluation |
|
|
|
See https://github.com/tennessejoyce/TitleWave/blob/master/model_training/test_summarizer.ipynb for the performance of the title generation model on the test set. |
|
|
|
|