README.md · valhalla/distilt5-qg-hl-6-4 at refs/pr/1

metadata

datasets:
  - squad
tags:
  - question-generation
  - distilt5
  - distilt5-qg
widget:
  - text: <hl> 42 <hl> is the answer to life, the universe and everything. </s>
  - text: >-
      Python is a programming language. It is developed by <hl> Guido Van Rossum
      <hl>. </s>
  - text: Although <hl> practicality <hl> beats purity </s>
license: mit

DistilT5 for question-generation

This is distilled version of t5-small-qa-qg-hl model trained for answer aware question generation task. The answer spans are highlighted within the text with special highlight tokens.

The model is distilled using the No Teacher Distillation method proposed by Huggingface, here.

We just copy alternating layers from t5-small-qa-qg-hl and finetune more on the same data. Following table lists other distilled models and their metrics.

Name	BLEU-4	METEOR	ROUGE-L	QA-EM	QA-F1
distilt5-qg-hl-6-4	18.4141	24.8417	40.3435	-	-
distilt5-qa-qg-hl-6-4	18.6493	24.9685	40.5605	76.13	84.659
distilt5-qg-hl-12-6	20.5275	26.5010	43.2676	-	-
distilt5-qa-qg-hl-12-6	20.6109	26.4533	43.0895	81.61	89.831

You can play with the model using the inference API, just highlight the answer spans with <hl> tokens. For example

<hl> 42 <hl> is the answer to life, the universe and everything.

For more deatils see this repo.

Model in action 🚀

You'll need to clone the repo.

from pipelines import pipeline
nlp = pipeline("question-generation", model="valhalla/distilt5-qg-hl-6-4")
nlp("42 is the answer to life, universe and everything.")
=> [{'answer': '42', 'question': 'What is the answer to life?'}]