
	## Natural Don't Know Response Model

	A [Google T5](https://ai.googleblog.com/2020/02/exploring-transfer-learning-with-t5.html) model fine-tuned on a combination of dependency-rule-based data and the [Quora Question Pairs (QQP)](https://huggingface.co/nlp/viewer/?dataset=quora) dataset for the Don't Know Response Generation task.

	Additional information about this model:
	- Paper: [Saying No is An Art: Contextualized Fallback Responses for Unanswerable Dialogue Queries](https://arxiv.org/pdf/2012.01873.pdf)
	- GitHub repo: https://github.com/kaustubhdhole/natural-dont-know

	#### How to use
	```python
	# T5Tokenizer requires the sentencepiece package to be installed.
	from transformers import T5ForConditionalGeneration, T5Tokenizer

	model_name = "ashish-shrivastava/dont-know-response"
	model = T5ForConditionalGeneration.from_pretrained(model_name)
	tokenizer = T5Tokenizer.from_pretrained(model_name)

	# Encode the user query, generate a fallback response, and decode it.
	input_text = "Where can I find good Italian food ?"
	input_ids = tokenizer.encode(input_text, return_tensors="pt")
	outputs = model.generate(input_ids)
	decoded_output = tokenizer.decode(outputs[0], skip_special_tokens=True)
	print(decoded_output)  # I'm not sure where you can get good quality Italian food.
	```
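To respond to several queries at once, the single-query snippet above can be wrapped in a small helper. This is only a sketch: the function name `dont_know_responses` and the default `max_output_len=64` are assumptions made for illustration, not part of the original model card. The input length cap of 256 mirrors the `max_seq_len` listed under Hyperparameters.

```python
def dont_know_responses(questions, model, tokenizer,
                        max_input_len=256, max_output_len=64):
    """Generate one fallback response per question in a single batch.

    `model` and `tokenizer` are expected to be the T5 objects loaded in the
    snippet above. `max_output_len` is an assumed default, not a documented one.
    """
    questions = list(questions)
    if not questions:
        return []

    # Pad to the longest query in the batch and truncate overly long inputs.
    encoded = tokenizer(
        questions,
        padding=True,
        truncation=True,
        max_length=max_input_len,
        return_tensors="pt",
    )

    # Pass the attention mask so padded positions are ignored during generation.
    outputs = model.generate(
        input_ids=encoded["input_ids"],
        attention_mask=encoded["attention_mask"],
        max_length=max_output_len,
    )
    return tokenizer.batch_decode(outputs, skip_special_tokens=True)
```

With the model and tokenizer loaded as shown earlier, a call might look like `dont_know_responses(["Where can I find good Italian food ?", "Who won the match yesterday ?"], model, tokenizer)`, which returns a list of strings in the same order as the input.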

	#### Hyperparameters

	```
	n_epochs = 2
	base_LM_model = "T5-base"
	max_seq_len = 256
	learning_rate = 3e-4
	adam_epsilon = 1e-8
	train_batch_size = 6
	```
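For anyone trying to reproduce the fine-tuning run, the values above can be collected into a single config object. The sketch below is illustrative only: the `FineTuneConfig` class, the `total_optimizer_steps` helper, and the dataset size of 1000 examples are hypothetical and do not come from the original training script. It also assumes one optimizer update per batch (no gradient accumulation) and that the final partial batch is kept.

```python
import math
from dataclasses import dataclass


@dataclass(frozen=True)
class FineTuneConfig:
    # Values copied from the hyperparameter list above.
    n_epochs: int = 2
    base_LM_model: str = "T5-base"
    max_seq_len: int = 256
    learning_rate: float = 3e-4
    adam_epsilon: float = 1e-8
    train_batch_size: int = 6


def total_optimizer_steps(cfg: FineTuneConfig, n_examples: int) -> int:
    """Count parameter updates, assuming one update per batch."""
    # The last batch of an epoch may be smaller than train_batch_size.
    steps_per_epoch = math.ceil(n_examples / cfg.train_batch_size)
    return steps_per_epoch * cfg.n_epochs


cfg = FineTuneConfig()
# Hypothetical dataset of 1000 training pairs: ceil(1000 / 6) = 167 steps per epoch.
print(total_optimizer_steps(cfg, n_examples=1000))  # 334
```

A step count like this is mainly useful for sizing a learning-rate schedule or estimating training time before launching a run.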

	#### BibTeX entry and citation info

	```bibtex
	@misc{shrivastava2020saying,
	title={Saying No is An Art: Contextualized Fallback Responses for Unanswerable Dialogue Queries},
	author={Ashish Shrivastava and Kaustubh Dhole and Abhinav Bhatt and Sharvani Raghunath},
	year={2020},
	eprint={2012.01873},
	archivePrefix={arXiv},
	primaryClass={cs.CL}
	}
	```