khhuang
/

zerofec-daqa-t5-base

Text2Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

zerofec-daqa-t5-base / README.md

khuangaf

update README

edc7e25 over 1 year ago

|

2.8 kB

	---
	language: en
	widget:
	- text: What is Night of the Living Dead? \n Night of the Living Dead is a 1968 American independent horror film , directed by George A. Romero , starring Duane Jones and Judith O'Dea . George A. Romero George A. Romero Duane Jones Duane Jones Judith O'Dea Judith O'Dea independent Independent film horror film horror film.
	---

	# Domain-adapted QA Model From ZeroFEC

	ZeroFEC is a faithful and interpetable factual error correction framework introduced in the paper [Zero-shot Faithful Factual Error Correction](https://aclanthology.org/2023.acl-long.311/). It involves a QA component, which is a UnifiedQA model continue fine-tuned on two additional biomedical QA datasets. The associated code is released in [this](https://github.com/khuangaf/ZeroFEC) repository.

	### How to use
	Using Huggingface pipeline abstraction:
	```python
	from transformers import pipeline

	nlp = pipeline("text2text-generation", model='khhuang/zerofec-daqa-t5-base', tokenizer='khhuang/zerofec-daqa-t5-base')

	QUESTION = "What is Night of the Living Dead?"
	CONTEXT = "Night of the Living Dead is a 1968 American independent horror film , directed by George A."

	def format_inputs(context: str, question: str):
	return f"{question} \n {context}"

	text = format_inputs(CONTEXT, QUESTION)

	nlp(text)
	# should output [{'generated_text': 'a 1968 american independent horror film'}]
	```

	Using the pre-trained model directly:
	```python
	from transformers import AutoTokenizer, AutoModelForSeq2SeqLM

	tokenizer = AutoTokenizer.from_pretrained('khhuang/zerofec-daqa-t5-base')
	model = AutoModelForSeq2SeqLM.from_pretrained('khhuang/zerofec-daqa-t5-base')

	QUESTION = "What is Night of the Living Dead?"
	CONTEXT = "Night of the Living Dead is a 1968 American independent horror film , directed by George A."

	def format_inputs(context: str, question: str):
	return f"{question} \n {context}"

	text = format_inputs(CONTEXT, QUESTION)


	input_ids = tokenizer(text, return_tensors="pt").input_ids
	generated_ids = model.generate(input_ids, max_length=32, num_beams=4)
	output = tokenizer.batch_decode(generated_ids, skip_special_tokens=True)
	print(output)
	# should output "a 1968 american independent horror film"
	```

	### Citation
	```
	@inproceedings{huang-etal-2023-zero,
	title = "Zero-shot Faithful Factual Error Correction",
	author = "Huang, Kung-Hsiang and
	Chan, Hou Pong and
	Ji, Heng",
	booktitle = "Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)",
	month = jul,
	year = "2023",
	address = "Toronto, Canada",
	publisher = "Association for Computational Linguistics",
	url = "https://aclanthology.org/2023.acl-long.311",
	doi = "10.18653/v1/2023.acl-long.311",
	pages = "5660--5676",
	}
	```