QARAC / DataSets.md
PeteBleackley
Coreference Resolution for WikiQA dataset
e149b0f
|
raw
history blame
560 Bytes
# Datasets
We are planning to use the following datasets to train the models.
## Base Model Training
[The British National Corpus](http://www.natcorp.ox.ac.uk/)
## Question Answering
[WikiQA (Wikipedia Open-Domain Question Answering](https://paperswithcode.com/dataset/wikiqa)
## Reasoning
[Avicenna: Syllogistic Commonsense Reasoning](https://github.com/ZeinabAghahadi/Syllogistic-Commonsense-Reasoning)
## Consistency
[Stanford Natural Language Inference Corpus](https://www.kaggle.com/datasets/stanfordu/stanford-natural-language-inference-corpus)