# Datasets

We plan to train the models on the following datasets.

## Base Model Training

[The British National Corpus](http://www.natcorp.ox.ac.uk/)

## Question Answering

[WikiQA (Wikipedia Open-Domain Question Answering)](https://paperswithcode.com/dataset/wikiqa)

## Reasoning

[Avicenna: Syllogistic Commonsense Reasoning](https://github.com/ZeinabAghahadi/Syllogistic-Commonsense-Reasoning)

## Consistency

[Stanford Natural Language Inference Corpus](https://www.kaggle.com/datasets/stanfordu/stanford-natural-language-inference-corpus)