This is a version of the SciBERT encoder trained for the purpose of retrieving datasets by textual description given a natural language query.

If useful, please cite

@inproceedings{viswanathan23acl,
    title = {DataFinder: Scientific Dataset Recommendation from Natural Language Descriptions},
    author = {Vijay Viswanathan and Luyu Gao and Tongshuang Wu and Pengfei Liu and Graham Neubig},
    booktitle = {Annual Conference of the Association for Computational Linguistics (ACL)},
    address = {Toronto, Canada},
    month = {July},
    url = {https://arxiv.org/abs/2305.16636},
    year = {2023}
}
Downloads last month
7
Inference Examples
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.