nthakur's Collections

🦢 SWIM-IR Dataset [NAACL'24]

29 million Synthetic Wikipedia-based Multilingual Retrieval Training Pairs.