geonmin-kim's picture
Upload folder using huggingface_hub
d6585f5
|
raw
history blame
No virus
1.32 kB

UniCOIL with ElasticSearch

  1. Setup ElasticSearch with Docker by following document here.
docker pull docker.elastic.co/elasticsearch/elasticsearch:8.2.3
docker network create elastic
docker run --name es01 --net elastic -d -p 9200:9200 -p 9300:9300 \
           -e "discovery.type=single-node" \
           -e "xpack.security.enabled=false" \
           -it docker.elastic.co/elasticsearch/elasticsearch:8.2.2
  1. (Optional) Setup Kibana by following document here.
docker pull docker.elastic.co/kibana/kibana:8.2.2
docker run -d --name kib-01 --net elastic -p 5601:5601 docker.elastic.co/kibana/kibana:8.2.2
  1. Create ES index
python create_es.py

This will create index based on two search field:

  • document: contains raw text for BM25 search
  • vector: contains pseudo text from uniCOIL for impact search
  1. Create document entry for BM25 index
python index_bm25.py
  1. Add uniCOIL encoded document for impact search
python index_unicoil_update.py
  1. BM25 search
python search_bm25.py
  1. uniCOIL search
python search_unicoil.py
  1. Hybrid Search
python search_unicoil.py