apjanco
metadata
title: Bolete
emoji: 🍄
colorFrom: green
colorTo: red
sdk: streamlit
sdk_version: 1.10.0
app_file: app.py
pinned: false
license: mit

bolete

An information extraction and exploration app. Upload files that contain text, and Bolete will extract the text, identify common keywords and entities, and build a simple search interface for exploring the corpus.
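
How best to search the collection is still an open question in this plan, so the following is only a minimal sketch of one option: a plain inverted index with AND matching, written in standard-library Python. The names `tokenize`, `build_index`, `search`, and the sample `corpus` dictionary are hypothetical and are not taken from `app.py`.

```python
import re
from collections import defaultdict


def tokenize(text):
    """Lowercase a string and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def build_index(corpus):
    """Map each token to the set of document names that contain it."""
    index = defaultdict(set)
    for name, text in corpus.items():
        for token in tokenize(text):
            index[token].add(name)
    return index


def search(index, query):
    """Return sorted names of documents containing every query token."""
    tokens = tokenize(query)
    if not tokens:
        return []
    # .get avoids creating empty entries in the defaultdict for unknown tokens.
    hits = set.intersection(*(index.get(t, set()) for t in tokens))
    return sorted(hits)


# Hypothetical stand-in for text extracted from uploaded files.
corpus = {
    "a.txt": "Boletes are mushrooms with pores instead of gills.",
    "b.txt": "Many mushrooms have gills under the cap.",
}
index = build_index(corpus)
print(search(index, "mushrooms gills"))  # ['a.txt', 'b.txt']
print(search(index, "pores"))            # ['a.txt']
```

An index like this only matches exact tokens. Ranking, stemming, or semantic matching would need a different approach, which is what the search items in the plan below leave open.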

  • Search of collection texts

    • how best to search collection?
    • Holmes?
  • Entities and frequencies

    • TODO: run NER and return filters for the most frequent entities in the corpus
  • Fun

    • use with scispaCy models (en_core_sci_lg + en_core_sci_scibert) rather than a generic model
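
The entity item in the plan above (run NER, then offer the most frequent entities as filters) could be sketched roughly as follows. This is an assumption about how the step might look, not the app's actual code: `extract_entities` and `most_frequent_entities` are hypothetical helper names, the default model `en_core_web_sm` is just a placeholder for any installed spaCy pipeline, and the `sample` data is hand-written rather than real model output.

```python
from collections import Counter


def extract_entities(texts, model_name="en_core_web_sm"):
    """Run spaCy NER and return one list of (text, label) pairs per document.

    Assumes spaCy and the named model are installed. A scispaCy model name
    could be passed instead if that package and model are available.
    """
    import spacy  # imported lazily so the counting helper works without spaCy

    nlp = spacy.load(model_name)
    return [[(ent.text, ent.label_) for ent in doc.ents] for doc in nlp.pipe(texts)]


def most_frequent_entities(docs_entities, n=10):
    """Count (entity text, label) pairs across all documents, case-insensitively."""
    counts = Counter()
    for ents in docs_entities:
        counts.update((text.strip().lower(), label) for text, label in ents)
    return counts.most_common(n)


# Hand-written example in the shape extract_entities returns (not real NER output).
sample = [
    [("Boletus edulis", "ORG"), ("Europe", "GPE")],
    [("Europe", "GPE"), ("boletus edulis", "ORG")],
    [("Europe", "GPE")],
]
print(most_frequent_entities(sample, n=2))
# [(('europe', 'GPE'), 3), (('boletus edulis', 'ORG'), 2)]
```

The resulting pairs could then be offered as filter options in the Streamlit interface, for example through a widget such as `st.multiselect`. That wiring is left out here.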