---
title: Bolete
emoji: 🍄
colorFrom: green
colorTo: red
sdk: streamlit
sdk_version: 1.10.0
app_file: app.py
pinned: false
license: mit
---
# bolete
An information extraction and exploration app. Upload files containing text. Bolete then extracts the text, identifies common keywords and entities, and builds a simple search interface for exploring the corpus.
- Search of collection texts
  - how best to search the collection?
  - Holmes?
- Entities and frequencies
  - TODO: run NER, return filters for the most frequent entities in the corpus
- Fun
  - use with scispaCy rather than a generic model (`en_core_sci_lg` + `en_core_sci_scibert`)
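
A rough sketch of the "Entities and frequencies" TODO, not the app's actual implementation. It assumes entities arrive as `(text, label)` pairs per document. The names `most_frequent_entities` and `extract_entities` are hypothetical, and `en_core_web_sm` is only a placeholder default that could be swapped for a scispaCy model such as `en_core_sci_lg` once installed.

```python
from collections import Counter
from typing import Iterable, List, Tuple

# (entity text, entity label); the labels depend on the model used
Entity = Tuple[str, str]


def most_frequent_entities(
    docs: Iterable[Iterable[Entity]], top_n: int = 10
) -> List[Tuple[Entity, int]]:
    """Count entities across the whole corpus and return the top_n most frequent."""
    counts: Counter = Counter()
    for ents in docs:
        # Lowercase and strip so "Aspirin" and "aspirin " count as one entity
        counts.update((text.strip().lower(), label) for text, label in ents)
    return counts.most_common(top_n)


def extract_entities(texts: Iterable[str], model_name: str = "en_core_web_sm"):
    """Hypothetical helper: run spaCy NER and return (text, label) pairs per document.

    Assumes spaCy and the named model are installed.
    """
    import spacy  # imported lazily so the counting code runs without spaCy

    nlp = spacy.load(model_name)
    return [[(ent.text, ent.label_) for ent in doc.ents] for doc in nlp.pipe(texts)]
```

The pairs returned by `most_frequent_entities(extract_entities(texts))` could then be offered as filter options in the search interface.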