---
title: Bolete
emoji: 🍄
colorFrom: green
colorTo: red
sdk: streamlit
sdk_version: 1.10.0
app_file: app.py
pinned: false
license: mit
---
# bolete 

An information extraction and exploration app. Upload files that contain text; Bolete extracts the text, identifies common keywords and entities, and builds a simple search interface for exploring the corpus.
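As a rough illustration of the search step, here is a minimal sketch of keyword search over a small corpus using an inverted index. It is not Bolete's actual implementation: the function names (`tokenize`, `build_index`, `search`) and the `docs` dict standing in for uploaded files are all hypothetical, and only the Python standard library is assumed.

```python
import re
from collections import defaultdict


def tokenize(text):
    """Lowercase a string and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def build_index(docs):
    """Map each token to the set of document names that contain it.

    `docs` is a hypothetical {name: text} dict standing in for uploaded files.
    """
    index = defaultdict(set)
    for name, text in docs.items():
        for token in tokenize(text):
            index[token].add(name)
    return index


def search(index, query):
    """Return the sorted names of documents that contain every query token."""
    tokens = tokenize(query)
    if not tokens:
        return []
    hits = set.intersection(*(index.get(t, set()) for t in tokens))
    return sorted(hits)


# Made-up sample corpus for illustration only.
docs = {
    "a.txt": "Boletes are mushrooms with pores instead of gills.",
    "b.txt": "Some mushrooms have gills; chanterelles have ridges.",
}
index = build_index(docs)
print(search(index, "pores"))
```

This is plain exact-match retrieval with AND semantics. It does no ranking, stemming, or semantic matching, so it is only a baseline to compare other approaches against.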

- Search over the collection's texts
  - open question: how best to search the collection?
  - Holmes?
- Entities and frequencies
  - TODO: run NER and return filters for the most frequent entities in the corpus

- Fun
  - use with scispaCy models (`en_core_sci_lg`, `en_core_sci_scibert`) rather than a generic model
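
The entity-frequency TODO above could be sketched roughly as follows. This is an assumption about how the filters might be derived, not the app's code: `top_entities` is a hypothetical helper, and the sample entities and their labels are made up. The NER step itself is left out so the snippet runs with the standard library alone.

```python
from collections import Counter


def top_entities(entities_per_doc, n=10):
    """Count entity mentions across documents and return the n most frequent.

    `entities_per_doc` is an iterable with one list of (text, label) pairs
    per document. Entity text is lowercased so case variants are merged.
    """
    counts = Counter()
    for ents in entities_per_doc:
        counts.update((text.lower(), label) for text, label in ents)
    return counts.most_common(n)


# Made-up NER output for three documents; labels are illustrative only.
entities_per_doc = [
    [("Boletus edulis", "TAXON"), ("Europe", "GPE")],
    [("boletus edulis", "TAXON"), ("Asia", "GPE"), ("Europe", "GPE")],
    [("Boletus edulis", "TAXON")],
]
filters = top_entities(entities_per_doc, n=2)
print(filters)
# → [(('boletus edulis', 'TAXON'), 3), (('europe', 'GPE'), 2)]
```

With spaCy or scispaCy, the per-document pairs could be built from a processed document as `[(ent.text, ent.label_) for ent in nlp(text).ents]`, where `nlp` is whichever pipeline has been loaded and installed.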