regel-corpus's picture
Upload folder using huggingface_hub
19e407c verified
---
tags:
- flair
- hunflair
- token-classification
- sequence-tagger-model
language: en
widget:
- text: It contains a functional GCGGCGGCG Egr-1-binding site
---
## HunFlair2 model for TFBS
[HunFlair](https://github.com/flairNLP/flair/blob/master/resources/docs/HUNFLAIR2.md) (biomedical flair) for enhancer entity:
- pre-trained language model: michiyasunaga/BioLinkBERT-base
- fine-tuned on RegEl corpus for `Tfbs` entity type
Predicts 1 tag:
| **tag** | **meaning** |
| ------- | ---------------------------------------- |
| Tfbs | DNA region bound by transcription factor |
______________________________________________________________________
## Info
### Demo: How to use in Flair
Requires:
- **[Flair](https://github.com/flairNLP/flair/)>=0.14.0** (`pip install flair` or `pip install git+https://github.com/flairNLP/flair.git`)
```python
from flair.data import Sentence
from flair.nn import Classifier
from flair.tokenization import SciSpacyTokenizer
text = "We found that Egr-1 specifically binds to the PTEN 5' untranslated region, which contains a functional GCGGCGGCG Egr-1-binding site."
sentence = Sentence(text, use_tokenizer=SciSpacyTokenizer())
tagger = Classifier.load("regel-corpus/hunflair2-regel-tfbs")
tagger.predict(sentence)
print('The following NER tags are found:')
# iterate over entities and print
for entity in sentence.get_spans('ner'):
print(entity)
```