Join the conversation

Join the community of Machine Learners and AI enthusiasts.

Sign Up
nicolay-rΒ 
posted an update Oct 4
Post
739
πŸ“’ The fast application of named entity recognition (NER) model towards vast amout of texts usually serves two major pitfalls:
πŸ”΄ Limitation of the input window size
πŸ”΄ Drastically slows down the downstream pipeline of the whole application

⭐ https://github.com/nicolay-r/bulk-ner

To address these problems, bulk-ner represent a no-string framework with the handy wrapping over any dynamically linked NER-ml model by providing:
β˜‘οΈ Native long-input contexts handling.
β˜‘οΈ Native support of batching (assuming that ML-model engine has the related support too)

To quick start, sharing the wrapper over DeepPavlov NER models.
With the application of such models you can play and bulk your data here:
πŸ“™ https://colab.research.google.com/github/nicolay-r/ner-service/blob/main/NER_annotation_service.ipynb
(You have to have your data in CSV / JSONL format)

Lastly, it is powered by AREkit pipelines, and therefore could be a part of the relation extraction and complex information retrieval systems:
πŸ’» https://github.com/nicolay-r/AREkit
πŸ“„ https://openreview.net/forum?id=nRybAsJMUt
In this post