view article Article Train 400x faster Static Embedding Models with Sentence Transformers Jan 15 • 159
GLiREL -- Generalist Model for Zero-Shot Relation Extraction Paper • 2501.03172 • Published Jan 6 • 1
Trans-Tokenization and Cross-lingual Vocabulary Transfers: Language Adaptation of LLMs for Low-Resource NLP Paper • 2408.04303 • Published Aug 8, 2024 • 20
Granite 3.1 Language Models Collection A series of language models with 128K context length trained by IBM licensed under Apache 2.0 license. • 9 items • Updated 22 days ago • 60
Positions Datasets Collection Datasets where each row is a chess position • 4 items • Updated Jan 9 • 7
Common Models Collection The first generation of models pretrained on Common Corpus. • 5 items • Updated Dec 5, 2024 • 30
Tucano Collection Tucano is a series of decoder-transformers based on the Llama 2 architecture, natively pre-trained in Portuguese. • 17 items • Updated Nov 13, 2024 • 2