Tweeties in a Tweety World
community
AI & ML interests
Multilingual and Low-Resource NLP
Recent Activity
View all activity
Organization Card
The Tweeties is a series of foundation models incorporating native tokenizers for each language, for a better understanding and generation of text in these languages. These models are adapted from existing models using trans-tokenization, and further pre-trained on existing corpora.
Collections
1
models
6
Tweeties/tweety-7b-dutch-v24a
Text Generation
•
Updated
•
96
•
12
Tweeties/tweety-tatar-hydra-mt-7b-v24a
Text Generation
•
Updated
•
19
Tweeties/tweety-tatar-hydra-base-7b-v24a
Text Generation
•
Updated
•
18
Tweeties/tweety-7b-tatar-v24a
Text Generation
•
Updated
•
21
•
10
Tweeties/tweety-7b-armenian-v24a
Text Generation
•
Updated
•
31
•
1
Tweeties/tweety-7b-italian-v24a
Text Generation
•
Updated
•
16
•
2
datasets
None public yet