Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
2
2
1
Catherine Arnett
catherinearnett
Follow
BramVanroy's profile picture
robotBLZ's profile picture
shirkey's profile picture
22 followers
·
6 following
https://catherinearnett.github.io/
linguist_cat
catherinearnett
catherinearnett.bsky.social
AI & ML interests
multilingual NLP, tokenization
Recent Activity
upvoted
an
article
16 days ago
They Said It Couldn’t Be Done
updated
a model
17 days ago
PleIAs/Pleias-Nano
updated
a model
17 days ago
PleIAs/Pleias-1.2b-Preview
View all activity
Articles
They Said It Couldn’t Be Done
16 days ago
•
71
Releasing the largest multilingual open pretraining dataset
Nov 13
•
98
Detoxifying the Commons
Oct 31
•
6
wHy DoNt YoU jUsT uSe ThE lLaMa ToKeNiZeR??
Sep 27
•
38
Organizations
catherinearnett
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
liked
a dataset
6 months ago
ambean/lingOly
Viewer
•
Updated
Jun 11
•
90
•
106
•
7