Language Modelling with Phonemes

https://github.com/codebyzeb/PhonemeTransformers

tweetbyzeb

codebyzeb

Activity Feed Request to join this org

AI & ML interests

Child language acquisition, CHILDES, word segmentation, phonemes, BabyLM

Recent Activity

codebyzeb updated a model 14 days ago

phonemetransformers/babylm-subwords-text-gpt2_lm-model

codebyzeb published a model 15 days ago

phonemetransformers/babylm-subwords-text-gpt2_lm-model

codebyzeb updated a model 16 days ago

phonemetransformers/BABYLM-TOKENIZER-MEAN-ENTROPY-TXT

View all activity

Collections 1

spaces 1

Runtime error

segmentation_scores

🚀

models 125

datasets 2

phonemetransformers/CHILDES

Viewer • Updated Feb 17 • 12.5M • 1.38k • 1

phonemetransformers/BabyLM-phonemized

Viewer • Updated Jan 13 • 12.5M • 300

Language Modelling with Phonemes

AI & ML interests

Recent Activity

Collections 1

phonemetransformers/BabyLM-phonemized

phonemetransformers/BABYLM-TOKENIZER-CHAR-PHON

phonemetransformers/BABYLM-TOKENIZER-BPE-PHON

phonemetransformers/BABYLM-TOKENIZER-CHAR-TXT

spaces 1

segmentation_scores

models 125

phonemetransformers/babylm-subwords-text-gpt2_lm-model

phonemetransformers/BABYLM-TOKENIZER-MEAN-ENTROPY-TXT

phonemetransformers/babylm-subwords-2-gpt2_lm-model

phonemetransformers/babylm-subwords-gpt2_lm-model

phonemetransformers/BABYLM-TOKENIZER-MEAN-Entropy-SPACELESS

phonemetransformers/BABYLM-TOKENIZER-MIN-Entropy-SPACELESS

phonemetransformers/BABYLM-TOKENIZER-MIN-Boundaryprediction-SPACELESS

phonemetransformers/BABYLM-TOKENIZER-MEAN-Boundaryprediction-SPACELESS

phonemetransformers/childes-segmentation-random-18M-gpt2_lm-model

phonemetransformers/childes-segmentation-100k-gpt2_lm-model

datasets 2

phonemetransformers/CHILDES

phonemetransformers/BabyLM-phonemized

AI & ML interests

Recent Activity

Team members 1

Collections 1

spaces 1

segmentation_scores

models 125 Sort: Recently updated

datasets 2 Sort: Recently updated

models 125

datasets 2