devngho PRO

https://ngho.dev

AI & ML interests

Efficient Korean NLP, Fine Korean datasets

Recent Activity

updated a model 1 day ago

devngho/gaenari-llama-3.2-3b-inst-preview

published a model 2 days ago

devngho/gaenari-llama-3.2-3b-inst-preview

updated a model 4 days ago

devngho/llama3-jamo-tokenizer

devngho's activity

upvoted a paper 6 months ago

Were RNNs All We Needed?

Paper • 2410.01201 • Published Oct 2, 2024 • 52

upvoted a collection 7 months ago

Qwen2.5

Qwen2.5 language models, including pretrained and instruction-tuned models in 7 sizes: 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 46 items • Updated Feb 26 • 599

upvoted a paper 7 months ago

CulturaX: A Cleaned, Enormous, and Multilingual Dataset for Large Language Models in 167 Languages

Paper • 2309.09400 • Published Sep 17, 2023 • 85

upvoted 3 papers 8 months ago

Instruction Pre-Training: Language Models are Supervised Multitask Learners

Paper • 2406.14491 • Published Jun 20, 2024 • 94

Pythia: A Suite for Analyzing Large Language Models Across Training and Scaling

Paper • 2304.01373 • Published Apr 3, 2023 • 9

Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies

Paper • 2407.13623 • Published Jul 18, 2024 • 57

upvoted a paper 9 months ago

Trans-Tokenization and Cross-lingual Vocabulary Transfers: Language Adaptation of LLMs for Low-Resource NLP

Paper • 2408.04303 • Published Aug 8, 2024 • 21