-
CulturaX: A Cleaned, Enormous, and Multilingual Dataset for Large Language Models in 167 Languages
Paper • 2309.09400 • Published • 84 -
YaRN: Efficient Context Window Extension of Large Language Models
Paper • 2309.00071 • Published • 65 -
Language Modeling Is Compression
Paper • 2309.10668 • Published • 82
Georgi Kirov
crispy-g
AI & ML interests
RL, Graphs, general mayhem
Organizations
None yet
Collections
1
models
None public yet
datasets
None public yet