Language and Cognition Lab (UCSD)

university

https://langcoglab.ucsd.edu/

Recent Activity

language-and-cognition-ucsd's activity

catherinearnett

authored 3 papers about 2 months ago

BPE Gets Picky: Efficient Vocabulary Refinement During Tokenizer Training

Paper • 2409.04599 • Published Sep 6 • 1

Structural Priming Demonstrates Abstract Grammatical Representations in Multilingual Language Models

Paper • 2311.09194 • Published Nov 15, 2023

Toxicity of the Commons: Curating Open-Source Pre-Training Data

Paper • 2410.22587 • Published Oct 29 • 9

camrobjones

authored a paper 2 months ago

People cannot distinguish GPT-4 from a human in a Turing test

Paper • 2405.08007 • Published May 9

catherinearnett

authored 3 papers 4 months ago

Goldfish: Monolingual Language Models for 350 Languages

Paper • 2408.10441 • Published Aug 19

Different Tokenization Schemes Lead to Comparable Performance in Spanish Number Agreement

Paper • 2403.13754 • Published Mar 20

A Bit of a Problem: Measurement Disparities in Dataset Sizes Across Languages

Paper • 2403.00686 • Published Mar 1

tylerachang

authored a paper 4 months ago

Goldfish: Monolingual Language Models for 350 Languages

Paper • 2408.10441 • Published Aug 19

catherinearnett

authored a paper 9 months ago

When Is Multilinguality a Curse? Language Modeling for 250 High- and Low-Resource Languages

Paper • 2311.09205 • Published Nov 15, 2023

camrobjones

authored a paper about 1 year ago

Does GPT-4 Pass the Turing Test?

Paper • 2310.20216 • Published Oct 31, 2023 • 17

Team members 10
