Language and Cognition Lab (UCSD)

university

https://langcoglab.ucsd.edu/

language-and-cognition-ucsd's activity

catherinearnett

authored 3 papers 2 months ago

BPE Gets Picky: Efficient Vocabulary Refinement During Tokenizer Training

Paper • 2409.04599 • Published Sep 6, 2024 • 1

Structural Priming Demonstrates Abstract Grammatical Representations in Multilingual Language Models

Paper • 2311.09194 • Published Nov 15, 2023

Toxicity of the Commons: Curating Open-Source Pre-Training Data

Paper • 2410.22587 • Published Oct 29, 2024 • 10

camrobjones

authored a paper 3 months ago

People cannot distinguish GPT-4 from a human in a Turing test

Paper • 2405.08007 • Published May 9, 2024

catherinearnett

authored 3 papers 5 months ago

Goldfish: Monolingual Language Models for 350 Languages

Paper • 2408.10441 • Published Aug 19, 2024

Different Tokenization Schemes Lead to Comparable Performance in Spanish Number Agreement

Paper • 2403.13754 • Published Mar 20, 2024

A Bit of a Problem: Measurement Disparities in Dataset Sizes Across Languages

Paper • 2403.00686 • Published Mar 1, 2024

tylerachang

authored a paper 5 months ago

Goldfish: Monolingual Language Models for 350 Languages

Paper • 2408.10441 • Published Aug 19, 2024

catherinearnett

authored a paper 10 months ago

When Is Multilinguality a Curse? Language Modeling for 250 High- and Low-Resource Languages

Paper • 2311.09205 • Published Nov 15, 2023

camrobjones

authored a paper about 1 year ago

Does GPT-4 Pass the Turing Test?

Paper • 2310.20216 • Published Oct 31, 2023 • 17

Team members: 10
