Allyson Ettinger's picture

1

Allyson Ettinger

aettinger

·

AI & ML interests

None yet

Recent Activity

authored a paper about 2 months ago

Faith and Fate: Limits of Transformers on Compositionality

authored a paper about 2 months ago

COMPS: Conceptual Minimal Pair Sentences for testing Robust Property Knowledge and its Inheritance in Pre-trained Language Models

authored a paper about 2 months ago

WildGuard: Open One-Stop Moderation Tools for Safety Risks, Jailbreaks, and Refusals of LLMs

View all activity

Organizations

aettinger's activity

authored 4 papers about 2 months ago

Faith and Fate: Limits of Transformers on Compositionality

Paper • 2305.18654 • Published May 29, 2023 • 6

COMPS: Conceptual Minimal Pair Sentences for testing Robust Property Knowledge and its Inheritance in Pre-trained Language Models

Paper • 2210.01963 • Published Oct 5, 2022

WildGuard: Open One-Stop Moderation Tools for Safety Risks, Jailbreaks, and Refusals of LLMs

Paper • 2406.18495 • Published Jun 26 • 12

AI as Humanity's Salieri: Quantifying Linguistic Creativity of Language Models via Systematic Attribution of Machine Text against Web Text

Paper • 2410.04265 • Published Oct 5

authored a paper 6 months ago

WildTeaming at Scale: From In-the-Wild Jailbreaks to (Adversarially) Safer Language Models

Paper • 2406.18510 • Published Jun 26 • 8

authored a paper about 1 year ago

The Generative AI Paradox: "What It Can Create, It May Not Understand"

Paper • 2311.00059 • Published Oct 31, 2023 • 18