CMU LTI Wav2Gloss Project

university

Activity Feed

wav2gloss's activity

gneubig

authored a paper 6 days ago

TheAgentCompany: Benchmarking LLM Agents on Consequential Real World Tasks

Paper • 2412.14161 • Published 6 days ago • 43

gneubig

authored a paper 16 days ago

MAmmoTH-VL: Eliciting Multimodal Reasoning with Instruction Tuning at Scale

Paper • 2412.05237 • Published 19 days ago • 45

gneubig

authored a paper 19 days ago

Evaluating Language Models as Synthetic Data Generators

Paper • 2412.03679 • Published 20 days ago • 43

gneubig

authored a paper about 1 month ago

OpenScholar: Synthesizing Scientific Literature with Retrieval-augmented LMs

Paper • 2411.14199 • Published Nov 21 • 28

gneubig

authored 2 papers 2 months ago

JMMMU: A Japanese Massive Multi-discipline Multimodal Understanding Benchmark for Culture-aware Evaluation

Paper • 2410.17250 • Published Oct 22 • 14

Pangea: A Fully Open Multilingual Multimodal LLM for 39 Languages

Paper • 2410.16153 • Published Oct 21 • 43

AnjaliRuban

authored a paper 2 months ago

Pangea: A Fully Open Multilingual Multimodal LLM for 39 Languages

Paper • 2410.16153 • Published Oct 21 • 43

gneubig

authored 2 papers 2 months ago

NaturalBench: Evaluating Vision-Language Models on Natural Adversarial Samples

Paper • 2410.14669 • Published Oct 18 • 36

Harnessing Webpage UIs for Text-Rich Visual Understanding

Paper • 2410.13824 • Published Oct 17 • 29

gneubig

authored a paper 3 months ago

Agent Workflow Memory

Paper • 2409.07429 • Published Sep 11 • 28

gneubig

authored a paper 4 months ago

MMMU-Pro: A More Robust Multi-discipline Multimodal Understanding Benchmark

Paper • 2409.02813 • Published Sep 4 • 28

juice500

updated a dataset 5 months ago

wav2gloss/fieldwork

Viewer • Updated Aug 1 • 80.5k • 60

gneubig

authored a paper 5 months ago

OpenDevin: An Open Platform for AI Software Developers as Generalist Agents

Paper • 2407.16741 • Published Jul 23 • 68

gneubig

authored 2 papers 6 months ago

VIMI: Grounding Video Generation through Multi-modal Instruction

Paper • 2407.06304 • Published Jul 8 • 9

Training Task Experts through Retrieval Based Distillation

Paper • 2407.05463 • Published Jul 7 • 7

sw005320

authored a paper 6 months ago

Towards Robust Speech Representation Learning for Thousands of Languages

Paper • 2407.00837 • Published Jun 30 • 10

gneubig

authored a paper 8 months ago

Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models

Paper • 2405.01535 • Published May 2 • 119

gneubig

authored 2 papers 10 months ago

SOTOPIA-$π$: Interactive Learning of Socially Intelligent Language Agents

Paper • 2403.08715 • Published Mar 13 • 20

Instruction-tuned Language Models are Better Knowledge Learners

Paper • 2402.12847 • Published Feb 20 • 25

sw005320

authored a paper 11 months ago

OWSM v3.1: Better and Faster Open Whisper-Style Speech Models based on E-Branchformer

Paper • 2401.16658 • Published Jan 30 • 13

Team members 12
