MLE-bench: Evaluating Machine Learning Agents on Machine Learning Engineering Paper • 2410.07095 • Published Oct 9, 2024 • 6
Aria: An Open Multimodal Native Mixture-of-Experts Model Paper • 2410.05993 • Published Oct 8, 2024 • 108
DBRX Collection DBRX is a mixture-of-experts (MoE) large language model trained from scratch by Databricks. • 3 items • Updated Mar 27, 2024 • 92
Whisper Release Collection Whisper includes both English-only and multilingual checkpoints for automatic speech recognition (ASR) and speech translation (ST), ranging from 39M params for the tiny models to 1.5B params for large. • 12 items • Updated Sep 13, 2023 • 93
Self-Discover: Large Language Models Self-Compose Reasoning Structures Paper • 2402.03620 • Published Feb 6, 2024 • 114
Handbook v0.1 models and datasets Collection Models and datasets for v0.1 of the alignment handbook • 6 items • Updated Nov 10, 2023 • 24
DPO vs KTO vs IPO Collection A collection of datasets and models used for the "Aligning LLMs with Direct Preference Optimization Methods" blog post • 2 items • Updated Jan 16, 2024 • 12
Constitutional AI Collection A collection of datasets and models that accompany the Constitutional AI recipe. See hf.co/blog/constitutional-ai for more details. • 9 items • Updated Feb 1, 2024 • 5
Tulu V2 Suite Collection The set of models associated with the paper "Camels in a Changing Climate: Enhancing LM Adaptation with Tulu 2" • 19 items • Updated 5 days ago • 43
Paloma Collection Dataset and baseline models for Paloma, a benchmark of language model fit to 585 text domains • 8 items • Updated 5 days ago • 15
OLMo Suite Collection Artifacts for the first set of OLMo models. • 18 items • Updated 5 days ago • 70