EleutherAI

non-profit

Verified

https://eleuther.ai

AIEleuther

EleutherAI

Activity Feed Request to join this org

AI & ML interests

Large language models, scaling laws, AI Alignment, democratization of DL

Recent Activity

hyunwoongko authored a paper 1 day ago

Kanana: Compute-efficient Bilingual Language Models

pietrolesci authored a paper 1 day ago

Self-Training Large Language Models for Tool-Use Without Demonstrations

oskarvanderwal published a dataset 1 day ago

EleutherAI/polypythias-evals

View all activity

EleutherAI's activity

hyunwoongko

authored a paper 1 day ago

Kanana: Compute-efficient Bilingual Language Models

Paper • 2502.18934 • Published 3 days ago • 50

pietrolesci

authored a paper 1 day ago

Self-Training Large Language Models for Tool-Use Without Demonstrations

Paper • 2502.05867 • Published 19 days ago

oskarvanderwal

published a dataset 1 day ago

EleutherAI/polypythias-evals

Preview • Updated Sep 11, 2024

bzantium

authored a paper 1 day ago

Kanana: Compute-efficient Bilingual Language Models

Paper • 2502.18934 • Published 3 days ago • 50

Kyle1668

published a dataset 2 days ago

EleutherAI/filtering-pretraining-mix

Updated 2 days ago • 5

avi-skowron

authored a paper 3 days ago

Beyond Release: Access Considerations for Generative AI Systems

Paper • 2502.16701 • Published 5 days ago • 9

amphora

authored a paper 3 days ago

Linguistic Generalizability of Test-Time Scaling in Mathematical Reasoning

Paper • 2502.17407 • Published 4 days ago • 22

craffel

authored a paper 22 days ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published 24 days ago • 195

stellaathena

authored a paper about 1 month ago

Open Problems in Mechanistic Interpretability

Paper • 2501.16496 • Published Jan 27 • 19

storytracer

authored a paper about 1 month ago

Towards Best Practices for Open Datasets for LLM Training

Paper • 2501.08365 • Published Jan 14 • 55

avi-skowron

authored a paper about 1 month ago

Towards Best Practices for Open Datasets for LLM Training

Paper • 2501.08365 • Published Jan 14 • 55

stellaathena

authored a paper about 1 month ago

Towards Best Practices for Open Datasets for LLM Training

Paper • 2501.08365 • Published Jan 14 • 55

Skylion007

authored a paper about 2 months ago

The GAN is dead; long live the GAN! A Modern GAN Baseline

Paper • 2501.05441 • Published Jan 9 • 88

ncoop57

authored 2 papers 2 months ago

Stable Code Technical Report

Paper • 2404.01226 • Published Apr 1, 2024 • 1

Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference

Paper • 2412.13663 • Published Dec 18, 2024 • 134

akhaliq

posted an update 2 months ago

Post

11863

Google drops Gemini 2.0 Flash Thinking

a new experimental model that unlocks stronger reasoning capabilities and shows its thoughts. The model plans (with thoughts visible), can solve complex problems with Flash speeds, and more

now available in anychat, try it out: akhaliq/anychat