Iman998 (Iman Barati)

upvoted 4 papers about 1 month ago

Law of the Weakest Link: Cross Capabilities of Large Language Models

Paper • 2409.19951 • Published Sep 30 • 53

Programming Every Example: Lifting Pre-training Data Quality like Experts at Scale

Paper • 2409.17115 • Published Sep 25 • 59

Instruction Following without Instruction Tuning

Paper • 2409.14254 • Published Sep 21 • 27

Emu3: Next-Token Prediction is All You Need

Paper • 2409.18869 • Published Sep 27 • 89

upvoted 2 papers about 2 months ago

LLMs + Persona-Plug = Personalized LLMs

Paper • 2409.11901 • Published Sep 18 • 30

To CoT or not to CoT? Chain-of-thought helps mainly on math and symbolic reasoning

Paper • 2409.12183 • Published Sep 18 • 36

upvoted an article about 2 months ago

Article

Fine-tuning LLMs to 1.58bit: extreme quantization made easy

Sep 18

• 198

upvoted an article 3 months ago

Article

Welcome FalconMamba: The first strong attention-free 7B model

Aug 12

• 102

upvoted 2 papers 4 months ago

Scaling Synthetic Data Creation with 1,000,000,000 Personas

Paper • 2406.20094 • Published Jun 28 • 94

Qwen2 Technical Report

Paper • 2407.10671 • Published Jul 15 • 155

upvoted 2 papers 5 months ago

Instruction Pre-Training: Language Models are Supervised Multitask Learners

Paper • 2406.14491 • Published Jun 20 • 85

Magpie: Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing

Paper • 2406.08464 • Published Jun 12 • 65

Iman Barati

AI & ML interests

Organizations

Iman998's activity