EPFL LLM Team

university

https://epfllm.github.io/Megatron-LLM/

epfLLM




epfl-llm's activity

lewtun posted an update 9 days ago


We outperform Llama 70B with Llama 3B on hard math by scaling test-time compute 🔥

How? By combining step-wise reward models with tree search algorithms :)

We show that smol models can match or exceed the performance of their much larger siblings when given enough "time to think".

We're open sourcing the full recipe and sharing a detailed blog post.

In our blog post we cover:

📈 Compute-optimal scaling: How we implemented DeepMind's recipe to boost the mathematical capabilities of open models at test-time.

🎄 Diverse Verifier Tree Search (DVTS): An unpublished extension of the verifier-guided tree search technique that we developed. This simple yet effective method improves diversity and delivers better performance, particularly at large test-time compute budgets.

🧭 Search and Learn: A lightweight toolkit for implementing search strategies with LLMs, built for speed with vLLM.

Here are the links:

- Blog post: HuggingFaceH4/blogpost-scaling-test-time-compute

- Code: https://github.com/huggingface/search-and-learn

Enjoy!


angelika authored a paper 19 days ago

Global MMLU: Understanding and Addressing Cultural and Linguistic Biases in Multilingual Evaluation

Paper • 2412.03304 • Published 21 days ago • 17

zechen-nlp authored a paper 21 days ago

INCLUDE: Evaluating Multilingual Language Understanding with Regional Knowledge

Paper • 2411.19799 • Published 26 days ago • 10

angelika authored 2 papers 23 days ago

MEDITRON-70B: Scaling Medical Pretraining for Large Language Models

Paper • 2311.16079 • Published Nov 27, 2023 • 20

CRAB: Assessing the Strength of Causal Relationships Between Real-world Events

Paper • 2311.04284 • Published Nov 7, 2023

atcbosselut authored 15 papers 23 days ago

COMET: Commonsense Transformers for Automatic Knowledge Graph Construction

Paper • 1906.05317 • Published Jun 12, 2019

Discovering Knowledge-Critical Subnetworks in Pretrained Language Models

Paper • 2310.03084 • Published Oct 4, 2023

RECKONING: Reasoning through Dynamic Knowledge Encoding

Paper • 2305.06349 • Published May 10, 2023 • 1

Breaking the Language Barrier: Improving Cross-Lingual Reasoning with Structured Self-Attention

Paper • 2310.15258 • Published Oct 23, 2023 • 2

CRAB: Assessing the Strength of Causal Relationships Between Real-world Events

Paper • 2311.04284 • Published Nov 7, 2023

Mitigating Label Biases for In-context Learning

Paper • 2305.19148 • Published May 28, 2023

MEDITRON-70B: Scaling Medical Pretraining for Large Language Models

Paper • 2311.16079 • Published Nov 27, 2023 • 20

The GEM Benchmark: Natural Language Generation, its Evaluation and Metrics

Paper • 2102.01672 • Published Feb 2, 2021

QA-GNN: Reasoning with Language Models and Knowledge Graphs for Question Answering

Paper • 2104.06378 • Published Apr 13, 2021

Evaluating Language Model Agency through Negotiations

Paper • 2401.04536 • Published Jan 9 • 1

On the Opportunities and Risks of Foundation Models

Paper • 2108.07258 • Published Aug 16, 2021

Fast Model Editing at Scale

Paper • 2110.11309 • Published Oct 21, 2021

GreaseLM: Graph REASoning Enhanced Language Models for Question Answering

Paper • 2201.08860 • Published Jan 21, 2022

Making Reasoning Matter: Measuring and Improving Faithfulness of Chain-of-Thought Reasoning

Paper • 2402.13950 • Published Feb 21

Deep Bidirectional Language-Knowledge Graph Pretraining

Paper • 2210.09338 • Published Oct 17, 2022 • 1