Muhammad Farrukh Mehmood

sfarrukhm

AI & ML interests

Generative AI, LLM, SLM

Recent Activity

updated a model 2 days ago

sfarrukhm/mistral-7b-clpsych25-v1

published a model 2 days ago

sfarrukhm/mistral-7b-clpsych25-v1

published a model 11 days ago

sfarrukhm/flan-t5-clpysch-summary

View all activity

Organizations

sfarrukhm's activity

upvoted 2 papers 30 days ago

Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate

Paper • 2501.17703 • Published about 1 month ago • 55

Parameters vs FLOPs: Scaling Laws for Optimal Sparsity for Mixture-of-Experts Language Models

Paper • 2501.12370 • Published Jan 21 • 11

upvoted 3 articles about 1 month ago

Article

Open-R1: a fully open reproduction of DeepSeek-R1

Jan 28

• 783

Article

Fine-tune ModernBERT for RAG with Synthetic Data

and 2 others •

Jan 20

• 36

Article

Preference Tuning LLMs with Direct Preference Optimization Methods

Jan 18, 2024

• 47

upvoted a collection about 1 month ago

Preference Datasets for DPO

Collection

This collection contains a list of curated preference datasets for DPO fine-tuning for intent alignment of LLMs • 7 items • Updated Dec 11, 2024 • 40

upvoted a paper about 1 month ago

Evolving Deeper LLM Thinking

Paper • 2501.09891 • Published Jan 17 • 106

upvoted 2 articles about 1 month ago

Article

The Large Language Model Course

•

Jan 16

• 114

Article

Train 400x faster Static Embedding Models with Sentence Transformers

Jan 15

• 151

upvoted an article 7 months ago

Article

A Gentle Introduction to 8-bit Matrix Multiplication for transformers at scale using transformers, accelerate and bitsandbytes

Aug 17, 2022

• 75