Daniel Dahlmeier's picture

1 2

Daniel Dahlmeier

ddahlmeier

ddahlmeier

AI & ML interests

NLP

Recent Activity

new activity 24 days ago

McGill-NLP/feedbackQA:NonMatchingChecksumError

reacted to philschmid's post with 👍 4 months ago

What's the best way to fine-tune open LLMs in 2024? Look no further! 👀 I am excited to share “How to Fine-Tune LLMs in 2024 with Hugging Face” using the latest research techniques, including Flash Attention, Q-LoRA, OpenAI dataset formats (messages), ChatML, Packing, all built with Hugging Face TRL. 🚀 It is created for consumer-size GPUs (24GB) covering the full end-to-end lifecycle with: 💡Define and understand use cases for fine-tuning 🧑🏻‍💻 Setup of the development environment 🧮 Create and prepare dataset (OpenAI format) 🏋️‍♀️ Fine-tune LLM using TRL and the SFTTrainer 🥇 Test and evaluate the LLM 🚀 Deploy for production with TGI 👉 https://www.philschmid.de/fine-tune-llms-in-2024-with-trl Coming soon: Advanced Guides for multi-GPU/multi-Node full fine-tuning and alignment using DPO & KTO. 🔜

upvoted a paper 4 months ago

LMDX: Language Model-based Document Information Extraction and Localization

View all activity

Organizations

None yet

ddahlmeier's activity

New activity in McGill-NLP/feedbackQA 24 days ago

NonMatchingChecksumError

#3 opened 24 days ago by