Lina-Speech: Gated Linear Attention is a Fast and Parameter-Efficient Learner for text-to-speech synthesis Paper • 2410.23320 • Published Oct 30 • 7
Llama3-8B-1.58 Collection A trio of models fine-tuned from Llama3-8B-Instruct with the BitNet (1.58-bit) architecture. • 3 items • Updated Sep 14 • 12
Nemotron 4 340B Collection Nemotron-4: open models for Synthetic Data Generation (SDG). Includes Base, Instruct, and Reward models. • 4 items • Updated Nov 2 • 160
Embedding Model Datasets Collection A curated subset of the datasets that work out of the box with Sentence Transformers (see the sketch after this list): https://huggingface.co./datasets?other=sentence-transformers • 67 items • Updated Jul 3 • 87
Article LLM Comparison/Test: Llama 3 Instruct 70B + 8B HF/GGUF/EXL2 (20 versions tested and compared!) By wolfram • Apr 24 • 59
Gemma release Collection Groups the Gemma models released by the Google team. • 40 items • Updated 12 days ago • 327
Canonical models Collection This collection lists all the historical (pre-"Hub") canonical model checkpoints, i.e. repos that were not under an org or user namespace. • 68 items • Updated Feb 13 • 14
SigLIP Collection Contrastive (sigmoid) image-text models from https://arxiv.org/abs/2303.15343 • 10 items • Updated 12 days ago • 47
Switch-Transformers release Collection This release includes various MoE (Mixture of Experts) models based on the T5 architecture. The base models use between 8 and 256 experts. • 9 items • Updated 12 days ago • 15
zephyr story Collection Sources mentioned in the hf.co/thomwolf tweet: x.com/Thom_Wolf/status/1720503998518640703 • 8 items • Updated Jan 24 • 15
Distil-Whisper Models Collection The first version of the Distil-Whisper models released with the Distil-Whisper paper. • 4 items • Updated Mar 21 • 36
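To make the "work out of the box with Sentence Transformers" claim in the Embedding Model Datasets entry above concrete, here is a minimal sketch of loading one of the listed datasets and training on it. The dataset id ("sentence-transformers/all-nli"), its "triplet" config, and the base model ("all-MiniLM-L6-v2") are illustrative choices, not prescribed by the collection; any dataset from it is meant to plug in the same way.

```python
# Minimal sketch: a dataset from the collection feeds a Sentence Transformers
# loss directly, with no custom preprocessing or collation code.
from datasets import load_dataset
from sentence_transformers import SentenceTransformer, SentenceTransformerTrainer, losses

# Columns are anchor / positive / negative -- exactly what the loss below expects.
# Dataset id and config are assumptions for illustration.
train_ds = load_dataset("sentence-transformers/all-nli", "triplet", split="train")

model = SentenceTransformer("all-MiniLM-L6-v2")
loss = losses.MultipleNegativesRankingLoss(model)

# No column renaming needed: this is the "out of the box" part.
trainer = SentenceTransformerTrainer(model=model, train_dataset=train_ds, loss=loss)
trainer.train()
```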