PZ's picture

PZ PRO

philipp-zettl

·

philipp-zettl

AI & ML interests

NLP/CV/Multimodal learning

Organizations

philipp-zettl's activity

upvoted a collection 8 days ago

SmolLM2

State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 8 items • Updated 7 days ago • 163

upvoted a collection about 1 month ago

Moshi v0.1 Release

MLX, Candle & PyTorch model checkpoints released as part of the Moshi release from Kyutai. Run inference via: https://github.com/kyutai-labs/moshi • 13 items • Updated Sep 18 • 215

upvoted an article about 1 month ago

Article

HTRflow - A tool for HTR and OCR

By

•

Oct 1

• 14

upvoted a collection about 1 month ago

Realistic Vision (SD1.5)

8 items • Updated Dec 4, 2023 • 33

upvoted an article about 1 month ago

Article

Llama can now see and run on your device - welcome Llama 3.2

Sep 25

• 164

upvoted a paper about 2 months ago

Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks

Paper • 2311.06242 • Published Nov 10, 2023 • 84

upvoted a collection 3 months ago

Zeroshot Classifiers

These are my current best zeroshot classifiers. Some of my older models are downloaded more often, but the models in this collection are newer/better. • 11 items • Updated Apr 3 • 112

upvoted an article 3 months ago

Article

Introducing TextImage Augmentation for Document Images

Aug 6

• 31

upvoted an article 4 months ago

Article

How we leveraged distilabel to create an Argilla 2.0 Chatbot

Jul 16

• 32

upvoted an article 5 months ago

Article

Thoughts on LoRA Training #1

By

•

Jun 18

• 31

upvoted a collection 5 months ago

Qwen2

Qwen2 language models, including pretrained and instruction-tuned models of 5 sizes, including 0.5B, 1.5B, 7B, 57B-A14B, and 72B. • 39 items • Updated Sep 18 • 346

upvoted an article 5 months ago

Article

quanto: a pytorch quantization toolkit

Mar 18

• 31

upvoted 2 papers 5 months ago

PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation

Paper • 2403.04692 • Published Mar 7 • 40

Transformers Can Do Arithmetic with the Right Embeddings

Paper • 2405.17399 • Published May 27 • 51

upvoted an article 5 months ago

Article

Space secrets security update

May 31

• 50

upvoted a collection 5 months ago

em🍞ing series

crispy sentence embedding family • 5 items • Updated 28 days ago • 21

upvoted an article 6 months ago

Article

Training and Finetuning Embedding Models with Sentence Transformers v3

May 28

• 156

upvoted 2 collections 6 months ago

Phi-3

Phi-3 family of small language and multi-modal models. Language models are available in short- and long-context lengths. • 27 items • Updated 13 days ago • 489

PaliGemma FT Models

108 items • Updated Jul 31 • 27

upvoted a paper 7 months ago

JetMoE: Reaching Llama2 Performance with 0.1M Dollars

Paper • 2404.07413 • Published Apr 11 • 36