kelechic

tensorkelechi

https://kelechi-c.github.io/

AI & ML interests

vision

Recent Activity

liked a Space 7 days ago

Lightricks/LTX-Video-Playground

liked a model 7 days ago

HuggingFaceTB/SmolLM2-135M

published a model 9 days ago

tensorkelechi/whisper_base_jax

tensorkelechi's activity

upvoted an article 10 days ago

Article

Using LoRA for Efficient Stable Diffusion Fine-Tuning

Jan 26, 2023

• 49

upvoted a collection 22 days ago

Qwen2.5

Qwen2.5 language models: pretrained and instruction-tuned models in 7 sizes (0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B). • 46 items • Updated 11 days ago • 552

upvoted a paper 22 days ago

SoundStorm: Efficient Parallel Audio Generation

Paper • 2305.09636 • Published May 16, 2023 • 5

upvoted a collection 23 days ago

CLAP: Contrastive Language-Audio Pretraining

CLAP is to audio what CLIP is to image. • 5 items • Updated Oct 31, 2023 • 10

upvoted an article 28 days ago

Article

Design choices for Vision Language Models in 2024

Apr 16, 2024

• 27

upvoted a paper 28 days ago

Audio Flamingo: A Novel Audio Language Model with Few-Shot Learning and Dialogue Abilities

Paper • 2402.01831 • Published Feb 2, 2024 • 15

upvoted 2 articles 29 days ago

Article

SmolVLM - small yet mighty Vision Language Model

Nov 26, 2024

• 212

Article

SmolVLM Grows Smaller – Introducing the 250M & 500M Models!

Jan 23

• 146

upvoted a paper about 1 month ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published Feb 4 • 199

upvoted an article about 1 month ago

Article

State of open video generation models in Diffusers

Jan 27

• 50

upvoted an article about 2 months ago

Article

Upgrading Kokoro: natural TTS for short bursts

Nov 22, 2024

• 27

upvoted a paper 2 months ago

Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models

Paper • 2409.17146 • Published Sep 25, 2024 • 108

upvoted a collection 2 months ago

Cosmos Tokenizer

A suite of image and video tokenizers • 13 items • Updated Jan 17 • 39

upvoted 3 papers 2 months ago

MobileVLM V2: Faster and Stronger Baseline for Vision Language Model

Paper • 2402.03766 • Published Feb 6, 2024 • 14

DeepSeek-VL: Towards Real-World Vision-Language Understanding

Paper • 2403.05525 • Published Mar 8, 2024 • 43

LearnLM: Improving Gemini for Learning

Paper • 2412.16429 • Published Dec 21, 2024 • 22

upvoted a paper 3 months ago

Tora: Trajectory-oriented Diffusion Transformer for Video Generation

Paper • 2407.21705 • Published Jul 31, 2024 • 27

upvoted an article 4 months ago

Article

Custom architectures with HuggingFace 🤗

Apr 22, 2024

• 26

upvoted a paper 4 months ago

Unbounded: A Generative Infinite Game of Character Life Simulation

Paper • 2410.18975 • Published Oct 24, 2024 • 37

upvoted a collection 4 months ago

Dinov2

5 items • Updated Jan 16, 2024 • 16