Moshi v0.1 Release Collection MLX, Candle & PyTorch model checkpoints released as part of the Moshi release from Kyutai. Run inference via: https://github.com/kyutai-labs/moshi • 13 items • Updated 1 day ago • 134
Vision Language Leaderboards Collection This collection has all the vision language leaderboards. • 7 items • Updated 26 days ago • 9
Depth Anything v2 Release Collection A comprehensive collection on DAv2 • 5 items • Updated Jun 18 • 10
Vision Language Models Papers 🖼️💬📝 Collection Papers about vision-language models; the most important ones are at the top of the list. • 27 items • Updated Apr 30 • 32
Awesome Document AI Collection A collection of open-source document AI 📄 📝 📈 • 27 items • Updated Mar 11 • 65
OWL-series 🦉 Collection Models and applications of OWL-ViT and OWLv2. • 13 items • Updated Mar 11 • 5
plant-image-datasets Collection Image datasets about the kingdom Plantae. • 4 items • Updated Feb 29 • 2
Zero-shot Image Classification Models 🖼️ Collection This is a collection for models that can be used for zero-shot image classification. • 10 items • Updated Sep 19, 2023 • 1
Image Segmentation Models 💜 Collection A collection of instance/semantic/panoptic segmentation models. • 17 items • Updated Sep 19, 2023 • 1
Image-to-Image Models 🎨 Collection Collection of image-to-image editing, image enhancement (super-resolution, deblurring, brightening), and text-to-image adapter models. • 24 items • Updated Sep 19, 2023 • 2
Image-to-Text Models 📝 Collection This collection contains image captioning and OCR models. • 15 items • Updated Sep 19, 2023 • 5
Foundation Models for Vision 🧩 Collection Foundation models for computer vision. • 24 items • Updated Mar 11 • 17
Computer Vision Backbones 🧩 Collection Collection of useful computer vision backbones to fine-tune. It also includes large image classification models that can be used as backbones. • 22 items • Updated Sep 19, 2023 • 17
Historical - Spaces of the Week Collection All Spaces of the Week, from all weeks. • 636 items • Updated Jan 17 • 19