AViLaMa Model Series
Collection
This is the collection of all the different type of the AViLaMa models. From v1-small etc..
•
1 item
•
Updated
•
1
Learning Visual Concepts Directly From African Languages Supervision. Paper is coming
AViLaMa is the large open-source text-vision alignment pre-training model in African languages. It brings a way to learn visual concepts directly from African languages supervision. Inspired from OpenAI CLIP, but with more based on African languages to capture the nuances, cultural context, and social aspect use of our languages that are so impossible to get just from machine translation. It includes techniques like agnostic languages encoding, data filtering network etc... All for more than 12 African languages, trained on the #AViLaDa-2B datasets of filtered image-text pairs.
import torch
from transformers import AutoModel, AutoTokenizer
model = AutoModel.from_pretrained("sartifyllc/AViLaMa")
tokenizer = AutoTokenizer.from_pretrained("sartifyllc/AViLaMa")
model = model.eval()
BibTeX:
AViLaMa paper
@article{sartifyllc2023africanvision,
title={AViLaMa: Learning Visual Concepts Directly From African Languages Supervision},
author={Sartify LLC Research Team},
journal={To be inserted},
year={2024}
}