Text Generation
Transformers
Safetensors
llama
llama2
meta
indic
Dravida
LLM
text-generation-inference
Inference Endpoints
Edit model card

Model Details

Model Description

The Dravida Llama is a state-of-the-art Multilingual Large Language Model (LLM) fine-tuned for the four South Indian languages (KaTeMaTa) using Meta's Llama-2 as a foundation.

  • Developed by: PosteriorAI
  • Model type: Large Language Model (LLM), specifically fine-tuned Llama-2 model for Kannada, Telugu, Malayalam & Tamil.
  • Language(s) (NLP): Kannada, Tamil, Malayalam & Telugu.
  • License: Open-source releases on Hugging Face, MIT Licensed.
  • Finetuned from model [optional]: Llama-2

Model Sources [optional]

Uses

This model is intended for various stakeholders, including researchers, developers, and the broader community. It aims to enhance communication, education, and technology access by providing an AI tool that understands and interacts in the four languages. The model's applications range from personal assistance to educational content creation and more. It addresses the gap in AI for Indic languages and is designed to promote inclusivity in technology.

Find mode details in our blog post at Dravida Llama: LLM for South Indian (ಕతెമத) Languages

Downloads last month
15
Safetensors
Model size
7.03B params
Tensor type
FP16
·
Inference Examples
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.

Dataset used to train PosteriorAI/dravida_llama2_7b