Base Model: LLaMa2 7B Chat HF

Extend vocab to 44,800 for better Vietnamese understanding
Continual Pre-Train with >2B tokens Vietnamese
Trainning profile: LoRa (rank=32, alpha=128, 16fp), 1 epoch, block size = 512. Takes 300GPU Hours x RXT4090 24GB

Can be better use for

Futher training / Fine-tuning for Vietnamese tasks

Downloads last month: 9

Inference Providers NEW

This model is not currently available via any of the supported third-party Inference Providers, and the model is not deployed on the HF Inference API.

infCapital
/

llama2-7b-chatvi

Base Model: LLaMa2 7B Chat HF

Can be better use for

Dataset used to train infCapital/llama2-7b-chatvi