Bengali Word2Vec Model

This is a pre-trained Word2Vec model for the Bengali language.

This model is built for the bengalinlp package.

Datasets

Training details

  • Word2Vec settings: embedding dimension = 100, min_count = 5, window = 5, epochs = 10
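
To make the min_count and window settings above concrete, here is a minimal pure-Python sketch of how a Word2Vec-style trainer typically filters rare tokens and forms (center, context) pairs. It is an illustration under assumptions, not the code used to train this model: the helper names build_vocab and context_pairs and the toy corpus are hypothetical, and real trainers usually add details such as sampling a smaller effective window and subsampling frequent words.

```python
from collections import Counter


def build_vocab(sentences, min_count=5):
    """Keep only tokens that occur at least `min_count` times in the corpus."""
    counts = Counter(token for sentence in sentences for token in sentence)
    return {token for token, n in counts.items() if n >= min_count}


def context_pairs(sentence, vocab, window=5):
    """Return (center, context) pairs for one tokenized sentence."""
    # Assumption: out-of-vocabulary tokens are dropped before windows are formed.
    kept = [token for token in sentence if token in vocab]
    pairs = []
    for i, center in enumerate(kept):
        lo = max(0, i - window)
        hi = min(len(kept), i + window + 1)
        for j in range(lo, hi):
            if j != i:
                pairs.append((center, kept[j]))
    return pairs


# Hypothetical toy corpus; small thresholds are used so the effect is visible.
corpus = [["গ্রাম", "নদী", "মাঠ"], ["গ্রাম", "শহর"], ["নদী", "গ্রাম"]]
vocab = build_vocab(corpus, min_count=2)
pairs = context_pairs(corpus[0], vocab, window=1)
print(vocab)
print(pairs)
```

With the model's actual settings (min_count = 5, window = 5), a token seen fewer than five times in the training corpus would get no vector, and each remaining token would be paired with up to five neighbours on either side.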

Usage

  • Install the toolkit: pip install -U bengalinlp_toolkit

  • Generate a word vector using the pretrained model

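    from bengalinlp import BengaliWord2Vec
    
    bwv = BengaliWord2Vec()
    # Path to the downloaded pretrained model file
    model_path = "bengali_word2vec.model"
    word = 'গ্রাম'  # "village"
    vector = bwv.generate_word_vector(model_path, word)
    # The shape should match the 100-dimensional embedding size
    print(vector.shape)
    print(vector)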
    
  • Find the most similar words using the pretrained model

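    from bengalinlp import BengaliWord2Vec
    
    bwv = BengaliWord2Vec()
    # Path to the downloaded pretrained model file
    model_path = "bengali_word2vec.model"
    word = 'গ্রাম'  # "village"
    # topn sets how many nearest neighbours are returned
    similar = bwv.most_similar(model_path, word, topn=10)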
    print(similar)
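
For intuition about what a most-similar lookup computes, the sketch below ranks words by cosine similarity over a tiny hand-made vector table. It assumes the usual Word2Vec convention of cosine similarity between embeddings; the function names and the two-dimensional toy vectors are hypothetical and do not come from the bengalinlp package or from this model.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def most_similar(vectors, word, topn=10):
    """Return up to `topn` (word, score) pairs, highest similarity first."""
    query = vectors[word]
    scored = [
        (other, cosine(query, vec))
        for other, vec in vectors.items()
        if other != word  # the query word itself is excluded
    ]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:topn]


# Hypothetical 2-dimensional vectors; the real model uses 100 dimensions.
toy_vectors = {
    "গ্রাম": [1.0, 0.0],  # "village"
    "শহর": [0.8, 0.6],    # "city"
    "নদী": [0.0, 1.0],    # "river"
}
result = most_similar(toy_vectors, "গ্রাম", topn=2)
print(result)
```

The result is a list of (word, score) pairs in descending order of similarity, so the first entry is the nearest neighbour of the query word in the toy table.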
    