cnmoro's picture
Update README.md
6e4920b verified
metadata
library_name: sentence-transformers
tags:
  - sentence-transformers
  - sentence-similarity
  - feature-extraction
  - autotrain
base_model: google/bert_uncased_L-2_H-128_A-2
widget:
  - source_sentence: dogs are playful
    sentences:
      - i love cats
      - i love dogs
pipeline_tag: sentence-similarity
datasets:
  - cnmoro/PremiseHypothesisLabel_ENPT

Model Trained Using AutoTrain

  • Problem type: Sentence Transformers

Validation Metrics

loss: 0.056979671120643616

Info

This is the bert-tiny model finetuned on 15B tokens for embedding/feature extraction, for English and Brazillian Portuguese languages.

The output vector size is 128.

This model only has 4.4M params but the quality of the embeddings punch way above its size after tuning.

Usage

Direct Usage (Sentence Transformers)

First install the Sentence Transformers library:

pip install -U sentence-transformers

Then you can load this model and run inference.

from sentence_transformers import SentenceTransformer

# Download from the Hugging Face Hub
model = SentenceTransformer("cnmoro/bert-tiny-embeddings-english-portuguese")
# Run inference
sentences = [
    'first passage',
    'second passage'
]
embeddings = model.encode(sentences)
print(embeddings.shape)