RishuD7's picture
Add new SentenceTransformer model
cf0c8d4 verified
metadata
language:
  - en
license: apache-2.0
tags:
  - sentence-transformers
  - sentence-similarity
  - feature-extraction
  - generated_from_trainer
  - dataset_size:8290
  - loss:MultipleNegativesRankingLoss
base_model: BAAI/bge-base-en-v1.5
widget:
  - source_sentence: >-
      30. HOLDING OVER. If Tenant remains in possession of the Leased Premises
      after expiration of the Term, or after any termination of the Lease by
      Landlord without written agreement between the parties, Tenant shall be a
      tenant at sufferance and such tenancy shall be subject to the provisions
      hereof, except that Base Rent for said holdover period shall be one
      hundred twenty five percent (125%) of the amount of Base Rent due in the
      last month of the Term. Nothing in this Section shall be construed as
      consent by Landlord to the possession of the Leased Premises by Tenant
      after the expiration of the Term or termination of the Lease by Landlord.
    sentences:
      - Holding Over
      - Does landlord confirm to no eminent domain on the property
      - Holding Over
  - source_sentence: >2-
           Lease other than as specifically provided in the Lease..
                 (j)    To the knowledge of Landlord and/or Assignor, there has been no casualty with respect
           to the Premises..
                 (k)   There does not exist any pending, or to the knowledge of Landlord, contemplated,
           condemnation or eminent domain proceedings that affect the Premises or any part thereof, and Landlord
           has received no notice, oral or written, of the intention of any governmental body or other entity to take or
           use all or any part thereof.

      (d) The Premises contain approximately 3,739 rentable square feet, which
      is 4.18% of the. rentable area of the Project. (e) There are no existing
      defaults on the part of Landlord or Assignor under the Lease;. neither
      party to the Lease has delivered any notice of default to the other; and,
      to the knowledge of Landlord and/or Assignor, no event has occurred that,
      with the giving of notice or the passage of time or both, would constitute
      a default under the Lease.. (f) The current monthly Base Rent payable
      under the Lease is $7,322.21 per month. All rent. payable under the Lease
      has been paid through May 31, 2020. (g) There is currently no security
      deposit being held by Landlord under the Lease.. (h) All improvements to
      the Premises required to be made by Landlord or Assignor under the Lease
      have been made, and any improvement allowances to be paid to Assignor have
      been fully paid. (i) Landlord has no option to terminate or otherwise
      modify the terms and conditions of the.
    sentences:
      - Does landlord confirm to no eminent domain on the property
      - Signage Rights
      - Signage Rights
  - source_sentence: >-
      17. Condemnation. Either party may terminate this Lease if any material
      part of the Premises is taken or condemned for any public or quasi-public
      use under Law, by eminent domain or private purchase in lieu thereof (a
      "Taking"). Landlord shall also have the right to terminate this Lease if
      there is a Taking of any portion of the Building or Property which would
      have a material adverse effect on Landlord's ability to profitably operate
      the remainder of the Building. The terminating party shall provide written
      notice of termination to the other party within 45 days after it first
      receives notice of the Taking. The termination shall be effective as of
      the effective date of any order granting possession to, or vesting legal
      title in, the condemning authority. If this Lease is not terminated, Base
      Rent and Tenant's Pro Rata Share shall be appropriately adjusted to
      account for any reduction in the square footage of the Building or
      Premises. All compensation awarded for a Taking shall be the property of
      Landlord. The right to receive compensation or proceeds are expressly
      waived by Tenant, provided, however, Tenant may file a separate claim for
      Tenant's Property and Tenant's reasonable relocation expenses, provided
      the filing of the claim does not diminish the amount of Landlord's award.
      If only a part of the Premises is subject to a Taking and this Lease is
      not terminated, Landlord, with reasonable diligence, will restore the
      remaining portion of the Premises as nearly as practicable to the
      condition immediately prior to the Taking.
    sentences:
      - Eminent Domain(Detail)
      - Permitted Use
      - Permitted Use
  - source_sentence: |2
            5. Lessor reserves the right to relocate all or a part of parking spaces from floor to
            floor, within one floor, and/or to reasonably adjacent offsite location(s), and to
           reasonably allocate them between compact and standard size spaces, as Iong as
            the same complies with applicable laws, ordinances and regulations..
      .
           6. Users of the parking area will obey all posted signs and park only in the areas
           designated for vehicle parking.
      .
            7. Unless otherwise instructed, every person using the parking area is required to
            park and lock his own vehicle. Lessor will not be responsible for any damage to
            vehicles, injury to persons or loss of property, all of which risks are assumed by
            the party using the parking area.
      .
           8. Validation, if established, will be permissible only by such method or methods
           as Lessor and/or its licensee may establish at rates generally applicable to visitor
           parking.
      .
           9. The maintenance, washing, waxing or cleaning of vehicles in the parking
            structure or Common Areas is prohibited.
      .
            10. Lessee shall be responsible for seeing that all of its employees, agents and
            invitees comply with the applicable parking rules, regulations, laws and.
            agreements.
      .
            11. Lessor reserves the right to modify these rules and/or adopt such other
            reasonable and non-discriminatory rules and regulations as it may deem.
           necessary for the proper operation of the parking area..
      .
            12. Such parking use as is herein provided is intended merely as a license only
            and no bailment is intended or shall be created hereby..
      .
            4/5/2012                          14            initials O
            3. Lessor reserves the right to use parking stickers or identification devices
            which shall be the property of Lessor and be returned to Lessor by the holder
            thereof upon termination of the holder's parking privileges. Lessee will pay such
            replacement charge as is reasonably established by Lessor for the loss of such
            devices.
      .
            4. Lessor reserves the right to refuse the sale of monthly identification devices to
            any person or entity that willfully refuses to comply with the applicable rules,
            regulations, laws and/or agreements.
    sentences:
      - Does landlord confirm to no eminent domain on the property
      - Permitted Use
      - Right to Relocate (Detail)
  - source_sentence: >-
      31. HOLDING OVER. If Tenant remains in possession of the Leased Premises
      after

      expiration of the Term, or after any termination of the Lease by Landlord
      without written agreement

      between the parties, Tenant shall be a tenant at sufferance and such
      tenancy shall be subject to the

      provisions hereof, except that Rent for said holdover period shall be one
      hundred fifty percent (150%) of

      the amount of Rent due in the last month of the Term. Nothing in this
      Section 29 shall be construed as

      consent by Landlord to the possession of the Leased Premises by Tenant
      after the expiration of the Term

      or termination of the Lease by Landlord. 
    sentences:
      - Holding Over
      - Does landlord confirm to no eminent domain on the property
      - Holding Rent
pipeline_tag: sentence-similarity
library_name: sentence-transformers
metrics:
  - cosine_accuracy@1
  - cosine_accuracy@3
  - cosine_accuracy@5
  - cosine_accuracy@10
  - cosine_precision@1
  - cosine_precision@3
  - cosine_precision@5
  - cosine_precision@10
  - cosine_recall@1
  - cosine_recall@3
  - cosine_recall@5
  - cosine_recall@10
  - cosine_ndcg@10
  - cosine_mrr@10
  - cosine_map@100
model-index:
  - name: BGE base En v1.5 Phase 5
    results:
      - task:
          type: information-retrieval
          name: Information Retrieval
        dataset:
          name: dim 768
          type: dim_768
        metrics:
          - type: cosine_accuracy@1
            value: 0.011029411764705883
            name: Cosine Accuracy@1
          - type: cosine_accuracy@3
            value: 0.022794117647058822
            name: Cosine Accuracy@3
          - type: cosine_accuracy@5
            value: 0.03602941176470588
            name: Cosine Accuracy@5
          - type: cosine_accuracy@10
            value: 0.07720588235294118
            name: Cosine Accuracy@10
          - type: cosine_precision@1
            value: 0.011029411764705883
            name: Cosine Precision@1
          - type: cosine_precision@3
            value: 0.0075980392156862735
            name: Cosine Precision@3
          - type: cosine_precision@5
            value: 0.007205882352941177
            name: Cosine Precision@5
          - type: cosine_precision@10
            value: 0.007720588235294118
            name: Cosine Precision@10
          - type: cosine_recall@1
            value: 0.011029411764705883
            name: Cosine Recall@1
          - type: cosine_recall@3
            value: 0.022794117647058822
            name: Cosine Recall@3
          - type: cosine_recall@5
            value: 0.03602941176470588
            name: Cosine Recall@5
          - type: cosine_recall@10
            value: 0.07720588235294118
            name: Cosine Recall@10
          - type: cosine_ndcg@10
            value: 0.03623828989025581
            name: Cosine Ndcg@10
          - type: cosine_mrr@10
            value: 0.024275501867413632
            name: Cosine Mrr@10
          - type: cosine_map@100
            value: 0.0367475670482882
            name: Cosine Map@100

BGE base En v1.5 Phase 5

This is a sentence-transformers model finetuned from BAAI/bge-base-en-v1.5. It maps sentences & paragraphs to a 768-dimensional dense vector space and can be used for semantic textual similarity, semantic search, paraphrase mining, text classification, clustering, and more.

Model Details

Model Description

  • Model Type: Sentence Transformer
  • Base model: BAAI/bge-base-en-v1.5
  • Maximum Sequence Length: 512 tokens
  • Output Dimensionality: 768 dimensions
  • Similarity Function: Cosine Similarity
  • Language: en
  • License: apache-2.0

Model Sources

Full Model Architecture

SentenceTransformer(
  (0): Transformer({'max_seq_length': 512, 'do_lower_case': True}) with Transformer model: BertModel 
  (1): Pooling({'word_embedding_dimension': 768, 'pooling_mode_cls_token': True, 'pooling_mode_mean_tokens': False, 'pooling_mode_max_tokens': False, 'pooling_mode_mean_sqrt_len_tokens': False, 'pooling_mode_weightedmean_tokens': False, 'pooling_mode_lasttoken': False, 'include_prompt': True})
  (2): Normalize()
)

Usage

Direct Usage (Sentence Transformers)

First install the Sentence Transformers library:

pip install -U sentence-transformers

Then you can load this model and run inference.

from sentence_transformers import SentenceTransformer

# Download from the 🤗 Hub
model = SentenceTransformer("RishuD7/bge-base-en-v1.5-76-keys-phase-6-exp_v1")
# Run inference
sentences = [
    '31. HOLDING OVER. If Tenant remains in possession of the Leased Premises after\nexpiration of the Term, or after any termination of the Lease by Landlord without written agreement\nbetween the parties, Tenant shall be a tenant at sufferance and such tenancy shall be subject to the\nprovisions hereof, except that Rent for said holdover period shall be one hundred fifty percent (150%) of\nthe amount of Rent due in the last month of the Term. Nothing in this Section 29 shall be construed as\nconsent by Landlord to the possession of the Leased Premises by Tenant after the expiration of the Term\nor termination of the Lease by Landlord. ',
    'Holding Rent',
    'Does landlord confirm to no eminent domain on the property',
]
embeddings = model.encode(sentences)
print(embeddings.shape)
# [3, 768]

# Get the similarity scores for the embeddings
similarities = model.similarity(embeddings, embeddings)
print(similarities.shape)
# [3, 3]

Evaluation

Metrics

Information Retrieval

Metric Value
cosine_accuracy@1 0.011
cosine_accuracy@3 0.0228
cosine_accuracy@5 0.036
cosine_accuracy@10 0.0772
cosine_precision@1 0.011
cosine_precision@3 0.0076
cosine_precision@5 0.0072
cosine_precision@10 0.0077
cosine_recall@1 0.011
cosine_recall@3 0.0228
cosine_recall@5 0.036
cosine_recall@10 0.0772
cosine_ndcg@10 0.0362
cosine_mrr@10 0.0243
cosine_map@100 0.0367

Training Details

Training Dataset

Unnamed Dataset

  • Size: 8,290 training samples
  • Columns: positive and anchor
  • Approximate statistics based on the first 1000 samples:
    positive anchor
    type string string
    details
    • min: 98 tokens
    • mean: 298.43 tokens
    • max: 512 tokens
    • min: 4 tokens
    • mean: 5.99 tokens
    • max: 12 tokens
  • Samples:
    positive anchor
    The Landlord shall have the right, at any time during the Term, to relocate the Premises to other premises (the "New Premises") in the Development on the same terms and conditions as are set out in this Lease provided that: (a) the Landlord shall first have given not less than 90 days notice to the Tenant; (b) the Landlord shall endeavour to ensure that the New Premises be of comparable size and quality to the Premises; (c) the Landlord shall pay the reasonable costs incurred by the Tenant for: (i) its physical move; (ii) the reconnection of existing communication lines; and (iii) the reordering of new printed material plates and the printing of an equal quantity and quality of printed material the tenant has in stock as the time of the relocation; (d) if the Rentable Area of the New Premises is not the same as the Rentable Area of the Premises, the total Basic Rent payable under this Lease (but not the Basic Rent per square foot of Rentable Area) shall be adjusted accordingly; and (e)... Right to Relocate
    39. Holdover: If Tenant shall hold over after the expiration of the Lease Term, without written agreement providing otherwise, Tenant shall be deemed to be a tenant at sufferance on month to month basis, at a monthly rental, payable in advance, equal to double the base rent then being paid by Tenant, and Tenant shall be bound by all of the other terms, covenants and agreements of the Lease. Nothing contained herein shall be construed to give Tenant the right to hold over at any time, extend the Term or prevent Landlord from immediate recovery of possession of the Premises by summary proceedings or otherwise and Landlord may exercise any and all remedies at law or in equity to recover possession of the Premises, as well as any damages incurred by Landlord, by Tenant's failure to vacate the Premises and deliver possession to Landlord as herein provided. Holding Over
    30. HOLDING OVER. If Tenant remains in possession of the Leased Premises after expiration of the Term, or after any termination of the Lease by Landlord without written agreement between the parties, Tenant shall be a tenant at sufferance and such tenancy shall be subject to the provisions hereof, except that Gross Rent for said holdover period shall be one hundred twenty five percent (125%) of the amount of Gross Rent due in the last month of the Term. Nothing in this Section 30 shall be construed as consent by Landlord to the possession of the Leased Premises by Tenant after the expiration of the Term or termination of the Lease by Landlord. In the event Tenant provides written notice to Landlord of its intent to holdover at least sixty (60) days prior to the end of the Term and Landlord does not object to such request within thirty (30) days after receipt thereof, it shall be deemed that Landlord has consented to such holdover and this Lease shall continue on a month-to-month basis ... Holding Over
  • Loss: MultipleNegativesRankingLoss with these parameters:
    {
        "scale": 20.0,
        "similarity_fct": "cos_sim"
    }
    

Training Hyperparameters

Non-Default Hyperparameters

  • eval_strategy: epoch
  • per_device_train_batch_size: 32
  • per_device_eval_batch_size: 16
  • gradient_accumulation_steps: 16
  • learning_rate: 2e-05
  • num_train_epochs: 30
  • lr_scheduler_type: cosine
  • warmup_ratio: 0.1
  • tf32: False
  • load_best_model_at_end: True
  • optim: adamw_torch_fused
  • batch_sampler: no_duplicates

All Hyperparameters

Click to expand
  • overwrite_output_dir: False
  • do_predict: False
  • eval_strategy: epoch
  • prediction_loss_only: True
  • per_device_train_batch_size: 32
  • per_device_eval_batch_size: 16
  • per_gpu_train_batch_size: None
  • per_gpu_eval_batch_size: None
  • gradient_accumulation_steps: 16
  • eval_accumulation_steps: None
  • torch_empty_cache_steps: None
  • learning_rate: 2e-05
  • weight_decay: 0.0
  • adam_beta1: 0.9
  • adam_beta2: 0.999
  • adam_epsilon: 1e-08
  • max_grad_norm: 1.0
  • num_train_epochs: 30
  • max_steps: -1
  • lr_scheduler_type: cosine
  • lr_scheduler_kwargs: {}
  • warmup_ratio: 0.1
  • warmup_steps: 0
  • log_level: passive
  • log_level_replica: warning
  • log_on_each_node: True
  • logging_nan_inf_filter: True
  • save_safetensors: True
  • save_on_each_node: False
  • save_only_model: False
  • restore_callback_states_from_checkpoint: False
  • no_cuda: False
  • use_cpu: False
  • use_mps_device: False
  • seed: 42
  • data_seed: None
  • jit_mode_eval: False
  • use_ipex: False
  • bf16: False
  • fp16: False
  • fp16_opt_level: O1
  • half_precision_backend: auto
  • bf16_full_eval: False
  • fp16_full_eval: False
  • tf32: False
  • local_rank: 0
  • ddp_backend: None
  • tpu_num_cores: None
  • tpu_metrics_debug: False
  • debug: []
  • dataloader_drop_last: False
  • dataloader_num_workers: 0
  • dataloader_prefetch_factor: None
  • past_index: -1
  • disable_tqdm: False
  • remove_unused_columns: True
  • label_names: None
  • load_best_model_at_end: True
  • ignore_data_skip: False
  • fsdp: []
  • fsdp_min_num_params: 0
  • fsdp_config: {'min_num_params': 0, 'xla': False, 'xla_fsdp_v2': False, 'xla_fsdp_grad_ckpt': False}
  • fsdp_transformer_layer_cls_to_wrap: None
  • accelerator_config: {'split_batches': False, 'dispatch_batches': None, 'even_batches': True, 'use_seedable_sampler': True, 'non_blocking': False, 'gradient_accumulation_kwargs': None}
  • deepspeed: None
  • label_smoothing_factor: 0.0
  • optim: adamw_torch_fused
  • optim_args: None
  • adafactor: False
  • group_by_length: False
  • length_column_name: length
  • ddp_find_unused_parameters: None
  • ddp_bucket_cap_mb: None
  • ddp_broadcast_buffers: False
  • dataloader_pin_memory: True
  • dataloader_persistent_workers: False
  • skip_memory_metrics: True
  • use_legacy_prediction_loop: False
  • push_to_hub: False
  • resume_from_checkpoint: None
  • hub_model_id: None
  • hub_strategy: every_save
  • hub_private_repo: False
  • hub_always_push: False
  • gradient_checkpointing: False
  • gradient_checkpointing_kwargs: None
  • include_inputs_for_metrics: False
  • eval_do_concat_batches: True
  • fp16_backend: auto
  • push_to_hub_model_id: None
  • push_to_hub_organization: None
  • mp_parameters:
  • auto_find_batch_size: False
  • full_determinism: False
  • torchdynamo: None
  • ray_scope: last
  • ddp_timeout: 1800
  • torch_compile: False
  • torch_compile_backend: None
  • torch_compile_mode: None
  • dispatch_batches: None
  • split_batches: None
  • include_tokens_per_second: False
  • include_num_input_tokens_seen: False
  • neftune_noise_alpha: None
  • optim_target_modules: None
  • batch_eval_metrics: False
  • eval_on_start: False
  • eval_use_gather_object: False
  • prompts: None
  • batch_sampler: no_duplicates
  • multi_dataset_batch_sampler: proportional

Training Logs

Epoch Step Training Loss dim_768_cosine_ndcg@10
0.6154 10 2.5422 -
1.2308 20 1.3661 -
1.8462 30 0.1879 -
2.4615 40 0.0 -
3.0769 50 0.0 -
3.3846 55 - 0.0252
1.2846 60 0.8868 -
1.9 70 1.4243 -
2.5154 80 0.1644 -
3.1308 90 0.0041 -
3.7462 100 0.0 -
4.3615 110 0.0 0.0301
2.5692 120 1.0665 -
3.1846 130 0.4817 -
3.8 140 0.0021 -
4.4154 150 0.0 -
5.0308 160 0.0 -
5.4 166 - 0.0328
3.2385 170 0.4318 -
3.8538 180 0.7595 -
4.4692 190 0.0737 -
5.0846 200 0.0004 -
5.7 210 0.0 -
6.3154 220 0.0 -
6.3769 221 - 0.0354
4.5231 230 0.736 -
5.1385 240 0.3332 -
5.7538 250 0.0008 -
6.3692 260 0.0 -
6.9846 270 0.0 -
7.3538 276 - 0.0336
5.1923 280 0.3014 -
5.8077 290 0.5931 -
6.4231 300 0.0735 -
7.0385 310 0.0002 -
7.6538 320 0.0 -
8.2692 330 0.0 -
8.3923 332 - 0.0374
6.4769 340 0.5984 -
7.0923 350 0.2797 -
7.7077 360 0.0005 -
8.3231 370 0.0 -
8.9385 380 0.0 -
9.3692 387 - 0.0355
7.1462 390 0.1997 -
7.7615 400 0.5201 -
8.3769 410 0.0799 -
8.9923 420 0.0001 -
9.6077 430 0.0 -
10.2231 440 0.0 -
10.4077 443 - 0.0362
8.4308 450 0.5072 -
9.0462 460 0.2583 -
9.6615 470 0.0005 -
10.2769 480 0.0 0.0362
  • The bold row denotes the saved checkpoint.

Framework Versions

  • Python: 3.11.11
  • Sentence Transformers: 3.3.1
  • Transformers: 4.43.1
  • PyTorch: 2.5.1+cu124
  • Accelerate: 1.2.1
  • Datasets: 2.19.1
  • Tokenizers: 0.19.1

Citation

BibTeX

Sentence Transformers

@inproceedings{reimers-2019-sentence-bert,
    title = "Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks",
    author = "Reimers, Nils and Gurevych, Iryna",
    booktitle = "Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing",
    month = "11",
    year = "2019",
    publisher = "Association for Computational Linguistics",
    url = "https://arxiv.org/abs/1908.10084",
}

MultipleNegativesRankingLoss

@misc{henderson2017efficient,
    title={Efficient Natural Language Response Suggestion for Smart Reply},
    author={Matthew Henderson and Rami Al-Rfou and Brian Strope and Yun-hsuan Sung and Laszlo Lukacs and Ruiqi Guo and Sanjiv Kumar and Balint Miklos and Ray Kurzweil},
    year={2017},
    eprint={1705.00652},
    archivePrefix={arXiv},
    primaryClass={cs.CL}
}