metadata
base_model: BAAI/bge-base-en-v1.5
library_name: setfit
metrics:
- accuracy
pipeline_tag: text-classification
tags:
- setfit
- sentence-transformers
- text-classification
- generated_from_setfit_trainer
widget:
- text: >-
Reasoning why the answer may be good:
1. **Context Grounding:** The answer correctly interprets and references
the specific part of the provided document that mentions the response
status column.
2. **Relevance:** The answer directly addresses the question by explaining
what the percentage in the response status column indicates.
3. **Conciseness:** The answer is concise and avoids unnecessary
information, focusing solely on the meaning of the percentage in the
response status column.
4. **Specificity:** The answer gives a detailed explanation about what the
percentage represents—successful completion of response actions.
5. **Accuracy:** The key concept of the percentage indicating the total
amount of successful completion of response actions is correctly conveyed
as per the document.
Reasoning why the answer may be bad:
There is no apparent reason for the answer to be bad; it aligns well with
the document and addresses the question directly and concisely.
Final result: Good
- text: >-
Reasoning:
**Why the answer may be good:**
- It accurately states that the provided information does not address the
specific question.
- It directs the reader to seek additional information or context.
**Why the answer may be bad:**
- It does not attempt to relate or infer an answer based on the document
provided.
- The document does provide relevant details about endpoint controls,
including that they involve Device Control, Personal Firewall Control, and
Full Disk Encryption Visibility, which can imply their purpose.
- The response is somewhat evasive and does not leverage any context
offered by the document to give an informed answer.
Final Result: Bad
- text: >-
Reasoning why the answer may be good:
- The provided answer is addressing the purpose of an agent (collecting
and securely forwarding logs), aligning with the context of log collection
and forwarding described in the document.
Reasoning why the answer may be bad:
- The answer is missing specificity. It does not mention on-site
collection nor does it specify the direct forwarding feature mentioned in
the document.
- It lacks details specified in the document about the agent being used
for integrations that do not use cloud feeds, e.g., firewalls.
- The answer does not mention <ORGANIZATION> explicitly ties into the
detection and correlation engine.
Final result: Bad
- text: >-
### Reasoning
**Good Aspects of the Answer:**
1. **Context Grounding:** The answer correctly pulls context from the
document regarding email notifications.
2. **Relevance:** The answer attempts to address the purpose of the email
notifications checkbox directly.
**Bad Aspects of the Answer:**
1. **Key/Value/Event Name Accuracy:** The document outlines a checkbox
related to enabling or disabling email notifications, but the answer uses
placeholder text "ORGANIZATION_2," which needs replacement for accuracy.
2. **Conciseness:** The answer is somewhat repetitive with the phrasing
"ORGANIZATION_2 or disable," which should clearly state "enable or
disable."
3. **Specificity:** The answer lacks the specific detail from the document
that this is specific to stale or archived sensors and involves System
Admins.
### Final Evaluation
**Final Result:** `Bad`
The placeholder text makes the information ambiguous and not actionable.
Additionally, there is a slight redundancy and missing specific roles
(System Admin) and context (stale/archived sensors) necessary for
precision.
- text: >-
Reasoning why the answer may be good:
- The answer provides a specific URL, which is required by the question.
- It appears to be in the format expected for image URLs as hinted at in
the document.
Reasoning why the answer may be bad:
- The provided answer does not match the precise URL given in the
document.
- The correct URL for the second query should be
`..\/..\/_images\/hunting_http://miller.co`, while the answer contains
`hunting_http://www.flores.net/`, which is not mentioned in the document.
- The answer does not reflect careful cross-referencing with the provided
document.
Final result: Bad
inference: true
model-index:
- name: SetFit with BAAI/bge-base-en-v1.5
results:
- task:
type: text-classification
name: Text Classification
dataset:
name: Unknown
type: unknown
split: test
metrics:
- type: accuracy
value: 0.4788732394366197
name: Accuracy
SetFit with BAAI/bge-base-en-v1.5
This is a SetFit model that can be used for Text Classification. This SetFit model uses BAAI/bge-base-en-v1.5 as the Sentence Transformer embedding model. A LogisticRegression instance is used for classification.
The model has been trained using an efficient few-shot learning technique that involves:
- Fine-tuning a Sentence Transformer with contrastive learning.
- Training a classification head with features from the fine-tuned Sentence Transformer.
Model Details
Model Description
- Model Type: SetFit
- Sentence Transformer body: BAAI/bge-base-en-v1.5
- Classification head: a LogisticRegression instance
- Maximum Sequence Length: 512 tokens
- Number of Classes: 2 classes
Model Sources
- Repository: SetFit on GitHub
- Paper: Efficient Few-Shot Learning Without Prompts
- Blogpost: SetFit: Efficient Few-Shot Learning Without Prompts
Model Labels
Label | Examples |
---|---|
0 |
|
1 |
|
Evaluation
Metrics
Label | Accuracy |
---|---|
all | 0.4789 |
Uses
Direct Use for Inference
First install the SetFit library:
pip install setfit
Then you can load this model and run inference.
from setfit import SetFitModel
# Download from the 🤗 Hub
model = SetFitModel.from_pretrained("Netta1994/setfit_baai_cybereason_gpt-4o_improved-cot-instructions_two_reasoning_only_reasoning_")
# Run inference
preds = model("Reasoning why the answer may be good:
- The answer provides a specific URL, which is required by the question.
- It appears to be in the format expected for image URLs as hinted at in the document.
Reasoning why the answer may be bad:
- The provided answer does not match the precise URL given in the document.
- The correct URL for the second query should be `..\/..\/_images\/hunting_http://miller.co`, while the answer contains `hunting_http://www.flores.net/`, which is not mentioned in the document.
- The answer does not reflect careful cross-referencing with the provided document.
Final result: Bad")
Training Details
Training Set Metrics
Training set | Min | Median | Max |
---|---|---|---|
Word count | 60 | 128.2029 | 239 |
Label | Training Sample Count |
---|---|
0 | 34 |
1 | 35 |
Training Hyperparameters
- batch_size: (16, 16)
- num_epochs: (5, 5)
- max_steps: -1
- sampling_strategy: oversampling
- num_iterations: 20
- body_learning_rate: (2e-05, 2e-05)
- head_learning_rate: 2e-05
- loss: CosineSimilarityLoss
- distance_metric: cosine_distance
- margin: 0.25
- end_to_end: False
- use_amp: False
- warmup_proportion: 0.1
- l2_weight: 0.01
- seed: 42
- eval_max_steps: -1
- load_best_model_at_end: False
Training Results
Epoch | Step | Training Loss | Validation Loss |
---|---|---|---|
0.0058 | 1 | 0.2486 | - |
0.2890 | 50 | 0.2626 | - |
0.5780 | 100 | 0.2394 | - |
0.8671 | 150 | 0.1005 | - |
1.1561 | 200 | 0.0028 | - |
1.4451 | 250 | 0.002 | - |
1.7341 | 300 | 0.0018 | - |
2.0231 | 350 | 0.0016 | - |
2.3121 | 400 | 0.0016 | - |
2.6012 | 450 | 0.0014 | - |
2.8902 | 500 | 0.0013 | - |
3.1792 | 550 | 0.0012 | - |
3.4682 | 600 | 0.0012 | - |
3.7572 | 650 | 0.0012 | - |
4.0462 | 700 | 0.0012 | - |
4.3353 | 750 | 0.0012 | - |
4.6243 | 800 | 0.0011 | - |
4.9133 | 850 | 0.0011 | - |
Framework Versions
- Python: 3.10.14
- SetFit: 1.1.0
- Sentence Transformers: 3.1.0
- Transformers: 4.44.0
- PyTorch: 2.4.1+cu121
- Datasets: 2.19.2
- Tokenizers: 0.19.1
Citation
BibTeX
@article{https://doi.org/10.48550/arxiv.2209.11055,
doi = {10.48550/ARXIV.2209.11055},
url = {https://arxiv.org/abs/2209.11055},
author = {Tunstall, Lewis and Reimers, Nils and Jo, Unso Eun Seo and Bates, Luke and Korat, Daniel and Wasserblat, Moshe and Pereg, Oren},
keywords = {Computation and Language (cs.CL), FOS: Computer and information sciences, FOS: Computer and information sciences},
title = {Efficient Few-Shot Learning Without Prompts},
publisher = {arXiv},
year = {2022},
copyright = {Creative Commons Attribution 4.0 International}
}