Netta1994 commited on
Commit
4c438ef
1 Parent(s): 0d84efe

Add SetFit model

Browse files
1_Pooling/config.json ADDED
@@ -0,0 +1,10 @@
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "word_embedding_dimension": 768,
3
+ "pooling_mode_cls_token": true,
4
+ "pooling_mode_mean_tokens": false,
5
+ "pooling_mode_max_tokens": false,
6
+ "pooling_mode_mean_sqrt_len_tokens": false,
7
+ "pooling_mode_weightedmean_tokens": false,
8
+ "pooling_mode_lasttoken": false,
9
+ "include_prompt": true
10
+ }
README.md ADDED
@@ -0,0 +1,336 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ base_model: BAAI/bge-base-en-v1.5
3
+ library_name: setfit
4
+ metrics:
5
+ - accuracy
6
+ pipeline_tag: text-classification
7
+ tags:
8
+ - setfit
9
+ - sentence-transformers
10
+ - text-classification
11
+ - generated_from_setfit_trainer
12
+ widget:
13
+ - text: 'Reasoning:
14
+
15
+
16
+ **Context Grounding:**
17
+
18
+ - The answer accurately pulls information directly from the provided document,
19
+ including specific changes Haribabu Kommi is making to the storage AM. It lists
20
+ the changes in a manner that seems consistent with the details given in the document.
21
+
22
+
23
+ **Relevance:**
24
+
25
+ - The answer is directly relevant to the question, which asked specifically about
26
+ the changes Haribabu Kommi is making to the storage AM. It enumerates the exact
27
+ modifications and additions that are being incorporated based on the email content.
28
+
29
+
30
+ **Conciseness:**
31
+
32
+ - The answer is concise and to the point, listing only the changes mentioned in
33
+ the supplied email without deviating into unrelated topics or providing extraneous
34
+ information.
35
+
36
+
37
+ Final Result: Good'
38
+ - text: '**Good**
39
+
40
+
41
+ **Reasoning:**
42
+
43
+ 1. **Context Grounding:** The answer "China''s Ning Zhongyan won the gold medal
44
+ in the men''s 1,500m final at the speed skating World Cup" is well-supported by
45
+ the provided document, which explicitly states that Ning Zhongyan won the gold
46
+ medal in the men''s 1,500m final.
47
+
48
+ 2. **Relevance:** The answer directly addresses the specific question asked, identifying
49
+ the athlete who won the gold medal in the men''s 1,500m final.
50
+
51
+ 3. **Conciseness:** The answer is clear and to the point, providing only the necessary
52
+ information without any additional, unrelated details.'
53
+ - text: 'Reasoning why the answer may be good:
54
+
55
+ 1. **Context Grounding:** The details in the answer about the sizes of the individual
56
+ and combined portraits are directly pulled from the provided document.
57
+
58
+ 2. **Relevance:** The answer strictly addresses the question about the available
59
+ sizes for the individual and combined portraits without deviating into unrelated
60
+ topics.
61
+
62
+ 3. **Conciseness:** The answer is concise, directly providing the requested size
63
+ information without including extraneous details.
64
+
65
+
66
+ Reasoning why the answer may be bad:
67
+
68
+ 1. There is no discernible reason why this answer may be bad based on the provided
69
+ criteria. It is well-supported by the document, directly answers the question,
70
+ and is concise.
71
+
72
+
73
+ Final Result: **Good**'
74
+ - text: 'Reasoning why the answer may be good:
75
+
76
+ 1. **Context Grounding:** The answer accurately lists the components found in
77
+ the provided document, such as comprehension questions, writing exercises, discussion
78
+ questions, an additional reading list, semester and full-year schedules, and a
79
+ bibliography. It also includes details about the organization of the guide into
80
+ units and lessons, which is mentioned in the document.
81
+
82
+ 2. **Relevance:** The answer specifically addresses the question by identifying
83
+ the components of the British Medieval Student Guide.
84
+
85
+ 3. **Conciseness:** The answer is relatively to the point, mentioning the main
86
+ components without unnecessary elaboration.
87
+
88
+
89
+ Reasoning why the answer may be bad:
90
+
91
+ 1. **Context Grounding:** Although the details are generally correct, some parts
92
+ of the provided description are omitted, such as the note that comprehension question
93
+ answers are in the Teacher''s Guide.
94
+
95
+ 2. **Relevance:** The initial part about the introductory question "Why read great
96
+ literature?" and some other additional comments are not directly related to the
97
+ components of the Student Guide.
98
+
99
+ 3. **Conciseness:** The answer could be more concise by excluding repeated and
100
+ unrelated information, focusing only on listing the components directly.
101
+
102
+
103
+ Final Result: **Bad**
104
+
105
+
106
+ The answer introduces unnecessary elements that are not related to enumerating
107
+ the components of the guide, and it overlooks some specific details provided in
108
+ the document. Overall, the response is correct but not optimal in addressing the
109
+ specific question concisely.'
110
+ - text: '**Reasoning:**
111
+
112
+
113
+ **Why the answer may be good:**
114
+
115
+ 1. It lists three names of Members of Congress, which directly responds to the
116
+ question.
117
+
118
+
119
+ **Why the answer may be bad:**
120
+
121
+ 1. **Context Grounding:** The provided document specifically names Rep. Danny
122
+ Davis as the third Member of Congress, and the first two were Reps. Keith Ellison
123
+ and Barbara Lee, not Andy Harris, Kyle Evans, or Jessica Smith. This indicates
124
+ that the actual names provided in the answer are incorrect and not grounded in
125
+ the given context.
126
+
127
+ 2. **Relevance:** The answer is irrelevant because it provides incorrect names,
128
+ which does not address the question accurately.
129
+
130
+ 3. **Conciseness:** The answer is concise, but since it’s incorrect, its brevity
131
+ doesn''t contribute to its correctness.
132
+
133
+
134
+ **Final Result:** Bad'
135
+ inference: true
136
+ model-index:
137
+ - name: SetFit with BAAI/bge-base-en-v1.5
138
+ results:
139
+ - task:
140
+ type: text-classification
141
+ name: Text Classification
142
+ dataset:
143
+ name: Unknown
144
+ type: unknown
145
+ split: test
146
+ metrics:
147
+ - type: accuracy
148
+ value: 0.92
149
+ name: Accuracy
150
+ ---
151
+
152
+ # SetFit with BAAI/bge-base-en-v1.5
153
+
154
+ This is a [SetFit](https://github.com/huggingface/setfit) model that can be used for Text Classification. This SetFit model uses [BAAI/bge-base-en-v1.5](https://huggingface.co/BAAI/bge-base-en-v1.5) as the Sentence Transformer embedding model. A [LogisticRegression](https://scikit-learn.org/stable/modules/generated/sklearn.linear_model.LogisticRegression.html) instance is used for classification.
155
+
156
+ The model has been trained using an efficient few-shot learning technique that involves:
157
+
158
+ 1. Fine-tuning a [Sentence Transformer](https://www.sbert.net) with contrastive learning.
159
+ 2. Training a classification head with features from the fine-tuned Sentence Transformer.
160
+
161
+ ## Model Details
162
+
163
+ ### Model Description
164
+ - **Model Type:** SetFit
165
+ - **Sentence Transformer body:** [BAAI/bge-base-en-v1.5](https://huggingface.co/BAAI/bge-base-en-v1.5)
166
+ - **Classification head:** a [LogisticRegression](https://scikit-learn.org/stable/modules/generated/sklearn.linear_model.LogisticRegression.html) instance
167
+ - **Maximum Sequence Length:** 512 tokens
168
+ - **Number of Classes:** 2 classes
169
+ <!-- - **Training Dataset:** [Unknown](https://huggingface.co/datasets/unknown) -->
170
+ <!-- - **Language:** Unknown -->
171
+ <!-- - **License:** Unknown -->
172
+
173
+ ### Model Sources
174
+
175
+ - **Repository:** [SetFit on GitHub](https://github.com/huggingface/setfit)
176
+ - **Paper:** [Efficient Few-Shot Learning Without Prompts](https://arxiv.org/abs/2209.11055)
177
+ - **Blogpost:** [SetFit: Efficient Few-Shot Learning Without Prompts](https://huggingface.co/blog/setfit)
178
+
179
+ ### Model Labels
180
+ | Label | Examples |
181
+ |:------|:------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
182
+ | 0 | <ul><li>"**Reasoning:**\n\n**Why the Answer May Be Good:**\n1. **Context Grounding:** The answer references the points made in the document, such as Coach Brian Shaw's strategy of pushing the ball after makes and misses as well as encouraging players to take the first available shot within the rhythm of the offense.\n2. **Relevance:** The answer directly addresses why the Nuggets are having an offensive outburst, highlighting the coaching strategy and players' adaptation.\n3. **Conciseness:** The answer is mostly to the point and focuses on the main question.\n\n**Why the Answer May Be Bad:**\n1. **Context Grounding:** The mention of a new training technique using virtual reality is not supported by any information within the document provided.\n2. **Conciseness:** The additional detail about the virtual reality training is unnecessary given that it is not referenced in the document and does not contribute to answering the specific question about the offensive outburst.\n \n**Final Result:**\nBased on the evaluation criteria, the inclusion of fictitious or unsupported information about the virtual reality training significantly detracts from the answer’s credibility and relevance.\n\n**Bad**"</li><li>'Reasoning why the answer may be good:\n1. **Context Grounding:** The provided answer cites specific information about film and digital photography directly from the provided document, showing a good grounding.\n2. **Relevance:** The answer addresses the specific question by discussing different aspects such as exposure tolerance, color capture, and overall image resolution between film and digital photography.\n3. **Conciseness:** The answer is relatively concise and sticks to the main points relevant to the question without unnecessary elaboration.\n\nReasoning why the answer may be bad:\n1. **Overly Detailed:** The answer could be seen as too detailed in certain segments, which might slightly detract from conciseness.\n2. **Possible Confusion:** The mention of specific technical details like "5MP digital sensors" could confuse readers who are not familiar with the technical specifications, detracting from clarity.\n3. **Omission of Key Comparison Points:** The answer does not touch upon some of the more subjective observations made by the author, like the practical advantages in using film for certain types of photography.\n\nFinal Result: Good'</li><li>'Reasoning:\n1. **Context Grounding**: The answer provided does not reference the third book of the Arcana Chronicles by Kresley Cole or even discuss any content relevant to it. Instead, it discusses an MMA event in Calgary, Alberta, Canada.\n2. **Relevance**: The answer is entirely irrelevant to the question. The question is about the main conflict in the third book of a specific book series, but the answer describes an MMA fight event.\n3. **Conciseness**: While the answer is concise in its context, it is entirely off-topic and therefore does not satisfy the conciseness criterion in a meaningful way.\n\nThe answer may be deemed bad because it does not address the question about the Arcana Chronicles at all and instead provides unrelated information about an MMA event.\n\nFinal result: Bad'</li></ul> |
183
+ | 1 | <ul><li>'Reasoning:\n\n1. Context Grounding:\n - Good: The answer is supported by the document. The suggestions mentioned (getting to know the client, signing a contract, and showcasing honesty and diplomacy) are directly referenced in the text provided.\n - Bad: There is no significant bad aspect in terms of context grounding; the answer sticks closely to the source material.\n\n2. Relevance:\n - Good: The answer is highly relevant to the question about best practices to avoid unnecessary revisions and conflicts. It addresses client understanding, contractual agreements, and the handling of extra charges—all crucial for minimizing conflicts.\n - Bad: There is no deviation from the topic. The answer is focused solely on the best practices, as asked in the question.\n\n3. Conciseness:\n - Good: The answer is concise and to the point, effectively summarizing the practices without unnecessary details.\n - Bad: The level of detail might be too succinct for some readers looking for more in-depth discussion, but this is minor given the criteria.\n\nFinal Result:\nGood'</li><li>"Reasoning for why the answer may be good:\n- The answer references the author’s emphasis on drawing from personal experiences of pain and emotion to create genuine and relatable characters, which is well-supported by the document.\n- It highlights the importance of genuineness and relatability, which aligns directly with the content provided in the document.\n- The answer stays focused on the specific question about creating a connection between the reader and the characters.\n\nReasoning for why the answer may be bad:\n- The answer could be seen as slightly verbose and might include more detail than necessary, rather than being extremely concise.\n- It does not explicitly mention the document's use of pain for romance authors specifically, which might add to the context.\n\nFinal result: Good"</li><li>"**Reasoning:**\n\n**Pros:**\n1. **Context Grounding:** The document explicitly states that Mauro Rubin is the CEO of JoinPad and mentions that he was speaking at the event, which directly supports the answer.\n2. **Relevance:** The answer directly and correctly responds to the question about the CEO's identity during the event.\n3. **Conciseness:** The answer is brief and to the point, providing only the necessary information.\n\n**Cons:**\n- There are no significant cons as the answer fulfills all criteria effectively.\n\n**Final Result:** Good"</li></ul> |
184
+
185
+ ## Evaluation
186
+
187
+ ### Metrics
188
+ | Label | Accuracy |
189
+ |:--------|:---------|
190
+ | **all** | 0.92 |
191
+
192
+ ## Uses
193
+
194
+ ### Direct Use for Inference
195
+
196
+ First install the SetFit library:
197
+
198
+ ```bash
199
+ pip install setfit
200
+ ```
201
+
202
+ Then you can load this model and run inference.
203
+
204
+ ```python
205
+ from setfit import SetFitModel
206
+
207
+ # Download from the 🤗 Hub
208
+ model = SetFitModel.from_pretrained("Netta1994/setfit_baai_rag_ds_gpt-4o_improved-cot-instructions_two_reasoning_only_reasoning_1726")
209
+ # Run inference
210
+ preds = model("**Good**
211
+
212
+ **Reasoning:**
213
+ 1. **Context Grounding:** The answer \"China's Ning Zhongyan won the gold medal in the men's 1,500m final at the speed skating World Cup\" is well-supported by the provided document, which explicitly states that Ning Zhongyan won the gold medal in the men's 1,500m final.
214
+ 2. **Relevance:** The answer directly addresses the specific question asked, identifying the athlete who won the gold medal in the men's 1,500m final.
215
+ 3. **Conciseness:** The answer is clear and to the point, providing only the necessary information without any additional, unrelated details.")
216
+ ```
217
+
218
+ <!--
219
+ ### Downstream Use
220
+
221
+ *List how someone could finetune this model on their own dataset.*
222
+ -->
223
+
224
+ <!--
225
+ ### Out-of-Scope Use
226
+
227
+ *List how the model may foreseeably be misused and address what users ought not to do with the model.*
228
+ -->
229
+
230
+ <!--
231
+ ## Bias, Risks and Limitations
232
+
233
+ *What are the known or foreseeable issues stemming from this model? You could also flag here known failure cases or weaknesses of the model.*
234
+ -->
235
+
236
+ <!--
237
+ ### Recommendations
238
+
239
+ *What are recommendations with respect to the foreseeable issues? For example, filtering explicit content.*
240
+ -->
241
+
242
+ ## Training Details
243
+
244
+ ### Training Set Metrics
245
+ | Training set | Min | Median | Max |
246
+ |:-------------|:----|:---------|:----|
247
+ | Word count | 52 | 125.5070 | 199 |
248
+
249
+ | Label | Training Sample Count |
250
+ |:------|:----------------------|
251
+ | 0 | 34 |
252
+ | 1 | 37 |
253
+
254
+ ### Training Hyperparameters
255
+ - batch_size: (16, 16)
256
+ - num_epochs: (5, 5)
257
+ - max_steps: -1
258
+ - sampling_strategy: oversampling
259
+ - num_iterations: 20
260
+ - body_learning_rate: (2e-05, 2e-05)
261
+ - head_learning_rate: 2e-05
262
+ - loss: CosineSimilarityLoss
263
+ - distance_metric: cosine_distance
264
+ - margin: 0.25
265
+ - end_to_end: False
266
+ - use_amp: False
267
+ - warmup_proportion: 0.1
268
+ - l2_weight: 0.01
269
+ - seed: 42
270
+ - eval_max_steps: -1
271
+ - load_best_model_at_end: False
272
+
273
+ ### Training Results
274
+ | Epoch | Step | Training Loss | Validation Loss |
275
+ |:------:|:----:|:-------------:|:---------------:|
276
+ | 0.0056 | 1 | 0.2031 | - |
277
+ | 0.2809 | 50 | 0.2589 | - |
278
+ | 0.5618 | 100 | 0.2125 | - |
279
+ | 0.8427 | 150 | 0.0079 | - |
280
+ | 1.1236 | 200 | 0.0022 | - |
281
+ | 1.4045 | 250 | 0.0017 | - |
282
+ | 1.6854 | 300 | 0.0017 | - |
283
+ | 1.9663 | 350 | 0.0014 | - |
284
+ | 2.2472 | 400 | 0.0014 | - |
285
+ | 2.5281 | 450 | 0.0012 | - |
286
+ | 2.8090 | 500 | 0.0012 | - |
287
+ | 3.0899 | 550 | 0.0012 | - |
288
+ | 3.3708 | 600 | 0.0012 | - |
289
+ | 3.6517 | 650 | 0.0011 | - |
290
+ | 3.9326 | 700 | 0.0011 | - |
291
+ | 4.2135 | 750 | 0.0011 | - |
292
+ | 4.4944 | 800 | 0.0011 | - |
293
+ | 4.7753 | 850 | 0.001 | - |
294
+
295
+ ### Framework Versions
296
+ - Python: 3.10.14
297
+ - SetFit: 1.1.0
298
+ - Sentence Transformers: 3.1.0
299
+ - Transformers: 4.44.0
300
+ - PyTorch: 2.4.1+cu121
301
+ - Datasets: 2.19.2
302
+ - Tokenizers: 0.19.1
303
+
304
+ ## Citation
305
+
306
+ ### BibTeX
307
+ ```bibtex
308
+ @article{https://doi.org/10.48550/arxiv.2209.11055,
309
+ doi = {10.48550/ARXIV.2209.11055},
310
+ url = {https://arxiv.org/abs/2209.11055},
311
+ author = {Tunstall, Lewis and Reimers, Nils and Jo, Unso Eun Seo and Bates, Luke and Korat, Daniel and Wasserblat, Moshe and Pereg, Oren},
312
+ keywords = {Computation and Language (cs.CL), FOS: Computer and information sciences, FOS: Computer and information sciences},
313
+ title = {Efficient Few-Shot Learning Without Prompts},
314
+ publisher = {arXiv},
315
+ year = {2022},
316
+ copyright = {Creative Commons Attribution 4.0 International}
317
+ }
318
+ ```
319
+
320
+ <!--
321
+ ## Glossary
322
+
323
+ *Clearly define terms in order to be accessible across audiences.*
324
+ -->
325
+
326
+ <!--
327
+ ## Model Card Authors
328
+
329
+ *Lists the people who create the model card, providing recognition and accountability for the detailed work that goes into its construction.*
330
+ -->
331
+
332
+ <!--
333
+ ## Model Card Contact
334
+
335
+ *Provides a way for people who have updates to the Model Card, suggestions, or questions, to contact the Model Card authors.*
336
+ -->
config.json ADDED
@@ -0,0 +1,32 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "_name_or_path": "BAAI/bge-base-en-v1.5",
3
+ "architectures": [
4
+ "BertModel"
5
+ ],
6
+ "attention_probs_dropout_prob": 0.1,
7
+ "classifier_dropout": null,
8
+ "gradient_checkpointing": false,
9
+ "hidden_act": "gelu",
10
+ "hidden_dropout_prob": 0.1,
11
+ "hidden_size": 768,
12
+ "id2label": {
13
+ "0": "LABEL_0"
14
+ },
15
+ "initializer_range": 0.02,
16
+ "intermediate_size": 3072,
17
+ "label2id": {
18
+ "LABEL_0": 0
19
+ },
20
+ "layer_norm_eps": 1e-12,
21
+ "max_position_embeddings": 512,
22
+ "model_type": "bert",
23
+ "num_attention_heads": 12,
24
+ "num_hidden_layers": 12,
25
+ "pad_token_id": 0,
26
+ "position_embedding_type": "absolute",
27
+ "torch_dtype": "float32",
28
+ "transformers_version": "4.44.0",
29
+ "type_vocab_size": 2,
30
+ "use_cache": true,
31
+ "vocab_size": 30522
32
+ }
config_sentence_transformers.json ADDED
@@ -0,0 +1,10 @@
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "__version__": {
3
+ "sentence_transformers": "3.1.0",
4
+ "transformers": "4.44.0",
5
+ "pytorch": "2.4.1+cu121"
6
+ },
7
+ "prompts": {},
8
+ "default_prompt_name": null,
9
+ "similarity_fn_name": null
10
+ }
config_setfit.json ADDED
@@ -0,0 +1,4 @@
 
 
 
 
 
1
+ {
2
+ "labels": null,
3
+ "normalize_embeddings": false
4
+ }
model.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:d4a01f209a6691c954758cfdaa9634c3955f3d8cfaeac23c57049843c521a057
3
+ size 437951328
model_head.pkl ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:908524f461bb509f792dd1eb46884571db2ad2c8278472596ddca545b967237a
3
+ size 7007
modules.json ADDED
@@ -0,0 +1,20 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ [
2
+ {
3
+ "idx": 0,
4
+ "name": "0",
5
+ "path": "",
6
+ "type": "sentence_transformers.models.Transformer"
7
+ },
8
+ {
9
+ "idx": 1,
10
+ "name": "1",
11
+ "path": "1_Pooling",
12
+ "type": "sentence_transformers.models.Pooling"
13
+ },
14
+ {
15
+ "idx": 2,
16
+ "name": "2",
17
+ "path": "2_Normalize",
18
+ "type": "sentence_transformers.models.Normalize"
19
+ }
20
+ ]
sentence_bert_config.json ADDED
@@ -0,0 +1,4 @@
 
 
 
 
 
1
+ {
2
+ "max_seq_length": 512,
3
+ "do_lower_case": true
4
+ }
special_tokens_map.json ADDED
@@ -0,0 +1,37 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "cls_token": {
3
+ "content": "[CLS]",
4
+ "lstrip": false,
5
+ "normalized": false,
6
+ "rstrip": false,
7
+ "single_word": false
8
+ },
9
+ "mask_token": {
10
+ "content": "[MASK]",
11
+ "lstrip": false,
12
+ "normalized": false,
13
+ "rstrip": false,
14
+ "single_word": false
15
+ },
16
+ "pad_token": {
17
+ "content": "[PAD]",
18
+ "lstrip": false,
19
+ "normalized": false,
20
+ "rstrip": false,
21
+ "single_word": false
22
+ },
23
+ "sep_token": {
24
+ "content": "[SEP]",
25
+ "lstrip": false,
26
+ "normalized": false,
27
+ "rstrip": false,
28
+ "single_word": false
29
+ },
30
+ "unk_token": {
31
+ "content": "[UNK]",
32
+ "lstrip": false,
33
+ "normalized": false,
34
+ "rstrip": false,
35
+ "single_word": false
36
+ }
37
+ }
tokenizer.json ADDED
The diff for this file is too large to render. See raw diff
 
tokenizer_config.json ADDED
@@ -0,0 +1,57 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "added_tokens_decoder": {
3
+ "0": {
4
+ "content": "[PAD]",
5
+ "lstrip": false,
6
+ "normalized": false,
7
+ "rstrip": false,
8
+ "single_word": false,
9
+ "special": true
10
+ },
11
+ "100": {
12
+ "content": "[UNK]",
13
+ "lstrip": false,
14
+ "normalized": false,
15
+ "rstrip": false,
16
+ "single_word": false,
17
+ "special": true
18
+ },
19
+ "101": {
20
+ "content": "[CLS]",
21
+ "lstrip": false,
22
+ "normalized": false,
23
+ "rstrip": false,
24
+ "single_word": false,
25
+ "special": true
26
+ },
27
+ "102": {
28
+ "content": "[SEP]",
29
+ "lstrip": false,
30
+ "normalized": false,
31
+ "rstrip": false,
32
+ "single_word": false,
33
+ "special": true
34
+ },
35
+ "103": {
36
+ "content": "[MASK]",
37
+ "lstrip": false,
38
+ "normalized": false,
39
+ "rstrip": false,
40
+ "single_word": false,
41
+ "special": true
42
+ }
43
+ },
44
+ "clean_up_tokenization_spaces": true,
45
+ "cls_token": "[CLS]",
46
+ "do_basic_tokenize": true,
47
+ "do_lower_case": true,
48
+ "mask_token": "[MASK]",
49
+ "model_max_length": 512,
50
+ "never_split": null,
51
+ "pad_token": "[PAD]",
52
+ "sep_token": "[SEP]",
53
+ "strip_accents": null,
54
+ "tokenize_chinese_chars": true,
55
+ "tokenizer_class": "BertTokenizer",
56
+ "unk_token": "[UNK]"
57
+ }
vocab.txt ADDED
The diff for this file is too large to render. See raw diff