northstaranlyticsma24 commited on
Commit
0a82693
1 Parent(s): 87981a0

Add new SentenceTransformer model.

Browse files
1_Pooling/config.json ADDED
@@ -0,0 +1,10 @@
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "word_embedding_dimension": 768,
3
+ "pooling_mode_cls_token": true,
4
+ "pooling_mode_mean_tokens": false,
5
+ "pooling_mode_max_tokens": false,
6
+ "pooling_mode_mean_sqrt_len_tokens": false,
7
+ "pooling_mode_weightedmean_tokens": false,
8
+ "pooling_mode_lasttoken": false,
9
+ "include_prompt": true
10
+ }
README.md ADDED
@@ -0,0 +1,833 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ base_model: Snowflake/snowflake-arctic-embed-m
3
+ library_name: sentence-transformers
4
+ metrics:
5
+ - cosine_accuracy@1
6
+ - cosine_accuracy@3
7
+ - cosine_accuracy@5
8
+ - cosine_accuracy@10
9
+ - cosine_precision@1
10
+ - cosine_precision@3
11
+ - cosine_precision@5
12
+ - cosine_precision@10
13
+ - cosine_recall@1
14
+ - cosine_recall@3
15
+ - cosine_recall@5
16
+ - cosine_recall@10
17
+ - cosine_ndcg@10
18
+ - cosine_mrr@10
19
+ - cosine_map@100
20
+ - dot_accuracy@1
21
+ - dot_accuracy@3
22
+ - dot_accuracy@5
23
+ - dot_accuracy@10
24
+ - dot_precision@1
25
+ - dot_precision@3
26
+ - dot_precision@5
27
+ - dot_precision@10
28
+ - dot_recall@1
29
+ - dot_recall@3
30
+ - dot_recall@5
31
+ - dot_recall@10
32
+ - dot_ndcg@10
33
+ - dot_mrr@10
34
+ - dot_map@100
35
+ pipeline_tag: sentence-similarity
36
+ tags:
37
+ - sentence-transformers
38
+ - sentence-similarity
39
+ - feature-extraction
40
+ - generated_from_trainer
41
+ - dataset_size:363
42
+ - loss:MatryoshkaLoss
43
+ - loss:MultipleNegativesRankingLoss
44
+ widget:
45
+ - source_sentence: What are some examples of algorithmic discrimination mentioned
46
+ in the context, and how do they impact different areas such as hiring and healthcare?
47
+ sentences:
48
+ - "For example, facial recognition technology that can contribute to wrongful and\
49
+ \ discriminatory \narrests,31 hiring algorithms that inform discriminatory decisions,\
50
+ \ and healthcare algorithms that discount \nthe severity of certain diseases in\
51
+ \ Black Americans. Instances of discriminatory practices built into and \nresulting\
52
+ \ from AI and other automated systems exist across many industries, areas, and\
53
+ \ contexts. While automated \nsystems have the capacity to drive extraordinary\
54
+ \ advances and innovations, algorithmic discrimination \nprotections should be\
55
+ \ built into their design, deployment, and ongoing use. Many companies, non-profits,\
56
+ \ and federal government agencies are already taking steps to ensure the public\
57
+ \ \nis protected from algorithmic discrimination. Some companies have instituted\
58
+ \ bias testing as part of their product \nquality assessment and launch procedures,\
59
+ \ and in some cases this testing has led products to be changed or not \nlaunched,\
60
+ \ preventing harm to the public. Federal government agencies have been developing\
61
+ \ standards and guidance \nfor the use of automated systems in order to help prevent\
62
+ \ bias. Non-profits and companies have developed best \npractices for audits and\
63
+ \ impact assessments to help identify potential algorithmic discrimination and\
64
+ \ provide \ntransparency to the public in the mitigation of such biases. But there\
65
+ \ is much more work to do to protect the public from algorithmic discrimination\
66
+ \ to use and design \nautomated systems in an equitable way. The guardrails protecting\
67
+ \ the public from discrimination in their daily \nlives should include their digital\
68
+ \ lives and impacts—basic safeguards against abuse, bias, and discrimination to\
69
+ \ \nensure that all people are treated fairly when automated systems are used.\
70
+ \ This includes all dimensions of their \nlives, from hiring to loan approvals,\
71
+ \ from medical treatment and payment to encounters with the criminal \njustice\
72
+ \ system. Ensuring equity should also go beyond existing guardrails to consider\
73
+ \ the holistic impact that \nautomated systems make on underserved communities\
74
+ \ and to institute proactive protections that support these \ncommunities. •\n\
75
+ An automated system using nontraditional factors such as educational attainment\
76
+ \ and employment history as\npart of its loan underwriting and pricing model was\
77
+ \ found to be much more likely to charge an applicant who\nattended a Historically\
78
+ \ Black College or University (HBCU) higher loan prices for refinancing a student\
79
+ \ loan\nthan an applicant who did not attend an HBCU. This was found to be true\
80
+ \ even when controlling for\nother credit-related factors.32\n•\nA hiring tool\
81
+ \ that learned the features of a company's employees (predominantly men) rejected\
82
+ \ women appli­\ncants for spurious and discriminatory reasons; resumes with the\
83
+ \ word “women’s,” such as “women’s\nchess club captain,” were penalized in the\
84
+ \ candidate ranking.33\n•\nA predictive model marketed as being able to predict\
85
+ \ whether students are likely to drop out of school was\nused by more than 500\
86
+ \ universities across the country. The model was found to use race directly as\
87
+ \ a predictor,\nand also shown to have large disparities by race; Black students\
88
+ \ were as many as four times as likely as their\notherwise similar white peers\
89
+ \ to be deemed at high risk of dropping out. These risk scores are used by advisors\
90
+ \ \nto guide students towards or away from majors, and some worry that they are\
91
+ \ being used to guide\nBlack students away from math and science subjects.34\n\
92
+ •\nA risk assessment tool designed to predict the risk of recidivism for individuals\
93
+ \ in federal custody showed\nevidence of disparity in prediction. The tool overpredicts\
94
+ \ the risk of recidivism for some groups of color on the\ngeneral recidivism tools,\
95
+ \ and underpredicts the risk of recidivism for some groups of color on some of\
96
+ \ the\nviolent recidivism tools. The Department of Justice is working to reduce\
97
+ \ these disparities and has\npublicly released a report detailing its review of\
98
+ \ the tool.35 \n24\n"
99
+ - "SECTION: APPENDIX: EXAMPLES OF AUTOMATED SYSTEMS\nAPPENDIX\nSystems that impact\
100
+ \ the safety of communities such as automated traffic control systems, elec \n\
101
+ -ctrical grid controls, smart city technologies, and industrial emissions and\
102
+ \ environmental\nimpact control algorithms; and\nSystems related to access to\
103
+ \ benefits or services or assignment of penalties such as systems that\nsupport\
104
+ \ decision-makers who adjudicate benefits such as collating or analyzing information\
105
+ \ or\nmatching records, systems which similarly assist in the adjudication of\
106
+ \ administrative or criminal\npenalties, fraud detection algorithms, services\
107
+ \ or benefits access control algorithms, biometric\nsystems used as access control,\
108
+ \ and systems which make benefits or services related decisions on a\nfully or\
109
+ \ partially autonomous basis (such as a determination to revoke benefits). 54\n"
110
+ - "SECTION: SAFE AND EFFECTIVE SYSTEMS\n \n \n \n \n \n \n \nSAFE AND EFFECTIVE\
111
+ \ \nSYSTEMS \nWHAT SHOULD BE EXPECTED OF AUTOMATED SYSTEMS\nThe expectations for\
112
+ \ automated systems are meant to serve as a blueprint for the development of additional\
113
+ \ \ntechnical standards and practices that are tailored for particular sectors\
114
+ \ and contexts. In order to ensure that an automated system is safe and effective,\
115
+ \ it should include safeguards to protect the \npublic from harm in a proactive\
116
+ \ and ongoing manner; avoid use of data inappropriate for or irrelevant to the\
117
+ \ task \nat hand, including reuse that could cause compounded harm; and demonstrate\
118
+ \ the safety and effectiveness of \nthe system. These expectations are explained\
119
+ \ below. Protect the public from harm in a proactive and ongoing manner \nConsultation.\
120
+ \ The public should be consulted in the design, implementation, deployment, acquisition,\
121
+ \ and \nmaintenance phases of automated system development, with emphasis on early-stage\
122
+ \ consultation before a \nsystem is introduced or a large change implemented.\
123
+ \ This consultation should directly engage diverse impact­\ned communities to\
124
+ \ consider concerns and risks that may be unique to those communities, or disproportionate­\n\
125
+ ly prevalent or severe for them. The extent of this engagement and the form of\
126
+ \ outreach to relevant stakehold­\ners may differ depending on the specific automated\
127
+ \ system and development phase, but should include \nsubject matter, sector-specific,\
128
+ \ and context-specific experts as well as experts on potential impacts such as\
129
+ \ \ncivil rights, civil liberties, and privacy experts. For private sector applications,\
130
+ \ consultations before product \nlaunch may need to be confidential. Government\
131
+ \ applications, particularly law enforcement applications or \napplications that\
132
+ \ raise national security considerations, may require confidential or limited\
133
+ \ engagement based \non system sensitivities and preexisting oversight laws and\
134
+ \ structures. Concerns raised in this consultation \nshould be documented, and\
135
+ \ the automated system developers were proposing to create, use, or deploy should\
136
+ \ \nbe reconsidered based on this feedback."
137
+ - source_sentence: What are some key needs identified by panelists for the future
138
+ design of critical AI systems?
139
+ sentences:
140
+ - "It included discussion of the \ntechnical aspects \nof \ndesigning \nnon-discriminatory\
141
+ \ \ntechnology, \nexplainable \nAI, \nhuman-computer \ninteraction with an emphasis\
142
+ \ on community participation, and privacy-aware design. Welcome:\n•\nSorelle Friedler,\
143
+ \ Assistant Director for Data and Democracy, White House Office of Science and\n\
144
+ Technology Policy\n•\nJ. Bob Alotta, Vice President for Global Programs, Mozilla\
145
+ \ Foundation\n•\nNavrina Singh, Board Member, Mozilla Foundation\nModerator: Kathy\
146
+ \ Pham Evans, Deputy Chief Technology Officer for Product and Engineering, U.S\
147
+ \ \nFederal Trade Commission. Panelists: \n•\nLiz O’Sullivan, CEO, Parity AI\n\
148
+ •\nTimnit Gebru, Independent Scholar\n•\nJennifer Wortman Vaughan, Senior Principal\
149
+ \ Researcher, Microsoft Research, New York City\n•\nPamela Wisniewski, Associate\
150
+ \ Professor of Computer Science, University of Central Florida; Director,\nSocio-technical\
151
+ \ Interaction Research (STIR) Lab\n•\nSeny Kamara, Associate Professor of Computer\
152
+ \ Science, Brown University\nEach panelist individually emphasized the risks of\
153
+ \ using AI in high-stakes settings, including the potential for \nbiased data\
154
+ \ and discriminatory outcomes, opaque decision-making processes, and lack of public\
155
+ \ trust and \nunderstanding of the algorithmic systems. The interventions and\
156
+ \ key needs various panelists put forward as \nnecessary to the future design\
157
+ \ of critical AI systems included ongoing transparency, value sensitive and \n\
158
+ participatory design, explanations designed for relevant stakeholders, and public\
159
+ \ consultation. Various \npanelists emphasized the importance of placing trust\
160
+ \ in people, not technologies, and in engaging with \nimpacted communities to\
161
+ \ understand the potential harms of technologies and build protection by design\
162
+ \ into \nfuture systems. Panel 5: Social Welfare and Development. This event explored\
163
+ \ current and emerging uses of technology to \nimplement or improve social welfare\
164
+ \ systems, social development programs, and other systems that can impact \nlife\
165
+ \ chances. Welcome:\n•\nSuresh Venkatasubramanian, Assistant Director for Science\
166
+ \ and Justice, White House Office of Science\nand Technology Policy\n•\nAnne-Marie\
167
+ \ Slaughter, CEO, New America\nModerator: Michele Evermore, Deputy Director for\
168
+ \ Policy, Office of Unemployment Insurance \nModernization, Office of the Secretary,\
169
+ \ Department of Labor \nPanelists:\n•\nBlake Hall, CEO and Founder, ID.Me\n•\n\
170
+ Karrie Karahalios, Professor of Computer Science, University of Illinois, Urbana-Champaign\n\
171
+ •\nChristiaan van Veen, Director of Digital Welfare State and Human Rights Project,\
172
+ \ NYU School of Law's\nCenter for Human Rights and Global Justice\n58\n"
173
+ - '20, 2021. https://www.vice.com/en/article/88npjv/amazons-ai-cameras-are-punishing­
174
+
175
+ drivers-for-mistakes-they-didnt-make
176
+
177
+ 63
178
+
179
+ '
180
+ - 'Jan. 11, 2022. https://themarkup.org/machine-learning/2022/01/11/this-private-equity-firm-is-amassing-companies­
181
+
182
+ that-collect-data-on-americas-children
183
+
184
+ 77. Reed Albergotti. Every employee who leaves Apple becomes an ‘associate’: In
185
+ job databases used by
186
+
187
+ employers to verify resume information, every former Apple employee’s title gets
188
+ erased and replaced with
189
+
190
+ a generic title. The Washington Post.'
191
+ - source_sentence: How do automated identity controls at airports ensure assistance
192
+ for individuals facing misidentification?
193
+ sentences:
194
+ - "SECTION: ALGORITHMIC DISCRIMINATION PROTECTIONS\n ­­­­­­­\nALGORITHMIC DISCRIMINATION\
195
+ \ Protections\nYou should not face discrimination by algorithms \nand systems\
196
+ \ should be used and designed in an \nequitable \nway. Algorithmic \ndiscrimination\
197
+ \ \noccurs when \nautomated systems contribute to unjustified different treatment\
198
+ \ or \nimpacts disfavoring people based on their race, color, ethnicity, \nsex\
199
+ \ \n(including \npregnancy, \nchildbirth, \nand \nrelated \nmedical \nconditions,\
200
+ \ \ngender \nidentity, \nintersex \nstatus, \nand \nsexual \norientation), religion,\
201
+ \ age, national origin, disability, veteran status, \ngenetic infor-mation, or\
202
+ \ any other classification protected by law. Depending on the specific circumstances,\
203
+ \ such algorithmic \ndiscrimination may violate legal protections. Designers,\
204
+ \ developers, \nand deployers of automated systems should take proactive and \n\
205
+ continuous measures to protect individuals and communities \nfrom algorithmic\
206
+ \ discrimination and to use and design systems in \nan equitable way. This protection\
207
+ \ should include proactive equity \nassessments as part of the system design,\
208
+ \ use of representative data \nand protection against proxies for demographic\
209
+ \ features, ensuring \naccessibility for people with disabilities in design and\
210
+ \ development, \npre-deployment and ongoing disparity testing and mitigation,\
211
+ \ and \nclear organizational oversight. Independent evaluation and plain \nlanguage\
212
+ \ reporting in the form of an algorithmic impact assessment, \nincluding disparity\
213
+ \ testing results and mitigation information, \nshould be performed and made public\
214
+ \ whenever possible to confirm \nthese protections."
215
+ - "These critical protections have been adopted in some scenarios. Where automated\
216
+ \ systems have been introduced to \nprovide the public access to government benefits,\
217
+ \ existing human paper and phone-based processes are generally still \nin place,\
218
+ \ providing an important alternative to ensure access. Companies that have introduced\
219
+ \ automated call centers \noften retain the option of dialing zero to reach an\
220
+ \ operator. When automated identity controls are in place to board an \nairplane\
221
+ \ or enter the country, there is a person supervising the systems who can be turned\
222
+ \ to for help or to appeal a \nmisidentification. The American people deserve\
223
+ \ the reassurance that such procedures are in place to protect their rights, opportunities,\
224
+ \ \nand access."
225
+ - "SECTION: APPENDIX: EXAMPLES OF AUTOMATED SYSTEMS\nAPPENDIX\nSystems that impact\
226
+ \ the safety of communities such as automated traffic control systems, elec \n\
227
+ -ctrical grid controls, smart city technologies, and industrial emissions and\
228
+ \ environmental\nimpact control algorithms; and\nSystems related to access to\
229
+ \ benefits or services or assignment of penalties such as systems that\nsupport\
230
+ \ decision-makers who adjudicate benefits such as collating or analyzing information\
231
+ \ or\nmatching records, systems which similarly assist in the adjudication of\
232
+ \ administrative or criminal\npenalties, fraud detection algorithms, services\
233
+ \ or benefits access control algorithms, biometric\nsystems used as access control,\
234
+ \ and systems which make benefits or services related decisions on a\nfully or\
235
+ \ partially autonomous basis (such as a determination to revoke benefits). 54\n"
236
+ - source_sentence: How should the availability of human consideration and fallback
237
+ mechanisms be determined in relation to the potential impact of automated systems
238
+ on rights, opportunities, or access?
239
+ sentences:
240
+ - "In many scenarios, there is a reasonable expectation \nof human involvement in\
241
+ \ attaining rights, opportunities, or access. When automated systems make up part\
242
+ \ of \nthe attainment process, alternative timely human-driven processes should\
243
+ \ be provided. The use of a human \nalternative should be triggered by an opt-out\
244
+ \ process. Timely and not burdensome human alternative. Opting out should be timely\
245
+ \ and not unreasonably \nburdensome in both the process of requesting to opt-out\
246
+ \ and the human-driven alternative provided. Provide timely human consideration\
247
+ \ and remedy by a fallback and escalation system in the \nevent that an automated\
248
+ \ system fails, produces error, or you would like to appeal or con­\ntest its\
249
+ \ impacts on you \nProportionate. The availability of human consideration and\
250
+ \ fallback, along with associated training and \nsafeguards against human bias,\
251
+ \ should be proportionate to the potential of the automated system to meaning­\n\
252
+ fully impact rights, opportunities, or access. Automated systems that have greater\
253
+ \ control over outcomes, \nprovide input to high-stakes decisions, relate to sensitive\
254
+ \ domains, or otherwise have a greater potential to \nmeaningfully impact rights,\
255
+ \ opportunities, or access should have greater availability (e.g., staffing) and\
256
+ \ over­\nsight of human consideration and fallback mechanisms. Accessible. Mechanisms\
257
+ \ for human consideration and fallback, whether in-person, on paper, by phone,\
258
+ \ or \notherwise provided, should be easy to find and use. These mechanisms should\
259
+ \ be tested to ensure that users \nwho have trouble with the automated system\
260
+ \ are able to use human consideration and fallback, with the under­\nstanding\
261
+ \ that it may be these users who are most likely to need the human assistance.\
262
+ \ Similarly, it should be \ntested to ensure that users with disabilities are\
263
+ \ able to find and use human consideration and fallback and also \nrequest reasonable\
264
+ \ accommodations or modifications. Convenient. Mechanisms for human consideration\
265
+ \ and fallback should not be unreasonably burdensome as \ncompared to the automated\
266
+ \ system’s equivalent. 49\n"
267
+ - "SECTION: DATA PRIVACY\n \n \n \n \n \n \nDATA PRIVACY \nWHAT SHOULD BE EXPECTED\
268
+ \ OF AUTOMATED SYSTEMS\nThe expectations for automated systems are meant to serve\
269
+ \ as a blueprint for the development of additional \ntechnical standards and practices\
270
+ \ that are tailored for particular sectors and contexts. Data access and correction.\
271
+ \ People whose data is collected, used, shared, or stored by automated \nsystems\
272
+ \ should be able to access data and metadata about themselves, know who has access\
273
+ \ to this data, and \nbe able to correct it if necessary. Entities should receive\
274
+ \ consent before sharing data with other entities and \nshould keep records of\
275
+ \ what data is shared and with whom. Consent withdrawal and data deletion. Entities\
276
+ \ should allow (to the extent legally permissible) with­\ndrawal of data access\
277
+ \ consent, resulting in the deletion of user data, metadata, and the timely removal\
278
+ \ of \ntheir data from any systems (e.g., machine learning models) derived from\
279
+ \ that data.68\nAutomated system support. Entities designing, developing, and\
280
+ \ deploying automated systems should \nestablish and maintain the capabilities\
281
+ \ that will allow individuals to use their own automated systems to help \nthem\
282
+ \ make consent, access, and control decisions in a complex data ecosystem. Capabilities\
283
+ \ include machine \nreadable data, standardized data formats, metadata or tags\
284
+ \ for expressing data processing permissions and \npreferences and data provenance\
285
+ \ and lineage, context of use and access-specific tags, and training models for\
286
+ \ \nassessing privacy risk. Demonstrate that data privacy and user control are\
287
+ \ protected \nIndependent evaluation. As described in the section on Safe and\
288
+ \ Effective Systems, entities should allow \nindependent evaluation of the claims\
289
+ \ made regarding data policies. These independent evaluations should be \nmade\
290
+ \ public whenever possible. Care will need to be taken to balance individual privacy\
291
+ \ with evaluation data \naccess needs."
292
+ - "SECTION: NOTICE AND EXPLANATION\n \n \n \n \n \nNOTICE & \nEXPLANATION \nWHY\
293
+ \ THIS PRINCIPLE IS IMPORTANT\nThis section provides a brief summary of the problems\
294
+ \ which the principle seeks to address and protect \nagainst, including illustrative\
295
+ \ examples. •\nA predictive policing system claimed to identify individuals at\
296
+ \ greatest risk to commit or become the victim of\ngun violence (based on automated\
297
+ \ analysis of social ties to gang members, criminal histories, previous experi­\n\
298
+ ences of gun violence, and other factors) and led to individuals being placed\
299
+ \ on a watch list with no\nexplanation or public transparency regarding how the\
300
+ \ system came to its conclusions.85 Both police and\nthe public deserve to understand\
301
+ \ why and how such a system is making these determinations. •\nA system awarding\
302
+ \ benefits changed its criteria invisibly."
303
+ - source_sentence: What topics were discussed during the meetings related to the development
304
+ of the Blueprint for an AI Bill of Rights?
305
+ sentences:
306
+ - " \nGAI systems can produce content that is inciting, radicalizing, or threatening,\
307
+ \ or that glorifies violence, \nwith greater ease and scale than other technologies.\
308
+ \ LLMs have been reported to generate dangerous or \nviolent recommendations,\
309
+ \ and some models have generated actionable instructions for dangerous or \n \n\
310
+ \ \n9 Confabulations of falsehoods are most commonly a problem for text-based\
311
+ \ outputs; for audio, image, or video \ncontent, creative generation of non-factual\
312
+ \ content can be a desired behavior. 10 For example, legal confabulations have\
313
+ \ been shown to be pervasive in current state-of-the-art LLMs. See also, \ne.g.,\
314
+ \ \n \n7 \nunethical behavior."
315
+ - 'SECTION: LISTENING TO THE AMERICAN PEOPLE
316
+
317
+ APPENDIX
318
+
319
+ • OSTP conducted meetings with a variety of stakeholders in the private sector
320
+ and civil society. Some of these
321
+
322
+ meetings were specifically focused on providing ideas related to the development
323
+ of the Blueprint for an AI
324
+
325
+ Bill of Rights while others provided useful general context on the positive use
326
+ cases, potential harms, and/or
327
+
328
+ oversight possibilities for these technologies.'
329
+ - "Transgender travelers have described degrading experiences associated\nwith these\
330
+ \ extra screenings.43 TSA has recently announced plans to implement a gender-neutral\
331
+ \ algorithm44 \nwhile simultaneously enhancing the security effectiveness capabilities\
332
+ \ of the existing technology. •\nThe National Disabled Law Students Association\
333
+ \ expressed concerns that individuals with disabilities were\nmore likely to be\
334
+ \ flagged as potentially suspicious by remote proctoring AI systems because of\
335
+ \ their disabili-\nty-specific access needs such as needing longer breaks or using\
336
+ \ screen readers or dictation software.45 \n•\nAn algorithm designed to identify\
337
+ \ patients with high needs for healthcare systematically assigned lower\nscores\
338
+ \ (indicating that they were not as high need) to Black patients than to those\
339
+ \ of white patients, even\nwhen those patients had similar numbers of chronic\
340
+ \ conditions and other markers of health.46 In addition,\nhealthcare clinical\
341
+ \ algorithms that are used by physicians to guide clinical decisions may include\n\
342
+ sociodemographic variables that adjust or “correct” the algorithm’s output on\
343
+ \ the basis of a patient’s race or\nethnicity, which can lead to race-based health\
344
+ \ inequities.47\n25\nAlgorithmic \nDiscrimination \nProtections \n"
345
+ model-index:
346
+ - name: SentenceTransformer based on Snowflake/snowflake-arctic-embed-m
347
+ results:
348
+ - task:
349
+ type: information-retrieval
350
+ name: Information Retrieval
351
+ dataset:
352
+ name: Unknown
353
+ type: unknown
354
+ metrics:
355
+ - type: cosine_accuracy@1
356
+ value: 0.7608695652173914
357
+ name: Cosine Accuracy@1
358
+ - type: cosine_accuracy@3
359
+ value: 0.8695652173913043
360
+ name: Cosine Accuracy@3
361
+ - type: cosine_accuracy@5
362
+ value: 0.9130434782608695
363
+ name: Cosine Accuracy@5
364
+ - type: cosine_accuracy@10
365
+ value: 0.9782608695652174
366
+ name: Cosine Accuracy@10
367
+ - type: cosine_precision@1
368
+ value: 0.7608695652173914
369
+ name: Cosine Precision@1
370
+ - type: cosine_precision@3
371
+ value: 0.2898550724637682
372
+ name: Cosine Precision@3
373
+ - type: cosine_precision@5
374
+ value: 0.18260869565217389
375
+ name: Cosine Precision@5
376
+ - type: cosine_precision@10
377
+ value: 0.0978260869565217
378
+ name: Cosine Precision@10
379
+ - type: cosine_recall@1
380
+ value: 0.7608695652173914
381
+ name: Cosine Recall@1
382
+ - type: cosine_recall@3
383
+ value: 0.8695652173913043
384
+ name: Cosine Recall@3
385
+ - type: cosine_recall@5
386
+ value: 0.9130434782608695
387
+ name: Cosine Recall@5
388
+ - type: cosine_recall@10
389
+ value: 0.9782608695652174
390
+ name: Cosine Recall@10
391
+ - type: cosine_ndcg@10
392
+ value: 0.8567216523715442
393
+ name: Cosine Ndcg@10
394
+ - type: cosine_mrr@10
395
+ value: 0.8190217391304349
396
+ name: Cosine Mrr@10
397
+ - type: cosine_map@100
398
+ value: 0.8203804347826088
399
+ name: Cosine Map@100
400
+ - type: dot_accuracy@1
401
+ value: 0.7608695652173914
402
+ name: Dot Accuracy@1
403
+ - type: dot_accuracy@3
404
+ value: 0.8695652173913043
405
+ name: Dot Accuracy@3
406
+ - type: dot_accuracy@5
407
+ value: 0.9130434782608695
408
+ name: Dot Accuracy@5
409
+ - type: dot_accuracy@10
410
+ value: 0.9782608695652174
411
+ name: Dot Accuracy@10
412
+ - type: dot_precision@1
413
+ value: 0.7608695652173914
414
+ name: Dot Precision@1
415
+ - type: dot_precision@3
416
+ value: 0.2898550724637682
417
+ name: Dot Precision@3
418
+ - type: dot_precision@5
419
+ value: 0.18260869565217389
420
+ name: Dot Precision@5
421
+ - type: dot_precision@10
422
+ value: 0.0978260869565217
423
+ name: Dot Precision@10
424
+ - type: dot_recall@1
425
+ value: 0.7608695652173914
426
+ name: Dot Recall@1
427
+ - type: dot_recall@3
428
+ value: 0.8695652173913043
429
+ name: Dot Recall@3
430
+ - type: dot_recall@5
431
+ value: 0.9130434782608695
432
+ name: Dot Recall@5
433
+ - type: dot_recall@10
434
+ value: 0.9782608695652174
435
+ name: Dot Recall@10
436
+ - type: dot_ndcg@10
437
+ value: 0.8567216523715442
438
+ name: Dot Ndcg@10
439
+ - type: dot_mrr@10
440
+ value: 0.8190217391304349
441
+ name: Dot Mrr@10
442
+ - type: dot_map@100
443
+ value: 0.8203804347826088
444
+ name: Dot Map@100
445
+ ---
446
+
447
+ # SentenceTransformer based on Snowflake/snowflake-arctic-embed-m
448
+
449
+ This is a [sentence-transformers](https://www.SBERT.net) model finetuned from [Snowflake/snowflake-arctic-embed-m](https://huggingface.co/Snowflake/snowflake-arctic-embed-m). It maps sentences & paragraphs to a 768-dimensional dense vector space and can be used for semantic textual similarity, semantic search, paraphrase mining, text classification, clustering, and more.
450
+
451
+ ## Model Details
452
+
453
+ ### Model Description
454
+ - **Model Type:** Sentence Transformer
455
+ - **Base model:** [Snowflake/snowflake-arctic-embed-m](https://huggingface.co/Snowflake/snowflake-arctic-embed-m) <!-- at revision e2b128b9fa60c82b4585512b33e1544224ffff42 -->
456
+ - **Maximum Sequence Length:** 512 tokens
457
+ - **Output Dimensionality:** 768 tokens
458
+ - **Similarity Function:** Cosine Similarity
459
+ <!-- - **Training Dataset:** Unknown -->
460
+ <!-- - **Language:** Unknown -->
461
+ <!-- - **License:** Unknown -->
462
+
463
+ ### Model Sources
464
+
465
+ - **Documentation:** [Sentence Transformers Documentation](https://sbert.net)
466
+ - **Repository:** [Sentence Transformers on GitHub](https://github.com/UKPLab/sentence-transformers)
467
+ - **Hugging Face:** [Sentence Transformers on Hugging Face](https://huggingface.co/models?library=sentence-transformers)
468
+
469
+ ### Full Model Architecture
470
+
471
+ ```
472
+ SentenceTransformer(
473
+ (0): Transformer({'max_seq_length': 512, 'do_lower_case': False}) with Transformer model: BertModel
474
+ (1): Pooling({'word_embedding_dimension': 768, 'pooling_mode_cls_token': True, 'pooling_mode_mean_tokens': False, 'pooling_mode_max_tokens': False, 'pooling_mode_mean_sqrt_len_tokens': False, 'pooling_mode_weightedmean_tokens': False, 'pooling_mode_lasttoken': False, 'include_prompt': True})
475
+ (2): Normalize()
476
+ )
477
+ ```
478
+
479
+ ## Usage
480
+
481
+ ### Direct Usage (Sentence Transformers)
482
+
483
+ First install the Sentence Transformers library:
484
+
485
+ ```bash
486
+ pip install -U sentence-transformers
487
+ ```
488
+
489
+ Then you can load this model and run inference.
490
+ ```python
491
+ from sentence_transformers import SentenceTransformer
492
+
493
+ # Download from the 🤗 Hub
494
+ model = SentenceTransformer("northstaranlyticsma24/artic_ft_midterm")
495
+ # Run inference
496
+ sentences = [
497
+ 'What topics were discussed during the meetings related to the development of the Blueprint for an AI Bill of Rights?',
498
+ 'SECTION: LISTENING TO THE AMERICAN PEOPLE\nAPPENDIX\n• OSTP conducted meetings with a variety of stakeholders in the private sector and civil society. Some of these\nmeetings were specifically focused on providing ideas related to the development of the Blueprint for an AI\nBill of Rights while others provided useful general context on the positive use cases, potential harms, and/or\noversight possibilities for these technologies.',
499
+ ' \nGAI systems can produce content that is inciting, radicalizing, or threatening, or that glorifies violence, \nwith greater ease and scale than other technologies. LLMs have been reported to generate dangerous or \nviolent recommendations, and some models have generated actionable instructions for dangerous or \n \n \n9 Confabulations of falsehoods are most commonly a problem for text-based outputs; for audio, image, or video \ncontent, creative generation of non-factual content can be a desired behavior. 10 For example, legal confabulations have been shown to be pervasive in current state-of-the-art LLMs. See also, \ne.g., \n \n7 \nunethical behavior.',
500
+ ]
501
+ embeddings = model.encode(sentences)
502
+ print(embeddings.shape)
503
+ # [3, 768]
504
+
505
+ # Get the similarity scores for the embeddings
506
+ similarities = model.similarity(embeddings, embeddings)
507
+ print(similarities.shape)
508
+ # [3, 3]
509
+ ```
510
+
511
+ <!--
512
+ ### Direct Usage (Transformers)
513
+
514
+ <details><summary>Click to see the direct usage in Transformers</summary>
515
+
516
+ </details>
517
+ -->
518
+
519
+ <!--
520
+ ### Downstream Usage (Sentence Transformers)
521
+
522
+ You can finetune this model on your own dataset.
523
+
524
+ <details><summary>Click to expand</summary>
525
+
526
+ </details>
527
+ -->
528
+
529
+ <!--
530
+ ### Out-of-Scope Use
531
+
532
+ *List how the model may foreseeably be misused and address what users ought not to do with the model.*
533
+ -->
534
+
535
+ ## Evaluation
536
+
537
+ ### Metrics
538
+
539
+ #### Information Retrieval
540
+
541
+ * Evaluated with [<code>InformationRetrievalEvaluator</code>](https://sbert.net/docs/package_reference/sentence_transformer/evaluation.html#sentence_transformers.evaluation.InformationRetrievalEvaluator)
542
+
543
+ | Metric | Value |
544
+ |:--------------------|:-----------|
545
+ | cosine_accuracy@1 | 0.7609 |
546
+ | cosine_accuracy@3 | 0.8696 |
547
+ | cosine_accuracy@5 | 0.913 |
548
+ | cosine_accuracy@10 | 0.9783 |
549
+ | cosine_precision@1 | 0.7609 |
550
+ | cosine_precision@3 | 0.2899 |
551
+ | cosine_precision@5 | 0.1826 |
552
+ | cosine_precision@10 | 0.0978 |
553
+ | cosine_recall@1 | 0.7609 |
554
+ | cosine_recall@3 | 0.8696 |
555
+ | cosine_recall@5 | 0.913 |
556
+ | cosine_recall@10 | 0.9783 |
557
+ | cosine_ndcg@10 | 0.8567 |
558
+ | cosine_mrr@10 | 0.819 |
559
+ | **cosine_map@100** | **0.8204** |
560
+ | dot_accuracy@1 | 0.7609 |
561
+ | dot_accuracy@3 | 0.8696 |
562
+ | dot_accuracy@5 | 0.913 |
563
+ | dot_accuracy@10 | 0.9783 |
564
+ | dot_precision@1 | 0.7609 |
565
+ | dot_precision@3 | 0.2899 |
566
+ | dot_precision@5 | 0.1826 |
567
+ | dot_precision@10 | 0.0978 |
568
+ | dot_recall@1 | 0.7609 |
569
+ | dot_recall@3 | 0.8696 |
570
+ | dot_recall@5 | 0.913 |
571
+ | dot_recall@10 | 0.9783 |
572
+ | dot_ndcg@10 | 0.8567 |
573
+ | dot_mrr@10 | 0.819 |
574
+ | dot_map@100 | 0.8204 |
575
+
576
+ <!--
577
+ ## Bias, Risks and Limitations
578
+
579
+ *What are the known or foreseeable issues stemming from this model? You could also flag here known failure cases or weaknesses of the model.*
580
+ -->
581
+
582
+ <!--
583
+ ### Recommendations
584
+
585
+ *What are recommendations with respect to the foreseeable issues? For example, filtering explicit content.*
586
+ -->
587
+
588
+ ## Training Details
589
+
590
+ ### Training Dataset
591
+
592
+ #### Unnamed Dataset
593
+
594
+
595
+ * Size: 363 training samples
596
+ * Columns: <code>sentence_0</code> and <code>sentence_1</code>
597
+ * Approximate statistics based on the first 363 samples:
598
+ | | sentence_0 | sentence_1 |
599
+ |:--------|:---------------------------------------------------------------------------------|:------------------------------------------------------------------------------------|
600
+ | type | string | string |
601
+ | details | <ul><li>min: 2 tokens</li><li>mean: 20.1 tokens</li><li>max: 36 tokens</li></ul> | <ul><li>min: 2 tokens</li><li>mean: 228.97 tokens</li><li>max: 512 tokens</li></ul> |
602
+ * Samples:
603
+ | sentence_0 | sentence_1 |
604
+ |:-----------------------------------------------------------------------------------------------------------------------------------------------------------------------|:-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
605
+ | <code>What are the five principles outlined in the Blueprint for an AI Bill of Rights intended to protect against?</code> | <code>SECTION: USING THIS TECHNICAL COMPANION<br> <br> <br> <br> <br> <br> <br> <br> <br> <br> <br> <br> <br> <br> <br> <br> <br> <br> <br> <br> <br> <br> <br> <br> <br> <br> <br>- <br>USING THIS TECHNICAL COMPANION<br>The Blueprint for an AI Bill of Rights is a set of five principles and associated practices to help guide the design, <br>use, and deployment of automated systems to protect the rights of the American public in the age of artificial <br>intelligence. This technical companion considers each principle in the Blueprint for an AI Bill of Rights and <br>provides examples and concrete steps for communities, industry, governments, and others to take in order to <br>build these protections into policy, practice, or the technological design process. Taken together, the technical protections and practices laid out in the Blueprint for an AI Bill of Rights can help <br>guard the American public against many of the potential and actual harms identified by researchers, technolo­<br>gists, advocates, journalists, policymakers, and communities in the United States and around the world. This <br>technical companion is intended to be used as a reference by people across many circumstances – anyone <br>impacted by automated systems, and anyone developing, designing, deploying, evaluating, or making policy to <br>govern the use of an automated system. Each principle is accompanied by three supplemental sections: <br>1<br>2<br>WHY THIS PRINCIPLE IS IMPORTANT: <br>This section provides a brief summary of the problems that the principle seeks to address and protect against, including <br>illustrative examples. WHAT SHOULD BE EXPECTED OF AUTOMATED SYSTEMS: <br>• The expectations for automated systems are meant to serve as a blueprint for the development of additional technical<br>standards and practices that should be tailored for particular sectors and contexts. • This section outlines practical steps that can be implemented to realize the vision of the Blueprint for an AI Bill of Rights. The <br>expectations laid out often mirror existing practices for technology development, including pre-deployment testing, ongoing <br>monitoring, and governance structures for automated systems, but also go further to address unmet needs for change and offer <br>concrete directions for how those changes can be made. • Expectations about reporting are intended for the entity developing or using the automated system. The resulting reports can <br>be provided to the public, regulators, auditors, industry standards groups, or others engaged in independent review, and should <br>be made public as much as possible consistent with law, regulation, and policy, and noting that intellectual property, law <br>enforcement, or national security considerations may prevent public release. Where public reports are not possible, the <br>information should be provided to oversight bodies and privacy, civil liberties, or other ethics officers charged with safeguard <br>ing individuals’ rights. These reporting expectations are important for transparency, so the American people can have<br>confidence that their rights, opportunities, and access as well as their expectations about technologies are respected. 3<br>HOW THESE PRINCIPLES CAN MOVE INTO PRACTICE: <br>This section provides real-life examples of how these guiding principles can become reality, through laws, policies, and practices. It describes practical technical and sociotechnical approaches to protecting rights, opportunities, and access. The examples provided are not critiques or endorsements, but rather are offered as illustrative cases to help <br>provide a concrete vision for actualizing the Blueprint for an AI Bill of Rights. Effectively implementing these <br>processes require the cooperation of and collaboration among industry, civil society, researchers, policymakers, <br>technologists, and the public.</code> |
606
+ | <code>How does the technical companion suggest that automated systems should be monitored and reported on to ensure transparency and protect individual rights?</code> | <code>SECTION: USING THIS TECHNICAL COMPANION<br> <br> <br> <br> <br> <br> <br> <br> <br> <br> <br> <br> <br> <br> <br> <br> <br> <br> <br> <br> <br> <br> <br> <br> <br> <br> <br>- <br>USING THIS TECHNICAL COMPANION<br>The Blueprint for an AI Bill of Rights is a set of five principles and associated practices to help guide the design, <br>use, and deployment of automated systems to protect the rights of the American public in the age of artificial <br>intelligence. This technical companion considers each principle in the Blueprint for an AI Bill of Rights and <br>provides examples and concrete steps for communities, industry, governments, and others to take in order to <br>build these protections into policy, practice, or the technological design process. Taken together, the technical protections and practices laid out in the Blueprint for an AI Bill of Rights can help <br>guard the American public against many of the potential and actual harms identified by researchers, technolo­<br>gists, advocates, journalists, policymakers, and communities in the United States and around the world. This <br>technical companion is intended to be used as a reference by people across many circumstances – anyone <br>impacted by automated systems, and anyone developing, designing, deploying, evaluating, or making policy to <br>govern the use of an automated system. Each principle is accompanied by three supplemental sections: <br>1<br>2<br>WHY THIS PRINCIPLE IS IMPORTANT: <br>This section provides a brief summary of the problems that the principle seeks to address and protect against, including <br>illustrative examples. WHAT SHOULD BE EXPECTED OF AUTOMATED SYSTEMS: <br>• The expectations for automated systems are meant to serve as a blueprint for the development of additional technical<br>standards and practices that should be tailored for particular sectors and contexts. • This section outlines practical steps that can be implemented to realize the vision of the Blueprint for an AI Bill of Rights. The <br>expectations laid out often mirror existing practices for technology development, including pre-deployment testing, ongoing <br>monitoring, and governance structures for automated systems, but also go further to address unmet needs for change and offer <br>concrete directions for how those changes can be made. • Expectations about reporting are intended for the entity developing or using the automated system. The resulting reports can <br>be provided to the public, regulators, auditors, industry standards groups, or others engaged in independent review, and should <br>be made public as much as possible consistent with law, regulation, and policy, and noting that intellectual property, law <br>enforcement, or national security considerations may prevent public release. Where public reports are not possible, the <br>information should be provided to oversight bodies and privacy, civil liberties, or other ethics officers charged with safeguard <br>ing individuals’ rights. These reporting expectations are important for transparency, so the American people can have<br>confidence that their rights, opportunities, and access as well as their expectations about technologies are respected. 3<br>HOW THESE PRINCIPLES CAN MOVE INTO PRACTICE: <br>This section provides real-life examples of how these guiding principles can become reality, through laws, policies, and practices. It describes practical technical and sociotechnical approaches to protecting rights, opportunities, and access. The examples provided are not critiques or endorsements, but rather are offered as illustrative cases to help <br>provide a concrete vision for actualizing the Blueprint for an AI Bill of Rights. Effectively implementing these <br>processes require the cooperation of and collaboration among industry, civil society, researchers, policymakers, <br>technologists, and the public.</code> |
607
+ | <code>What is the significance of the number 14 in the given context?</code> | <code>14<br></code> |
608
+ * Loss: [<code>MatryoshkaLoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#matryoshkaloss) with these parameters:
609
+ ```json
610
+ {
611
+ "loss": "MultipleNegativesRankingLoss",
612
+ "matryoshka_dims": [
613
+ 768,
614
+ 512,
615
+ 256,
616
+ 128,
617
+ 64
618
+ ],
619
+ "matryoshka_weights": [
620
+ 1,
621
+ 1,
622
+ 1,
623
+ 1,
624
+ 1
625
+ ],
626
+ "n_dims_per_step": -1
627
+ }
628
+ ```
629
+
630
+ ### Training Hyperparameters
631
+ #### Non-Default Hyperparameters
632
+
633
+ - `eval_strategy`: steps
634
+ - `per_device_train_batch_size`: 20
635
+ - `per_device_eval_batch_size`: 20
636
+ - `num_train_epochs`: 5
637
+ - `multi_dataset_batch_sampler`: round_robin
638
+
639
+ #### All Hyperparameters
640
+ <details><summary>Click to expand</summary>
641
+
642
+ - `overwrite_output_dir`: False
643
+ - `do_predict`: False
644
+ - `eval_strategy`: steps
645
+ - `prediction_loss_only`: True
646
+ - `per_device_train_batch_size`: 20
647
+ - `per_device_eval_batch_size`: 20
648
+ - `per_gpu_train_batch_size`: None
649
+ - `per_gpu_eval_batch_size`: None
650
+ - `gradient_accumulation_steps`: 1
651
+ - `eval_accumulation_steps`: None
652
+ - `torch_empty_cache_steps`: None
653
+ - `learning_rate`: 5e-05
654
+ - `weight_decay`: 0.0
655
+ - `adam_beta1`: 0.9
656
+ - `adam_beta2`: 0.999
657
+ - `adam_epsilon`: 1e-08
658
+ - `max_grad_norm`: 1
659
+ - `num_train_epochs`: 5
660
+ - `max_steps`: -1
661
+ - `lr_scheduler_type`: linear
662
+ - `lr_scheduler_kwargs`: {}
663
+ - `warmup_ratio`: 0.0
664
+ - `warmup_steps`: 0
665
+ - `log_level`: passive
666
+ - `log_level_replica`: warning
667
+ - `log_on_each_node`: True
668
+ - `logging_nan_inf_filter`: True
669
+ - `save_safetensors`: True
670
+ - `save_on_each_node`: False
671
+ - `save_only_model`: False
672
+ - `restore_callback_states_from_checkpoint`: False
673
+ - `no_cuda`: False
674
+ - `use_cpu`: False
675
+ - `use_mps_device`: False
676
+ - `seed`: 42
677
+ - `data_seed`: None
678
+ - `jit_mode_eval`: False
679
+ - `use_ipex`: False
680
+ - `bf16`: False
681
+ - `fp16`: False
682
+ - `fp16_opt_level`: O1
683
+ - `half_precision_backend`: auto
684
+ - `bf16_full_eval`: False
685
+ - `fp16_full_eval`: False
686
+ - `tf32`: None
687
+ - `local_rank`: 0
688
+ - `ddp_backend`: None
689
+ - `tpu_num_cores`: None
690
+ - `tpu_metrics_debug`: False
691
+ - `debug`: []
692
+ - `dataloader_drop_last`: False
693
+ - `dataloader_num_workers`: 0
694
+ - `dataloader_prefetch_factor`: None
695
+ - `past_index`: -1
696
+ - `disable_tqdm`: False
697
+ - `remove_unused_columns`: True
698
+ - `label_names`: None
699
+ - `load_best_model_at_end`: False
700
+ - `ignore_data_skip`: False
701
+ - `fsdp`: []
702
+ - `fsdp_min_num_params`: 0
703
+ - `fsdp_config`: {'min_num_params': 0, 'xla': False, 'xla_fsdp_v2': False, 'xla_fsdp_grad_ckpt': False}
704
+ - `fsdp_transformer_layer_cls_to_wrap`: None
705
+ - `accelerator_config`: {'split_batches': False, 'dispatch_batches': None, 'even_batches': True, 'use_seedable_sampler': True, 'non_blocking': False, 'gradient_accumulation_kwargs': None}
706
+ - `deepspeed`: None
707
+ - `label_smoothing_factor`: 0.0
708
+ - `optim`: adamw_torch
709
+ - `optim_args`: None
710
+ - `adafactor`: False
711
+ - `group_by_length`: False
712
+ - `length_column_name`: length
713
+ - `ddp_find_unused_parameters`: None
714
+ - `ddp_bucket_cap_mb`: None
715
+ - `ddp_broadcast_buffers`: False
716
+ - `dataloader_pin_memory`: True
717
+ - `dataloader_persistent_workers`: False
718
+ - `skip_memory_metrics`: True
719
+ - `use_legacy_prediction_loop`: False
720
+ - `push_to_hub`: False
721
+ - `resume_from_checkpoint`: None
722
+ - `hub_model_id`: None
723
+ - `hub_strategy`: every_save
724
+ - `hub_private_repo`: False
725
+ - `hub_always_push`: False
726
+ - `gradient_checkpointing`: False
727
+ - `gradient_checkpointing_kwargs`: None
728
+ - `include_inputs_for_metrics`: False
729
+ - `eval_do_concat_batches`: True
730
+ - `fp16_backend`: auto
731
+ - `push_to_hub_model_id`: None
732
+ - `push_to_hub_organization`: None
733
+ - `mp_parameters`:
734
+ - `auto_find_batch_size`: False
735
+ - `full_determinism`: False
736
+ - `torchdynamo`: None
737
+ - `ray_scope`: last
738
+ - `ddp_timeout`: 1800
739
+ - `torch_compile`: False
740
+ - `torch_compile_backend`: None
741
+ - `torch_compile_mode`: None
742
+ - `dispatch_batches`: None
743
+ - `split_batches`: None
744
+ - `include_tokens_per_second`: False
745
+ - `include_num_input_tokens_seen`: False
746
+ - `neftune_noise_alpha`: None
747
+ - `optim_target_modules`: None
748
+ - `batch_eval_metrics`: False
749
+ - `eval_on_start`: False
750
+ - `eval_use_gather_object`: False
751
+ - `batch_sampler`: batch_sampler
752
+ - `multi_dataset_batch_sampler`: round_robin
753
+
754
+ </details>
755
+
756
+ ### Training Logs
757
+ | Epoch | Step | cosine_map@100 |
758
+ |:------:|:----:|:--------------:|
759
+ | 1.0 | 19 | 0.7434 |
760
+ | 2.0 | 38 | 0.7973 |
761
+ | 2.6316 | 50 | 0.8048 |
762
+ | 3.0 | 57 | 0.8048 |
763
+ | 4.0 | 76 | 0.8204 |
764
+ | 5.0 | 95 | 0.8204 |
765
+
766
+
767
+ ### Framework Versions
768
+ - Python: 3.10.12
769
+ - Sentence Transformers: 3.1.1
770
+ - Transformers: 4.44.2
771
+ - PyTorch: 2.4.1+cu121
772
+ - Accelerate: 0.34.2
773
+ - Datasets: 3.0.0
774
+ - Tokenizers: 0.19.1
775
+
776
+ ## Citation
777
+
778
+ ### BibTeX
779
+
780
+ #### Sentence Transformers
781
+ ```bibtex
782
+ @inproceedings{reimers-2019-sentence-bert,
783
+ title = "Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks",
784
+ author = "Reimers, Nils and Gurevych, Iryna",
785
+ booktitle = "Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing",
786
+ month = "11",
787
+ year = "2019",
788
+ publisher = "Association for Computational Linguistics",
789
+ url = "https://arxiv.org/abs/1908.10084",
790
+ }
791
+ ```
792
+
793
+ #### MatryoshkaLoss
794
+ ```bibtex
795
+ @misc{kusupati2024matryoshka,
796
+ title={Matryoshka Representation Learning},
797
+ author={Aditya Kusupati and Gantavya Bhatt and Aniket Rege and Matthew Wallingford and Aditya Sinha and Vivek Ramanujan and William Howard-Snyder and Kaifeng Chen and Sham Kakade and Prateek Jain and Ali Farhadi},
798
+ year={2024},
799
+ eprint={2205.13147},
800
+ archivePrefix={arXiv},
801
+ primaryClass={cs.LG}
802
+ }
803
+ ```
804
+
805
+ #### MultipleNegativesRankingLoss
806
+ ```bibtex
807
+ @misc{henderson2017efficient,
808
+ title={Efficient Natural Language Response Suggestion for Smart Reply},
809
+ author={Matthew Henderson and Rami Al-Rfou and Brian Strope and Yun-hsuan Sung and Laszlo Lukacs and Ruiqi Guo and Sanjiv Kumar and Balint Miklos and Ray Kurzweil},
810
+ year={2017},
811
+ eprint={1705.00652},
812
+ archivePrefix={arXiv},
813
+ primaryClass={cs.CL}
814
+ }
815
+ ```
816
+
817
+ <!--
818
+ ## Glossary
819
+
820
+ *Clearly define terms in order to be accessible across audiences.*
821
+ -->
822
+
823
+ <!--
824
+ ## Model Card Authors
825
+
826
+ *Lists the people who create the model card, providing recognition and accountability for the detailed work that goes into its construction.*
827
+ -->
828
+
829
+ <!--
830
+ ## Model Card Contact
831
+
832
+ *Provides a way for people who have updates to the Model Card, suggestions, or questions, to contact the Model Card authors.*
833
+ -->
config.json ADDED
@@ -0,0 +1,26 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "_name_or_path": "Snowflake/snowflake-arctic-embed-m",
3
+ "architectures": [
4
+ "BertModel"
5
+ ],
6
+ "attention_probs_dropout_prob": 0.1,
7
+ "classifier_dropout": null,
8
+ "gradient_checkpointing": false,
9
+ "hidden_act": "gelu",
10
+ "hidden_dropout_prob": 0.1,
11
+ "hidden_size": 768,
12
+ "initializer_range": 0.02,
13
+ "intermediate_size": 3072,
14
+ "layer_norm_eps": 1e-12,
15
+ "max_position_embeddings": 512,
16
+ "model_type": "bert",
17
+ "num_attention_heads": 12,
18
+ "num_hidden_layers": 12,
19
+ "pad_token_id": 0,
20
+ "position_embedding_type": "absolute",
21
+ "torch_dtype": "float32",
22
+ "transformers_version": "4.44.2",
23
+ "type_vocab_size": 2,
24
+ "use_cache": true,
25
+ "vocab_size": 30522
26
+ }
config_sentence_transformers.json ADDED
@@ -0,0 +1,12 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "__version__": {
3
+ "sentence_transformers": "3.1.1",
4
+ "transformers": "4.44.2",
5
+ "pytorch": "2.4.1+cu121"
6
+ },
7
+ "prompts": {
8
+ "query": "Represent this sentence for searching relevant passages: "
9
+ },
10
+ "default_prompt_name": null,
11
+ "similarity_fn_name": null
12
+ }
model.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:f96549a570b75d4b3c26ac5407c8e086249419d61a1cacf22f9ff5177ab471ec
3
+ size 435588776
modules.json ADDED
@@ -0,0 +1,20 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ [
2
+ {
3
+ "idx": 0,
4
+ "name": "0",
5
+ "path": "",
6
+ "type": "sentence_transformers.models.Transformer"
7
+ },
8
+ {
9
+ "idx": 1,
10
+ "name": "1",
11
+ "path": "1_Pooling",
12
+ "type": "sentence_transformers.models.Pooling"
13
+ },
14
+ {
15
+ "idx": 2,
16
+ "name": "2",
17
+ "path": "2_Normalize",
18
+ "type": "sentence_transformers.models.Normalize"
19
+ }
20
+ ]
sentence_bert_config.json ADDED
@@ -0,0 +1,4 @@
 
 
 
 
 
1
+ {
2
+ "max_seq_length": 512,
3
+ "do_lower_case": false
4
+ }
special_tokens_map.json ADDED
@@ -0,0 +1,37 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "cls_token": {
3
+ "content": "[CLS]",
4
+ "lstrip": false,
5
+ "normalized": false,
6
+ "rstrip": false,
7
+ "single_word": false
8
+ },
9
+ "mask_token": {
10
+ "content": "[MASK]",
11
+ "lstrip": false,
12
+ "normalized": false,
13
+ "rstrip": false,
14
+ "single_word": false
15
+ },
16
+ "pad_token": {
17
+ "content": "[PAD]",
18
+ "lstrip": false,
19
+ "normalized": false,
20
+ "rstrip": false,
21
+ "single_word": false
22
+ },
23
+ "sep_token": {
24
+ "content": "[SEP]",
25
+ "lstrip": false,
26
+ "normalized": false,
27
+ "rstrip": false,
28
+ "single_word": false
29
+ },
30
+ "unk_token": {
31
+ "content": "[UNK]",
32
+ "lstrip": false,
33
+ "normalized": false,
34
+ "rstrip": false,
35
+ "single_word": false
36
+ }
37
+ }
tokenizer.json ADDED
The diff for this file is too large to render. See raw diff
 
tokenizer_config.json ADDED
@@ -0,0 +1,62 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "added_tokens_decoder": {
3
+ "0": {
4
+ "content": "[PAD]",
5
+ "lstrip": false,
6
+ "normalized": false,
7
+ "rstrip": false,
8
+ "single_word": false,
9
+ "special": true
10
+ },
11
+ "100": {
12
+ "content": "[UNK]",
13
+ "lstrip": false,
14
+ "normalized": false,
15
+ "rstrip": false,
16
+ "single_word": false,
17
+ "special": true
18
+ },
19
+ "101": {
20
+ "content": "[CLS]",
21
+ "lstrip": false,
22
+ "normalized": false,
23
+ "rstrip": false,
24
+ "single_word": false,
25
+ "special": true
26
+ },
27
+ "102": {
28
+ "content": "[SEP]",
29
+ "lstrip": false,
30
+ "normalized": false,
31
+ "rstrip": false,
32
+ "single_word": false,
33
+ "special": true
34
+ },
35
+ "103": {
36
+ "content": "[MASK]",
37
+ "lstrip": false,
38
+ "normalized": false,
39
+ "rstrip": false,
40
+ "single_word": false,
41
+ "special": true
42
+ }
43
+ },
44
+ "clean_up_tokenization_spaces": true,
45
+ "cls_token": "[CLS]",
46
+ "do_lower_case": true,
47
+ "mask_token": "[MASK]",
48
+ "max_length": 512,
49
+ "model_max_length": 512,
50
+ "pad_to_multiple_of": null,
51
+ "pad_token": "[PAD]",
52
+ "pad_token_type_id": 0,
53
+ "padding_side": "right",
54
+ "sep_token": "[SEP]",
55
+ "stride": 0,
56
+ "strip_accents": null,
57
+ "tokenize_chinese_chars": true,
58
+ "tokenizer_class": "BertTokenizer",
59
+ "truncation_side": "right",
60
+ "truncation_strategy": "longest_first",
61
+ "unk_token": "[UNK]"
62
+ }
vocab.txt ADDED
The diff for this file is too large to render. See raw diff