m7n commited on
Commit
479a734
·
verified ·
1 Parent(s): cdc1cd2

Add new SentenceTransformer model

Browse files
1_Pooling/config.json ADDED
@@ -0,0 +1,10 @@
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "word_embedding_dimension": 768,
3
+ "pooling_mode_cls_token": false,
4
+ "pooling_mode_mean_tokens": true,
5
+ "pooling_mode_max_tokens": false,
6
+ "pooling_mode_mean_sqrt_len_tokens": false,
7
+ "pooling_mode_weightedmean_tokens": false,
8
+ "pooling_mode_lasttoken": false,
9
+ "include_prompt": true
10
+ }
README.md ADDED
@@ -0,0 +1,761 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ tags:
3
+ - sentence-transformers
4
+ - sentence-similarity
5
+ - feature-extraction
6
+ - generated_from_trainer
7
+ - dataset_size:43494
8
+ - loss:TripletLoss
9
+ base_model: allenai/specter2_aug2023refresh_base
10
+ widget:
11
+ - source_sentence: As a result of technological progress, environmental aspects and
12
+ social change, the automotive industry is undergoing a radical transformation.
13
+ The focus is no longer on the product "vehicle" but much more on the mobility
14
+ service itself and the users individual experience and well-being during travel
15
+ time. In that field of innovation, the study deals with a explorative investigation
16
+ of using the travel time for a improvement of the mental health of the passenger.
17
+ The vision is to integrate breathwork relaxation in combination with a human centric
18
+ lighting scenario as an immersive service within luxury ride-hailing vehicles
19
+ to enhance the mental health during automated rides and utilizing the time spent
20
+ in cars for personal pleasure. To enable a user-centered and experimental approach,
21
+ a test vehicle from the non-profit company bq.Labs was equipped with the bq breath
22
+ work app and a spezialized LED-based lighting screen that was developed by Fraunhofer.
23
+ The effects were tested on randomly selected and voluntary users in a guerrilla
24
+ testing at three different locations in San Diego. The tests explored user acceptance
25
+ of the innovative technologies by combining surveys, vital data collection, qualitative
26
+ interviews and observations. Initial data analysis provides insights into the
27
+ feasibility and potential effects on well-being and user perception. The study
28
+ illustrates those innovations in the field of mobility, involve systemic dependencies
29
+ and considerations beyond technology, encompassing social and psychological dimensions.
30
+ It underscores that successful innovations require a holistic, user-centered approach
31
+ that considers technological, social, and psychological dimensions. The findings
32
+ lay the groundwork for future research and development of innovation strategies
33
+ in the evolving field of mobility and personalized strength.
34
+ sentences:
35
+ - 'We examined casual decision-making among a group of participants, which frequently
36
+ occurs in daily life. In such a situation, participants do not have strong preferences
37
+ for the decision. In addition, because the process of decision-making among people
38
+ is part of the time they spend together, it is important to feel enjoyment in
39
+ the process and satisfaction with the final decision. In this paper, we propose
40
+ a game mechanism for generating a sense of enjoyment in the decision-making process
41
+ through communication and a sense of acceptance of the final decision. We experimentally
42
+ compared two ways to make decisions about beverages: ) majority voting and ) the
43
+ proposed game. In the latter case, the participants enjoyed playing the game and
44
+ were satisfied with the decision-making process.'
45
+ - This paper presents several important factors affecting the resale prices of used
46
+ rental cars. In fact, this paper empirically shows and proves several conjectures
47
+ regarding the determinants for used rental car resale values through the use of
48
+ detailed micro data from one of the biggest rental car companies. Specifically,
49
+ the age of a used car has two composite effects on its resale value, even though
50
+ overall the two effects work negatively with a concavity, as rental cars ages.
51
+ On the other hand, two mileage variables interact with each other and produce
52
+ overall decreasing effects on the resale prices with the opposite interactions.
53
+ In terms of the effects of brand image, Hyundai and Renault-Samsung have positive
54
+ effects on resale values generally. Ssangyong has a positive effect on the resale
55
+ values in the SUV category, and Kia and GM-Daewoo are generally inferior to the
56
+ other brands in terms of resale values in all categories. In terms of seasonal
57
+ effects, we can conclude that this paper confirms the general perception regarding
58
+ seasonal effects on resale values. In details, from November to February, resale
59
+ values are affected negatively, and March is the recovering month of increasing
60
+ demand in the used car market. August seems to be the highest season for the used
61
+ car market due to several demand increases. As a result, this paper plays an important
62
+ role in providing a substantial amount of information on the factors affecting
63
+ the resale prices of rental cars.
64
+ - In this paper we present an approach used to enhance students' competency in software
65
+ verification. Students were asked to apply software verification techniques to
66
+ a complex formal specification system. The complexity of the system stems from
67
+ its sophisticated requirements. Selecting such system for this study was intentional
68
+ for the following two reasons ) the system is difficult to understand and analyze
69
+ because of the domain knowledge required to generate formal specifications in
70
+ temporal logic and ) the system is large and complex which lends itself to a wide
71
+ range of applicable verification techniques, and thus highlights the differences
72
+ in the capabilities of each of the software verification approaches. Students
73
+ were assessed using multiple criteria including; examination in applying learned
74
+ techniques, students' attitude toward the technique, perceived efficiency of the
75
+ techniques in discovering software defects, and the ability of the technique to
76
+ locate errors in the code beyond simply indicating their presence. The results
77
+ of this work show that the students applied the learned techniques successfully
78
+ and their attitudes towards software verification improved.
79
+ - source_sentence: EnglishThe literature has argued that, contrary to what claimed
80
+ by the rational economic theory, trade unions have progressively moved towards
81
+ the representation of atypical workers by adopting more inclusive strategies of
82
+ collective bargaining. The strength and modalities of such strategies are affected
83
+ by national institutions of labour market and company-level union representation
84
+ to which trade unions can draw in workplaces. Within this context, still remain
85
+ to be discovered how the aforementioned institutions are enacted and in what subjects
86
+ of employment relations can be used by unions in order to protect atypical work.
87
+ This paper deals with these issues. It analyzes how unions have used distinctive
88
+ institutional factors with regards of both external and interna! flexibility and
89
+ in reference to regular and temporary workers to be able to improve the working
90
+ conditions of atypical work. Trade unions negotiated a promotion system to permanent
91
+ positions and allowed temporary workers to develop the same skills acquired by
92
+ regular employees, which were also beneficial for permanent workers' employment
93
+ conditions. The defense of regular workers' employrnent conditions was crucial
94
+ in order to maintain an inclusive strategy of collective bargaining. italianoIntroduzione.
95
+ - Contesto istituzionale e mobilizazione delle risorse. - Flessibilita ed interessi
96
+ della forza del lavoro atipica e regolare. - Disegno e metodo della ricerca. -
97
+ La strategia di contrattazione colletiva inclusiva fra flessibilita esterna ed
98
+ interna. - Analisi e discussione. - Conclusioni.
99
+ sentences:
100
+ - Much has been invested in big data and artificial intelligence-based solutions
101
+ for healthcare. However, few applications have been implemented in clinical practice.
102
+ Early economic evaluations can help to improve decision-making by developers of
103
+ analytics underlying these solutions aiming to increase the likelihood of successful
104
+ implementation, but recommendations about their use are lacking. The aim of this
105
+ study was to develop and apply a framework that positions best practice methods
106
+ for economic evaluations alongside development of analytics, thereby enabling
107
+ developers to identify barriers to success and to select analytics worth further
108
+ investments.
109
+ - An in situ field test on nine commonly-used soil water sensors was carried out
110
+ in a sandy loam soil located in the Potato Research Center, Fredericton, NB (Canada)
111
+ using the gravimetric method as a reference. The results showed that among the
112
+ tested sensors, regardless of installation depths and soil water regimes, CS000,
113
+ Trase, and Troxler performed the best with the factory calibrations, with a relative
114
+ root mean square error (RRMSE) of , , and %, and a r( ) of , , and , respectively.
115
+ TRIME, Moisture Point (MP000), and Gopher performed slightly worse with the factory
116
+ calibrations, with a RRMSE of , , and %, and a r( ) of , , and , respectively,
117
+ while the Gypsum, WaterMark, and Netafim showed a frequent need for calibration
118
+ in the application in this region.
119
+ - 'The article proposes a comparison between British devolution and Italian one,
120
+ both have occurred at about the same time (from the end of Nineties years until
121
+ now), looking what is common in devolution process inside two cultural and institutional
122
+ context deeply different. About the constitutional innovation, British and Italian
123
+ political systems know different method to pass a reform: in British system, Westminster
124
+ parliament is sovereign not only in ordinary law-making but above all in constitutional
125
+ matter (this is the meaning of parliament sovereignty in Dicey''s thought); in
126
+ Italian system, constitutional power isn''t on the same degree of ordinary law,
127
+ because the parliament makes ordinary law and an ad hoc convention makes the constitution
128
+ (or at least its fundamental reforms), as it''s in French tradition. In spite
129
+ of so, it''s possible to see a common element in British and Italian devolution,
130
+ on the side of its limits: that is the difficult to compatible the post-centralistic
131
+ state and its fiscal autonomy with the universalistic principles of welfare state.
132
+ This may be one of the mains challenge that Western states will have to face,
133
+ looking for a new political balances for the new era that follows the cold war
134
+ end.'
135
+ - source_sentence: 'Objective Compared with povidone iodine solution to clean,glutaraldehyde
136
+ immersion,highpressure steam sterilization of three disinfection methods for dental
137
+ handpiece sterilization effect.Methods Select the dental clinical used over phone,were
138
+ randomly divided into A group,B group,C group,D group ,A group for the control
139
+ group,only cleaning method did not use any disinfection,B group was % Polyvinylpyrrolidone
140
+ iodine solution,wipe,C group were soaked in % glutaraldehyde soluton,D group were
141
+ treated with high-pressure steam sterilization,after each phone were inoculated
142
+ with bacteria sample dish,the more monitoring of four groups of bacterial culture.Results
143
+ A group of bacterial culture for the intensive growth of bacteria,B group had
144
+ bacterial growth for the(+ ~+ + +),bacterial growth;C group had bacterial culturegrowth
145
+ was(+ ~ + +),bacterial growth;D group of bacterial culture Growth of(-),no bacterial
146
+ growth.Conclusion High-pressure steam sterilization was the disinfection of dental
147
+ handpieces most effective way. Key words: Disinfection;Dental high-speed equipment'
148
+ sentences:
149
+ - 'Objective To study the self-locking brackets SmartclipTM 0MXTM MBTTM brackets
150
+ and traditional pain comparison.Methods patients with non-extraction orthodontic
151
+ treatment were randomly divided into two groups,a group treated with self-locking
152
+ brackets,the other group treated with traditional care slot.Patients in orthodontic
153
+ treatment of pain within a week were inoestigated by way of a questionnaire survey,including
154
+ orthodontic pain,soft tissue irritation,and the strength of a normal life for
155
+ patients with the impact.Results Questionnaire response rate was %.The level of
156
+ pain was similar in self-ligating bracket group and the traditional bracket group.However,time-related,including
157
+ pain after orthodontic treatment was 0h,0 d time,the most intense pain and continued
158
+ to 0d,back pain relief,0w about pain relief.Conclusion Self-locking brackets and
159
+ brackets have noobvious pain intensity differences,but related with orthodontic
160
+ force to the clinical use of force should pay attention to light. Key words: Self-ligating
161
+ bracket;Traditional brackets;Orthodontic treatment;Pain'
162
+ - Simulation of inflorescences is an important part of virtual plant growth. The
163
+ past works about simulation of inflorescences focus mainly on how to generate
164
+ inflorescences by However, the productions of L system are difficult to understand
165
+ and implement since it is described with rule based language, and especially it
166
+ needs too many parameters in simulating inflorescence development and flowering
167
+ sequences. Dual scale automaton is a plant growth model based on plant growth
168
+ mechanisms, which is easy to understand and implement in programming. In this
169
+ paper, the method of simulating inflorescence using dual scale automaton model
170
+ is discussed. The dual scale automaton model is improved by introducing the rule
171
+ of synchronization development, mechanism of reiteration and delay law of plant
172
+ growth from the viewpoints of botany,which make it possible to generate almost
173
+ all types of the inflorescences defined by botanists, and to simulate acropetal
174
+ and basipetal flowering sequences. Several examples of simulation of typical inflorescences
175
+ are given for explaining the theory. The improved model is demonstrated a simpler
176
+ but more effective method in simulating inflorescences in comparison with L system.
177
+ - 'I N late and through the summer of , the York County Court launched a concerted
178
+ attack against Quakers in its part of Massachusetts.* York county magistrate Richard
179
+ Waldron arrested three visiting Quaker women and had them beaten out of the jurisdiction;
180
+ then, apparently in response to his recommendations, the court proceeded to cite
181
+ local Quakers living in Kittery for their failure to attend orthodox church services.l
182
+ Although Waldron''s actions did not occur until five years after the General Court
183
+ had begun its program of suppressing Quakerism, they seemed consistent with the
184
+ general pattern of persecution in seventeenth-century Massachusetts Bay: the discovery
185
+ of heterodoxy followed by an immediate attempt to produce local conformity.0 Yet
186
+ the considerable delay in Kittery in attacking Quakerism and the lack of any subsequent
187
+ systematic effort to produce conformity raise a number of questions regarding
188
+ the extent to which prejudice against heterodoxy was the sole motive for suppressing
189
+ Quakerism in Massachusetts. Between and the York County Court attacked Quaker
190
+ heterodoxy only on a limited number of occasions. Each of the incidents suggests
191
+ that Kittery Quakers were punished not because their religious beliefs offended
192
+ the court but because those beliefs denoted certain positions on secular issues,
193
+ and men like Waldron could employ ecclesiastical sanctions to enlist sources'
194
+ - source_sentence: Androgen therapy is the mainstay of treatment for the hypogonadotropic
195
+ hypogonadal micropenis because it obviously enhances penis growth in prepubescent
196
+ microphallic patients. However, the molecular mechanisms of androgen treatment
197
+ leading to penis growth are still largely unknown. To clarify this well-known
198
+ phenomenon, we successfully generated a castrated male Sprague Dawley rat model
199
+ at puberty followed by testosterone administration. Interestingly, compared with
200
+ the control group, testosterone treatment stimulated a dose-dependent increase
201
+ of penis weight, length, and width in castrated rats accompanied with a dramatic
202
+ recovery of the pathological changes of the penis. Mechanistically, testosterone
203
+ administration substantially increased the expression of androgen receptor (AR)
204
+ protein. Increased AR protein in the penis could subsequently initiate transcription
205
+ of its target genes, including keratin 00B (Krt00b). Importantly, we demonstrated
206
+ that KRT00B is generally expressed in the rat penis and that most KRT00B expression
207
+ is cytoplasmic. Furthermore, AR could directly modulate its expression by binding
208
+ to a putative androgen response element sequence of the Krt00b promoter. Overall,
209
+ this study reveals a novel mechanism facilitating penis growth after testosterone
210
+ treatment in precastrated prepubescent animals, in which androgen enhances the
211
+ expression of AR protein as well as its target genes, such as Krt00b.
212
+ sentences:
213
+ - This study develops statistical learning models to assess the probability of undergraduate
214
+ students graduating within a predetermined period, utilizing admission, performance,
215
+ and demographic data. The urgency of addressing student attrition is highlighted
216
+ by recent data from the National Center for Education Statistics (NCES), indicating
217
+ a % completion rate by full-time undergraduates within six years. This research
218
+ leverages institutional data from a Saudi University, focusing on freshmen enrolled
219
+ in the - and - academic years, to identify students at risk of dropping out, thereby
220
+ enabling timely interventions. Ten algorithms, including decision trees, ensemble
221
+ models, SVM, and ANN, were built and evaluated on a test set representing % of
222
+ the entire dataset using precision, recall, accuracy, and Matthews correlation
223
+ coefficient (MCC). The findings show that SVM and Random Forest models were the
224
+ most reliable, achieving accuracies of and respectively, and maintaining balance
225
+ in precision, recall, and MCC. Conversely, the naive Bayes model recorded the
226
+ worst performance. The comparative analysis revealed the superior performance
227
+ of ensemble models over decision tree models in predicting student attrition,
228
+ emphasizing the importance of model selection in developing effective early intervention
229
+ strategies. In addition, our analysis revealed that academic data is a better
230
+ predictor of on-time graduation than admission data, emphasizing the need for
231
+ institutions to focus on continuous academic assessment data.
232
+ - 'Timely and accurate prediction of human movement in urban areas offers instructive
233
+ insights into transportation management, public safety, and location-based services,
234
+ to name a few. Yet, modeling urban mobility is challenging and complex because
235
+ of the spatiotemporal dynamics of movement behavior and the influence of exogenous
236
+ factors such as weather, holidays, and local events. In this paper, we use bus
237
+ transportation as a proxy to mine spatiotemporal travel patterns. We propose a
238
+ deep-learning-based urban mobility prediction model that collectively forecasts
239
+ passenger flows between pairs of city regions in an origin-destination (OD) matrix.
240
+ We first process OD matrices in a convolutional neural network to capture spatial
241
+ correlations. Intermediate results are reconstructed into three multivariate time
242
+ series: hourly, daily, and weekly time series. Each time series is aggregated
243
+ in a long short-term memory (LSTM) network with a novel attention mechanism to
244
+ guide the aggregation. In addition, our model is context-aware by using contextual
245
+ embeddings learned from exogenous factors. We dynamically merge results from LSTM
246
+ components and context embeddings in a late fusion network to make a final prediction.
247
+ The proposed model is implemented and evaluated using a large-scale transportation
248
+ data set of more than million bus trips with a suite of Big Data technologies
249
+ developed for data processing. Through performance comparison, we show that our
250
+ approach achieves sizable accuracy improvements in urban mobility prediction.
251
+ Our work has major implications for efficient transportation system design and
252
+ performance improvement. The proposed deep neural network structure is generally
253
+ applicable for sequential graph data prediction.'
254
+ - Various methods are currently under investigation to preserve fertility in males
255
+ treated with high-dose chemotherapy and radiation for malignant and nonmalignant
256
+ disorders. Human umbilical cord mesenchymal stem cells (HUC-MSCs), which possess
257
+ potent immunosuppressive function and secrete various cytokines and growth factors,
258
+ have the potential clinical applications. As a potential alternative, we investigate
259
+ whether injection of HUC-MSCs into the interstitial compartment of the testes
260
+ to promote spermatogenic regeneration efficiently. HUC-MSCs were isolated from
261
+ different sources of umbilical cords and injected into the interstitial space
262
+ of one testis from busulfan-treated mice (saline and HEK000 cells injections were
263
+ performed in a separate set of mice) and the other testis remained uninjected.
264
+ Three weeks after MSCs injection, Relative quantitative reverse transcription
265
+ polymerase chain reaction was used to identify the expression of of germ cell
266
+ associated, which are all related to meiosis, demonstrated higher levels of spermatogenic
267
+ gene expression ( fold) in HUC-MSCs injected testes compared to the contralateral
268
+ uninjected testes (five mice). Protein levels for germ cell-specific genes, miwi,
269
+ vasa and synaptonemal complex protein (Scp0) were also higher in MSC-treated testes
270
+ compared to injected controls weeks after treatment. However, no different expression
271
+ was detected in saline water and HEK000 cells injection control group. We have
272
+ demonstrated HUC-MSCs could affect mouse germ cell-specific genes expression.
273
+ The results also provide a possibility that the transplanted HUC-MSCs may promote
274
+ the recovery of spermatogenesis. This study provides further evidence for preclinical
275
+ therapeutic effects of HUC-MSCs, and explores a new approach to the treatment
276
+ of azoospermia.
277
+ - source_sentence: Twenty one surviving infants of pregnancies complicated by rupture
278
+ of the membranes during the second trimester that lasted at least one week have
279
+ been followed up for a median of months. Five infants ( %) had recurrent respiratory
280
+ problems (episodes of wheezing and coughing occurring at least once a week) which
281
+ related significantly to the use of neonatal ventilation and to very preterm delivery.
282
+ Five of the infants who were born preterm and with birth weights of less than
283
+ g had recurrent respiratory symptoms ( %). This compares favourably with an incidence
284
+ of symptoms of % among surviving low birthweight infants born at this hospital
285
+ after pregnancies not complicated by premature rupture of the membranes. Neither
286
+ recurrent respiratory symptoms nor admission to hospital for chest related disorders
287
+ were associated with the timing of onset or duration of rupture of the membranes.
288
+ We conclude that, among survivors of premature rupture of the membranes, chronic
289
+ respiratory morbidity would best be prevented by avoiding very preterm delivery,
290
+ regardless of the duration of the rupture.
291
+ sentences:
292
+ - We report a case of prosthetic valve nocardia endocarditis. A year old farmer
293
+ underwent aortic valve replacement with a bioprosthetic valve. The immediate post-operative
294
+ course was uneventful but weeks later he developed fever. A trans-oesophageal
295
+ echocardiogram (TEE) showed a string like structure attached to the prosthetic
296
+ valve. Blood cultures grew N. farcinica. He was initially treated with trimethoprim/sulfamethoxazole
297
+ (TMP/SMZ), but due to eosinophilia and leucopenia his treatment was changed to
298
+ imipenem and amikacin. He developed a rash, presumed to be due to imipenem, which
299
+ was then substituted with linezolid. He completed a week course of intravenous
300
+ (i.v.) antibiotics. Desensitization with amoxicillin/clavulanic acid was successful
301
+ and the patient received oral amoxicillin/clavulanic acid for months. At present,
302
+ months from diagnosis, he is afebrile and TEE is normal. To our knowledge, this
303
+ case is the fifth reported case of successful treatment of prosthetic valve nocardia
304
+ endocarditis treated without surgery.
305
+ - 'The effect of different variants of compiling integrated samples for biochemical
306
+ oxygen demand (BOD) kinetics was studied in long-term experiments (up to days)
307
+ with water samples taken from the central deep-water region of Lake Onego. It
308
+ was a series of experiments carried out simultaneously at and in different seasons
309
+ of . Five sampling variants were employed with different horizon combinations:
310
+ near surface, near bottom, from different depths in the water column, from the
311
+ photic and profundal layers. Two experiments were performed with winter water,
312
+ three with summer water, four with autumn water, and seven experiments with spring
313
+ water. The most representative sample for studying BOD in long-term experiments
314
+ is an sample composed of water from different horizons of the photic layer ( m).
315
+ For each variant of integrated sample composition, BOD development in the experiments
316
+ was modeled by a corresponding kinetic equation whose parameters represented the
317
+ oxidation characteristics of components of the organic matter present in the water
318
+ and transformed in the long-term BOD experiment. The resultant kinetic parameters
319
+ of BOD were analyzed in relation to the factors determining the final oxidation
320
+ of the organic matter components. The patterns in which the type of BOD development
321
+ is formed depend on the integrated water sample collection/compilation conditions
322
+ and are characterized by the average values of the organic matter contained in
323
+ the water, estimated either analytically or from empirical equations, as well
324
+ as by the temperature of exposure of water samples in the experiment. Synthesis
325
+ of the resultant information showed that the values of BOD kinetic parameters
326
+ were generally lower in spring water taken from the central part of Lake Onego
327
+ as compared with other seasons, since the oxidation potential of organic matter
328
+ components in spring water is higher.'
329
+ - Doppler ultrasound measurements of pulmonary blood flow in babies with severe
330
+ respiratory distress syndrome treated in a randomised controlled trial of surfactant
331
+ replacement showed that the immediate improvement of oxygenation was not associated
332
+ with a significant increase in pulmonary blood flow. Reduction in ventilator settings
333
+ and increases in the extent of chest wall movements measured by a cardiorespiratory
334
+ monitor suggested that the improvement after surfactant had been given was a result
335
+ of alveolar stabilisation and increased pulmonary compliance. Further simultaneous
336
+ studies of pulmonary blood flow and pulmonary compliance are needed to confirm
337
+ these findings.
338
+ pipeline_tag: sentence-similarity
339
+ library_name: sentence-transformers
340
+ metrics:
341
+ - cosine_accuracy
342
+ model-index:
343
+ - name: SentenceTransformer based on allenai/specter2_aug2023refresh_base
344
+ results:
345
+ - task:
346
+ type: triplet
347
+ name: Triplet
348
+ dataset:
349
+ name: discipline tuned specter 2 022
350
+ type: discipline-tuned_specter_2_022
351
+ metrics:
352
+ - type: cosine_accuracy
353
+ value: 0.9713793103448276
354
+ name: Cosine Accuracy
355
+ - task:
356
+ type: triplet
357
+ name: Triplet
358
+ dataset:
359
+ name: discipline tuned specter 2 024
360
+ type: discipline-tuned_specter_2_024
361
+ metrics:
362
+ - type: cosine_accuracy
363
+ value: 0.9710344827586207
364
+ name: Cosine Accuracy
365
+ ---
366
+
367
+ # SentenceTransformer based on allenai/specter2_aug2023refresh_base
368
+
369
+ This is a [sentence-transformers](https://www.SBERT.net) model finetuned from [allenai/specter2_aug2023refresh_base](https://huggingface.co/allenai/specter2_aug2023refresh_base). It maps sentences & paragraphs to a 768-dimensional dense vector space and can be used for semantic textual similarity, semantic search, paraphrase mining, text classification, clustering, and more.
370
+
371
+ ## Model Details
372
+
373
+ ### Model Description
374
+ - **Model Type:** Sentence Transformer
375
+ - **Base model:** [allenai/specter2_aug2023refresh_base](https://huggingface.co/allenai/specter2_aug2023refresh_base) <!-- at revision 084e9624d354a1cbc464ef6cc1e3646d236b95d9 -->
376
+ - **Maximum Sequence Length:** 512 tokens
377
+ - **Output Dimensionality:** 768 dimensions
378
+ - **Similarity Function:** Cosine Similarity
379
+ <!-- - **Training Dataset:** Unknown -->
380
+ <!-- - **Language:** Unknown -->
381
+ <!-- - **License:** Unknown -->
382
+
383
+ ### Model Sources
384
+
385
+ - **Documentation:** [Sentence Transformers Documentation](https://sbert.net)
386
+ - **Repository:** [Sentence Transformers on GitHub](https://github.com/UKPLab/sentence-transformers)
387
+ - **Hugging Face:** [Sentence Transformers on Hugging Face](https://huggingface.co/models?library=sentence-transformers)
388
+
389
+ ### Full Model Architecture
390
+
391
+ ```
392
+ SentenceTransformer(
393
+ (0): Transformer({'max_seq_length': 512, 'do_lower_case': False}) with Transformer model: BertModel
394
+ (1): Pooling({'word_embedding_dimension': 768, 'pooling_mode_cls_token': False, 'pooling_mode_mean_tokens': True, 'pooling_mode_max_tokens': False, 'pooling_mode_mean_sqrt_len_tokens': False, 'pooling_mode_weightedmean_tokens': False, 'pooling_mode_lasttoken': False, 'include_prompt': True})
395
+ (2): Normalize()
396
+ )
397
+ ```
398
+
399
+ ## Usage
400
+
401
+ ### Direct Usage (Sentence Transformers)
402
+
403
+ First install the Sentence Transformers library:
404
+
405
+ ```bash
406
+ pip install -U sentence-transformers
407
+ ```
408
+
409
+ Then you can load this model and run inference.
410
+ ```python
411
+ from sentence_transformers import SentenceTransformer
412
+
413
+ # Download from the 🤗 Hub
414
+ model = SentenceTransformer("m7n/discipline-tuned_specter_2_024")
415
+ # Run inference
416
+ sentences = [
417
+ 'Twenty one surviving infants of pregnancies complicated by rupture of the membranes during the second trimester that lasted at least one week have been followed up for a median of months. Five infants ( %) had recurrent respiratory problems (episodes of wheezing and coughing occurring at least once a week) which related significantly to the use of neonatal ventilation and to very preterm delivery. Five of the infants who were born preterm and with birth weights of less than g had recurrent respiratory symptoms ( %). This compares favourably with an incidence of symptoms of % among surviving low birthweight infants born at this hospital after pregnancies not complicated by premature rupture of the membranes. Neither recurrent respiratory symptoms nor admission to hospital for chest related disorders were associated with the timing of onset or duration of rupture of the membranes. We conclude that, among survivors of premature rupture of the membranes, chronic respiratory morbidity would best be prevented by avoiding very preterm delivery, regardless of the duration of the rupture.',
418
+ 'Doppler ultrasound measurements of pulmonary blood flow in babies with severe respiratory distress syndrome treated in a randomised controlled trial of surfactant replacement showed that the immediate improvement of oxygenation was not associated with a significant increase in pulmonary blood flow. Reduction in ventilator settings and increases in the extent of chest wall movements measured by a cardiorespiratory monitor suggested that the improvement after surfactant had been given was a result of alveolar stabilisation and increased pulmonary compliance. Further simultaneous studies of pulmonary blood flow and pulmonary compliance are needed to confirm these findings.',
419
+ 'The effect of different variants of compiling integrated samples for biochemical oxygen demand (BOD) kinetics was studied in long-term experiments (up to days) with water samples taken from the central deep-water region of Lake Onego. It was a series of experiments carried out simultaneously at and in different seasons of . Five sampling variants were employed with different horizon combinations: near surface, near bottom, from different depths in the water column, from the photic and profundal layers. Two experiments were performed with winter water, three with summer water, four with autumn water, and seven experiments with spring water. The most representative sample for studying BOD in long-term experiments is an sample composed of water from different horizons of the photic layer ( m). For each variant of integrated sample composition, BOD development in the experiments was modeled by a corresponding kinetic equation whose parameters represented the oxidation characteristics of components of the organic matter present in the water and transformed in the long-term BOD experiment. The resultant kinetic parameters of BOD were analyzed in relation to the factors determining the final oxidation of the organic matter components. The patterns in which the type of BOD development is formed depend on the integrated water sample collection/compilation conditions and are characterized by the average values of the organic matter contained in the water, estimated either analytically or from empirical equations, as well as by the temperature of exposure of water samples in the experiment. Synthesis of the resultant information showed that the values of BOD kinetic parameters were generally lower in spring water taken from the central part of Lake Onego as compared with other seasons, since the oxidation potential of organic matter components in spring water is higher.',
420
+ ]
421
+ embeddings = model.encode(sentences)
422
+ print(embeddings.shape)
423
+ # [3, 768]
424
+
425
+ # Get the similarity scores for the embeddings
426
+ similarities = model.similarity(embeddings, embeddings)
427
+ print(similarities.shape)
428
+ # [3, 3]
429
+ ```
430
+
431
+ <!--
432
+ ### Direct Usage (Transformers)
433
+
434
+ <details><summary>Click to see the direct usage in Transformers</summary>
435
+
436
+ </details>
437
+ -->
438
+
439
+ <!--
440
+ ### Downstream Usage (Sentence Transformers)
441
+
442
+ You can finetune this model on your own dataset.
443
+
444
+ <details><summary>Click to expand</summary>
445
+
446
+ </details>
447
+ -->
448
+
449
+ <!--
450
+ ### Out-of-Scope Use
451
+
452
+ *List how the model may foreseeably be misused and address what users ought not to do with the model.*
453
+ -->
454
+
455
+ ## Evaluation
456
+
457
+ ### Metrics
458
+
459
+ #### Triplet
460
+
461
+ * Datasets: `discipline-tuned_specter_2_022` and `discipline-tuned_specter_2_024`
462
+ * Evaluated with [<code>TripletEvaluator</code>](https://sbert.net/docs/package_reference/sentence_transformer/evaluation.html#sentence_transformers.evaluation.TripletEvaluator)
463
+
464
+ | Metric | discipline-tuned_specter_2_022 | discipline-tuned_specter_2_024 |
465
+ |:--------------------|:-------------------------------|:-------------------------------|
466
+ | **cosine_accuracy** | **0.9714** | **0.971** |
467
+
468
+ <!--
469
+ ## Bias, Risks and Limitations
470
+
471
+ *What are the known or foreseeable issues stemming from this model? You could also flag here known failure cases or weaknesses of the model.*
472
+ -->
473
+
474
+ <!--
475
+ ### Recommendations
476
+
477
+ *What are recommendations with respect to the foreseeable issues? For example, filtering explicit content.*
478
+ -->
479
+
480
+ ## Training Details
481
+
482
+ ### Training Dataset
483
+
484
+ #### Unnamed Dataset
485
+
486
+
487
+ * Size: 43,494 training samples
488
+ * Columns: <code>anchor</code>, <code>positive</code>, and <code>negative</code>
489
+ * Approximate statistics based on the first 1000 samples:
490
+ | | anchor | positive | negative |
491
+ |:--------|:-------------------------------------------------------------------------------------|:-------------------------------------------------------------------------------------|:-------------------------------------------------------------------------------------|
492
+ | type | string | string | string |
493
+ | details | <ul><li>min: 80 tokens</li><li>mean: 232.53 tokens</li><li>max: 512 tokens</li></ul> | <ul><li>min: 81 tokens</li><li>mean: 230.16 tokens</li><li>max: 512 tokens</li></ul> | <ul><li>min: 86 tokens</li><li>mean: 229.66 tokens</li><li>max: 512 tokens</li></ul> |
494
+ * Samples:
495
+ | anchor | positive | negative |
496
+ |:---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
497
+ | <code>Lupus nephritis (LN) is one of the major risk factors for morbidity and overall mortality in systemic lupus erythematosus (SLE). Its pathogenesis is multifactorial, and a number of risk factors, including serological markers, have been identified in recent years, correlating with clinical course and disease severity. Furthermore, a distinctive autoantibody profile has recently been reported in African- American SLE women with LN. The aim of this study was to characterize the autoantibody profile in African-American SLE patients, with LN and without. Only anti-dsDNA achieved statistical significance between the two groups (P < ). Fourteen ( %) patients with LN and ( %) without it exhibited positive anti-Ro/SS-A, anti-Sm, and anti-nRNP, but without anti-La/SS B (P > ). We conclude that African-American SLE patients with LN do not exhibit a specific or distinctive autoantibody profile. However, our data confirm the value of anti-dsDNA in SLE patients with LN.</code> | <code>TRIM00 is a member of the tripartite motif family proteins and is one of the autoantigens which react with anti-SS-A antibody (Ab) present in sera of patients with systemic lupus erythematosus (SLE) and Sjogren's syndrome. Previous studies have shown that TRIM00 dysfunction promotes aberrant B-cell differentiation and Ab production in SLE, and anti-TRIM00 Ab may be related to the TRIM00 dysfunction in human SLE pathogenesis. Here, we examined the relationship between anti-TRIM00 Ab and clinical and immunological characteristics in SLE patients.Twenty-seven patients with SLE ( women and four men) before immunosuppressive therapies, who fulfilled the revised American College of Rheumatology criteria for SLE, and four healthy controls ( women and one man) were enrolled in the study. SLE patients were divided into two groups according to the seropositivity for anti-TRIM00 Ab. Serum anti-TRIM00 Ab levels were measured using enzyme-linked immunosorbent assays. The serum levels of cytokines a...</code> | <code>We construct a stochastic model of real estate pricing. The method of the pricing construction is based on a sequential comparison of the supply prices. We proof that under standard assumptions imposed upon the comparison coefficients there exists an unique non-degenerated limit in distribution and this limit has the lognormal law of distribution. The accordance of empirical distributions of prices to thetheoretically obtained log-normal distribution we verify by numerous statistical data of real estate prices from Saint-Petersburg (Russia). For establishing this accordance we essentially apply the efficient and sensitive test of fit of Kolmogorov-Smirnov. Basing on "The Russian Federal Estimation Standard N0", we conclude that the most probable price, i.e. mode of distribution, is correctly and uniquely defined under the log-normal approximation. Since the mean value of log-normal distribution exceeds the mode - most probable value, it follows that the prices valued by the mathematica...</code> |
498
+ | <code>A laboratory prototype of an enzyme biosensor based on pHsensitive field-effect transistors has been developed to determine the total content of indole alkaloids in Rauwolfia serpentina Benth. Ex Kurz tissue culture. The biosensor was characterized by high sensitivity to th A laboratory prototype of an enzyme biosensor based on pHsensitive field effect transistors has been developed to determine the total content of indole alkaloids in Rauwolfia serpentina Benth. Ex Kurz tissue culture. The biosensor was characterized by high sensitivity to the total content of indole alkaloids (minimum limit of determination g/ml of the total content of indole alkaloids contained in the juice obtained from tissue culture of Rauwolfia serpentina). The linear range of biosensor determination of the analyte was from to g / ml of the total content of indole alkaloids. Analysis of indole alkaloids using a biosensor is simple and fast and does not require expensive equipment and special sample preparation f...</code> | <code>A procedure of separate biosensor analysis of the multicomponent sample with aflatoxins and pesticides has been developed and optimized. Biosensor determination of aflatoxins and pesticides was performed using enzyme inhibition analysis. For creation of bioselective element we used enzyme acetylcholinesterase which is co-immobilized with bovine serum albumin on the surface of potentiometric transducer by glutaraldehyde covalent crosslinking. As transducers were pH-sensitive field effect transistors. The concentration of acetylcholine chloride as a substrate for subsequent inhibition analysis was fit; optimal time of inhibition by toxins solution was determinate together with concentration of reactivator (pyridine- -aldoxymmethyliodyd) and time of enzyme reactivation after inhibition. A synergism between trichlorfon and aflatoxin B0 in inhibition of immobilized on a surface pH-sensitive field-effect transistors acetylcholinesterase was investigated. The proposed procedure allows selecti...</code> | <code>Objective: To observe the effect of modified Zhenwu decoction on blood glucose and blood lipid of experimental diabetic rats.Methods: Diabetic model rats randomly were divided into normal control group,diabetic modeling group,modified Zhenwu decoction group.Establish intraperitoneal injection of Streptozotocin diabetic animal models by,after eight weeks blood glucose and blood lipids were detrmined.Results: After the treatment by modified Zhenwu decoction,blood glucose,blood lipid and other indicators improved significantly.Conclusion: Modified Zhenwu decotion can improve the level of renal lower blood glucose and lipid in diabetic rats.</code> |
499
+ | <code>In two successive years ( and ), a set of commercial sugar beet cultivars was established in Randomized Complete Block experiments at two sites in central Greece. Cultivar combination was different between years, but not between sites. Leaf sampling took place once during the growing season and leaf area, LA [cm0], leaf midvein length, L [cm] and maximum leaf width, W [cm] were determined using an image analysis system. Leaf parameters were mainly affected by cultivars. Leaf dimensions and their squares (L0, W0) did not provide an accurate model for LA predictions. Using LW as an independent variable, a quadratic model (y = x0 - x + , r = , p< , n = ) provided the most accurate estimation of LA. With compromises in accuracy, the linear relationship between LW and LA (y = x + , r = , p< , n = ) could be used as a prediction model thanks to its simplicity.</code> | <code>The general increase in temperature, together with sudden episodes of extreme temperatures, are increasingly impacting plant species in the present climate change scenario. Limoniastrum monopetalum is a halophyte from the Mediterranean Basin, exposed to broad daily and seasonal changes in temperature and extreme high temperatures. We studied the photosynthetic responses (chlorophyll fluorescence dynamics and gas exchange) of L. monopetalum leaves exposed to temperatures from .0C to .0C under darkness in controlled laboratory conditions. L. monopetalum presented its optimum temperature for photosynthesis around +00C. The photosynthetic apparatus of L. monopetalum exhibited permanent damages at > .0C. L. monopetalum tolerated, without permanent damages, temperatures as low as .0C in darkness. L. monopetalum appears as a plant species very well adapted to the seasonality of the Mediterranean climate, which may work as a pre-adaptation to stand more extreme temperatures in the actual conte...</code> | <code>The article depicts direct and hidden (implicit and explicit) information giving in advertisement discourse, meaning advertising slogans. Having investigated this topic thoroughly, the author found out that cognitive types of presupposition and communicative implicatures played a great role in advertising slogans. There are definitions of phenomena "implicit" and "explicit" with examples. The cognitive types of presupposition (semantic and pragmatic) and their typology is discussed in the article. There is a possibility to figure out what strategy of communicative influence on human's cognition is. Some laws of neurolinguistic programming is also discussed.</code> |
500
+ * Loss: [<code>TripletLoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#tripletloss) with these parameters:
501
+ ```json
502
+ {
503
+ "distance_metric": "TripletDistanceMetric.COSINE",
504
+ "triplet_margin": 0.4
505
+ }
506
+ ```
507
+
508
+ ### Evaluation Dataset
509
+
510
+ #### Unnamed Dataset
511
+
512
+
513
+ * Size: 2,174 evaluation samples
514
+ * Columns: <code>anchor</code>, <code>positive</code>, and <code>negative</code>
515
+ * Approximate statistics based on the first 1000 samples:
516
+ | | anchor | positive | negative |
517
+ |:--------|:-------------------------------------------------------------------------------------|:-------------------------------------------------------------------------------------|:-------------------------------------------------------------------------------------|
518
+ | type | string | string | string |
519
+ | details | <ul><li>min: 83 tokens</li><li>mean: 235.71 tokens</li><li>max: 512 tokens</li></ul> | <ul><li>min: 82 tokens</li><li>mean: 234.64 tokens</li><li>max: 512 tokens</li></ul> | <ul><li>min: 86 tokens</li><li>mean: 225.92 tokens</li><li>max: 512 tokens</li></ul> |
520
+ * Samples:
521
+ | anchor | positive | negative |
522
+ |:---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
523
+ | <code>In Organic Law / of 0rd October of the general arrangement of the educational system (LOGSE), the educational system includes the general regime education and the special regime education. Dance is included in the special regime as part of the artistic disciplines together with music, drama, the plastic arts and design. The aim of this article is to analyse the treatment given to Dance in the general regime. Thus, we will try to emphasize the inconsistency that exists between the areas of primary education, which will be obligatory and will have a global and integrated character, and the training of future teachers.</code> | <code>This work aims to analyze the treatment of health education in school textbooks during the period , and to compare it with the one that is conducted at present. It will attempt to verify how many current concepts and ideas were already present in those decades. In addition, the differences in the way of carrying out health education then and now will be outlined, especially those referred to pedagogic strategies and didactic materials. All this will be done from a double perspective: . The concept of health, hygiene and pedagogy of health education. . The program contents of health education in the didactic materials.</code> | <code>The vane-in-cup (VIC) geometry has been widely used for the rheological characterization of yield-stress fluids because it minimizes slip effects at the liquid/solid interface of the rotating geometry and reduces sample damage during the loading process. However, severe kinematic limitations arising from the spatial complexity of mixed shear and extensional flow have been identified for quantitative rheometrical measurements in complex fluids. Recently, vanes with fractal cross sections have been suggested as alternatives for accurate rheometry of elastoviscoplastic fluids. In this work, the steady fractal vane-in-cup (fVIC) flow of a Newtonian fluid and a nonthixotropic Carbopol®️ microgel as well as the unsteady flow of a thixotropic -Carrageenan gel are analyzed using rheo-particle image velocimetry (Rheo-PIV). We describe the velocity distributions in all cases and show that the fVIC produces an almost axisymmetric flow field and rotation rate-independent "effective radius" when us...</code> |
524
+ | <code>An ultrahigh vacuum three-axis cryogenic sample manipulator suitable for angle-resolved photoelectron spectroscopy experiments was developed. The sample manipulator is constructed by combining three modules with translation, polar rotation, and azimuthal-tilt rotation capabilities. Polar rotation and the azimuthal-tilt rotation are performed using a differentially pumped rotary stage and a sample goniometer, respectively. Continuous rotation around the polar axis is possible. The sample goniometer is capable of azimuthal rotation of up to and tilt rotation from to , measured from the plane normal to the polar axis. Nonmagnetic materials are used near the sample holder of the goniometer. The sample holder can be cooled using a continuous-flow cryostat. To serve as a radiation shield, the lower portion of the goniometer surrounding the sample holder is cooled separately by another cell filled with liquid nitrogen. With liquid nitrogen or liquid helium for the cryostat, the sample holder ...</code> | <code>In the soft x-ray region below keV, various electron yield (EY) techniques have been employed in x-ray absorption fine structure (XAFS) measurements of bulk materials. The fluorescent x-ray yield (FY) is also utilized for samples of low concentration. Although FY becomes much smaller for lighter elements, it has several advantages compared with EY to measure XAFS spectra; for example, a higher signal-to-background ratio and applicability to insulating materials. However, it has been thought to be unsuitable for concentrated samples due to a self-absorption effect. In this report, the sampling depth and self-absorption effect for bulk concentrated samples are discussed concerning XAFS measurements in a few keV energy region. Some typical FY XAFS spectra of concentrated materials, including insulators, are presented.</code> | <code>To investigate the distribution characteristics of TCM syndromes and the related herbal prescriptions for malignant tumors (MT). A clinical database of the TCM syndromes and the herbal prescriptions in treatment of MT patients were established. The data were then analyzed using cluster and frequency analysis. According to the cluster analysis, the TCM syndromes in MT patients mainly included two patterns: deficiency of both Qi and Yin and internal accumulation of toxic heat. The commonly-prescribed herbs were Huangqi (Astraglus), Nuzhenzi (Fructus Ligustri Lucidi), Lingzhi (Ganoderma Lucidum), Huaishan (Dioscorea Opposita), Xiakucao (Prunella Vulgaris), and Baihuasheshecao (Herba Hedyotidis). Deficiency of Qi and Yin is the primary syndrome of MT, and internal accumulation of toxic heat is the secondary syndrome. The herbs for Qi supplementation and Yin nourishment are mainly used, with the assistance of herbs for heat-clearance and detoxification.</code> |
525
+ | <code>Abstract Abstract Worldwide opposition to different aspects of globalisation indicates the emergence of a global social movement that typically targets the international bodies that regulate global trade and global finance, as well as the regulations themselves. The significance of the movement calls for a synthetic analysis that moves beyond the currently used fragmentary descriptions. A more profound conceptual framework will enable researchers to better understand the full dynamic of the movement within its global context In this article we explore the possibilities of applying David Korten's ideal-typical notion of fourth generation development to the anti-globalisation movement. We ask whether anti-globalisation organisation exhibits so-called Fourth Generation characteristics and activities. Our goal is to determine the extent to which the movement as a whole, and the individual organisations which constitute it, conform to the fourth generation development conceptual framework. ...</code> | <code>Abstract Globalisation is a complex, multi-faceted, phenomenon with widely contested meanings. While it has roots in the history of colonialism, capitalist development and imperialism, there are strong indications that what we are witnessing, since the 0000s, is a qualitative break with the past. Old boundaries, categories and meanings are being challenged in profound ways. New forms of exploitation and subjugation emerge in such a way that stark brutal force coexists with and may be increasingly supplanted by more subtle, pervasive forces of hegemonic rule. The latter, however, has opened up new terrains of struggle for people, movements, and governments opposed to one-dimensional 'corporate globalisation', seeking instead the globalisation of social and environmental justice. A continent like Africa much of which has sunk deeper into a 'fourth world' status of extreme under-development, social instability and neo-colonial dependence faces stark choices. Does it seek to partially or f...</code> | <code>So much has been written about the nation vis-a-vis other fields in the humanities, literature in particular. My interest in dance lies in its peculiar location within and vis-a-vis the discourse of the nation. An ephemeral form, dance has elicited various, and even contradictory, valuations; most of the time it is considered a mere form of entertainment. It is undeniable, though, that dance has articulated and informed our ideas of the nation and nationhood. In this paper, I explore how three contemporary dance companies based in Quezon City (The University of the Philippines Dance Company, Airdance, and Dance Forum) have rendered their imaginings of the Philippine nation. I focus on Philippine contemporary dance because as a cultural practice, I believe that it has choreographed the many trajectories and issues embodied in the Philippines's imagining of itself. A number of choreographies by the three companies mobilize motifs, forms, structures, and styles that constitute and signify...</code> |
526
+ * Loss: [<code>TripletLoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#tripletloss) with these parameters:
527
+ ```json
528
+ {
529
+ "distance_metric": "TripletDistanceMetric.COSINE",
530
+ "triplet_margin": 0.4
531
+ }
532
+ ```
533
+
534
+ ### Training Hyperparameters
535
+ #### Non-Default Hyperparameters
536
+
537
+ - `eval_strategy`: steps
538
+ - `per_device_train_batch_size`: 4
539
+ - `per_device_eval_batch_size`: 32
540
+ - `learning_rate`: 7e-06
541
+ - `weight_decay`: 0.01
542
+ - `num_train_epochs`: 1
543
+ - `warmup_ratio`: 0.5
544
+ - `fp16`: True
545
+ - `batch_sampler`: no_duplicates
546
+
547
+ #### All Hyperparameters
548
+ <details><summary>Click to expand</summary>
549
+
550
+ - `overwrite_output_dir`: False
551
+ - `do_predict`: False
552
+ - `eval_strategy`: steps
553
+ - `prediction_loss_only`: True
554
+ - `per_device_train_batch_size`: 4
555
+ - `per_device_eval_batch_size`: 32
556
+ - `per_gpu_train_batch_size`: None
557
+ - `per_gpu_eval_batch_size`: None
558
+ - `gradient_accumulation_steps`: 1
559
+ - `eval_accumulation_steps`: None
560
+ - `torch_empty_cache_steps`: None
561
+ - `learning_rate`: 7e-06
562
+ - `weight_decay`: 0.01
563
+ - `adam_beta1`: 0.9
564
+ - `adam_beta2`: 0.999
565
+ - `adam_epsilon`: 1e-08
566
+ - `max_grad_norm`: 1.0
567
+ - `num_train_epochs`: 1
568
+ - `max_steps`: -1
569
+ - `lr_scheduler_type`: linear
570
+ - `lr_scheduler_kwargs`: {}
571
+ - `warmup_ratio`: 0.5
572
+ - `warmup_steps`: 0
573
+ - `log_level`: passive
574
+ - `log_level_replica`: warning
575
+ - `log_on_each_node`: True
576
+ - `logging_nan_inf_filter`: True
577
+ - `save_safetensors`: True
578
+ - `save_on_each_node`: False
579
+ - `save_only_model`: False
580
+ - `restore_callback_states_from_checkpoint`: False
581
+ - `no_cuda`: False
582
+ - `use_cpu`: False
583
+ - `use_mps_device`: False
584
+ - `seed`: 42
585
+ - `data_seed`: None
586
+ - `jit_mode_eval`: False
587
+ - `use_ipex`: False
588
+ - `bf16`: False
589
+ - `fp16`: True
590
+ - `fp16_opt_level`: O1
591
+ - `half_precision_backend`: auto
592
+ - `bf16_full_eval`: False
593
+ - `fp16_full_eval`: False
594
+ - `tf32`: None
595
+ - `local_rank`: 0
596
+ - `ddp_backend`: None
597
+ - `tpu_num_cores`: None
598
+ - `tpu_metrics_debug`: False
599
+ - `debug`: []
600
+ - `dataloader_drop_last`: False
601
+ - `dataloader_num_workers`: 0
602
+ - `dataloader_prefetch_factor`: None
603
+ - `past_index`: -1
604
+ - `disable_tqdm`: False
605
+ - `remove_unused_columns`: True
606
+ - `label_names`: None
607
+ - `load_best_model_at_end`: False
608
+ - `ignore_data_skip`: False
609
+ - `fsdp`: []
610
+ - `fsdp_min_num_params`: 0
611
+ - `fsdp_config`: {'min_num_params': 0, 'xla': False, 'xla_fsdp_v2': False, 'xla_fsdp_grad_ckpt': False}
612
+ - `fsdp_transformer_layer_cls_to_wrap`: None
613
+ - `accelerator_config`: {'split_batches': False, 'dispatch_batches': None, 'even_batches': True, 'use_seedable_sampler': True, 'non_blocking': False, 'gradient_accumulation_kwargs': None}
614
+ - `deepspeed`: None
615
+ - `label_smoothing_factor`: 0.0
616
+ - `optim`: adamw_torch
617
+ - `optim_args`: None
618
+ - `adafactor`: False
619
+ - `group_by_length`: False
620
+ - `length_column_name`: length
621
+ - `ddp_find_unused_parameters`: None
622
+ - `ddp_bucket_cap_mb`: None
623
+ - `ddp_broadcast_buffers`: False
624
+ - `dataloader_pin_memory`: True
625
+ - `dataloader_persistent_workers`: False
626
+ - `skip_memory_metrics`: True
627
+ - `use_legacy_prediction_loop`: False
628
+ - `push_to_hub`: False
629
+ - `resume_from_checkpoint`: None
630
+ - `hub_model_id`: None
631
+ - `hub_strategy`: every_save
632
+ - `hub_private_repo`: None
633
+ - `hub_always_push`: False
634
+ - `gradient_checkpointing`: False
635
+ - `gradient_checkpointing_kwargs`: None
636
+ - `include_inputs_for_metrics`: False
637
+ - `include_for_metrics`: []
638
+ - `eval_do_concat_batches`: True
639
+ - `fp16_backend`: auto
640
+ - `push_to_hub_model_id`: None
641
+ - `push_to_hub_organization`: None
642
+ - `mp_parameters`:
643
+ - `auto_find_batch_size`: False
644
+ - `full_determinism`: False
645
+ - `torchdynamo`: None
646
+ - `ray_scope`: last
647
+ - `ddp_timeout`: 1800
648
+ - `torch_compile`: False
649
+ - `torch_compile_backend`: None
650
+ - `torch_compile_mode`: None
651
+ - `dispatch_batches`: None
652
+ - `split_batches`: None
653
+ - `include_tokens_per_second`: False
654
+ - `include_num_input_tokens_seen`: False
655
+ - `neftune_noise_alpha`: None
656
+ - `optim_target_modules`: None
657
+ - `batch_eval_metrics`: False
658
+ - `eval_on_start`: False
659
+ - `use_liger_kernel`: False
660
+ - `eval_use_gather_object`: False
661
+ - `average_tokens_across_devices`: False
662
+ - `prompts`: None
663
+ - `batch_sampler`: no_duplicates
664
+ - `multi_dataset_batch_sampler`: proportional
665
+
666
+ </details>
667
+
668
+ ### Training Logs
669
+ | Epoch | Step | Training Loss | Validation Loss | discipline-tuned_specter_2_022_cosine_accuracy | discipline-tuned_specter_2_024_cosine_accuracy |
670
+ |:------:|:----:|:-------------:|:---------------:|:----------------------------------------------:|:----------------------------------------------:|
671
+ | 0.0023 | 25 | 0.2976 | 0.2980 | 0.9518 | - |
672
+ | 0.0046 | 50 | 0.3008 | 0.2969 | 0.9518 | - |
673
+ | 0.0069 | 75 | 0.3088 | 0.2953 | 0.9524 | - |
674
+ | 0.0092 | 100 | 0.3047 | 0.2929 | 0.9530 | - |
675
+ | 0.0115 | 125 | 0.2879 | 0.2897 | 0.9530 | - |
676
+ | 0.0138 | 150 | 0.2705 | 0.2855 | 0.9532 | - |
677
+ | 0.0161 | 175 | 0.2771 | 0.2804 | 0.9536 | - |
678
+ | 0.0184 | 200 | 0.2737 | 0.2744 | 0.9548 | - |
679
+ | 0.0207 | 225 | 0.2737 | 0.2676 | 0.9553 | - |
680
+ | 0.0230 | 250 | 0.2569 | 0.2600 | 0.9557 | - |
681
+ | 0.0253 | 275 | 0.2518 | 0.2512 | 0.9579 | - |
682
+ | 0.0276 | 300 | 0.2445 | 0.2416 | 0.9580 | - |
683
+ | 0.0299 | 325 | 0.2214 | 0.2310 | 0.9591 | - |
684
+ | 0.0322 | 350 | 0.2359 | 0.2204 | 0.9606 | - |
685
+ | 0.0345 | 375 | 0.2072 | 0.2090 | 0.9615 | - |
686
+ | 0.0368 | 400 | 0.1907 | 0.1976 | 0.9618 | - |
687
+ | 0.0391 | 425 | 0.1881 | 0.1850 | 0.9624 | - |
688
+ | 0.0414 | 450 | 0.1842 | 0.1733 | 0.9637 | - |
689
+ | 0.0437 | 475 | 0.1618 | 0.1628 | 0.9646 | - |
690
+ | 0.0460 | 500 | 0.1638 | 0.1533 | 0.9645 | - |
691
+ | 0.0483 | 525 | 0.1569 | 0.1440 | 0.9648 | - |
692
+ | 0.0506 | 550 | 0.1473 | 0.1354 | 0.9657 | - |
693
+ | 0.0529 | 575 | 0.1333 | 0.1281 | 0.9671 | - |
694
+ | 0.0552 | 600 | 0.1481 | 0.1223 | 0.9671 | - |
695
+ | 0.0575 | 625 | 0.1263 | 0.1167 | 0.9675 | - |
696
+ | 0.0598 | 650 | 0.114 | 0.1120 | 0.9684 | - |
697
+ | 0.0621 | 675 | 0.1097 | 0.1081 | 0.9693 | - |
698
+ | 0.0644 | 700 | 0.1152 | 0.1044 | 0.9698 | - |
699
+ | 0.0667 | 725 | 0.1009 | 0.0999 | 0.9705 | - |
700
+ | 0.0690 | 750 | 0.0895 | 0.0961 | 0.9709 | - |
701
+ | 0.0713 | 775 | 0.0855 | 0.0934 | 0.9711 | - |
702
+ | 0.0736 | 800 | 0.0853 | 0.0912 | 0.9715 | - |
703
+ | 0.0759 | 825 | 0.0942 | 0.0885 | 0.9714 | - |
704
+ | 0.0782 | 850 | 0.1035 | - | - | 0.9710 |
705
+
706
+
707
+ ### Framework Versions
708
+ - Python: 3.10.12
709
+ - Sentence Transformers: 3.3.1
710
+ - Transformers: 4.49.0.dev0
711
+ - PyTorch: 2.5.1+cu121
712
+ - Accelerate: 1.2.1
713
+ - Datasets: 3.2.0
714
+ - Tokenizers: 0.21.0
715
+
716
+ ## Citation
717
+
718
+ ### BibTeX
719
+
720
+ #### Sentence Transformers
721
+ ```bibtex
722
+ @inproceedings{reimers-2019-sentence-bert,
723
+ title = "Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks",
724
+ author = "Reimers, Nils and Gurevych, Iryna",
725
+ booktitle = "Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing",
726
+ month = "11",
727
+ year = "2019",
728
+ publisher = "Association for Computational Linguistics",
729
+ url = "https://arxiv.org/abs/1908.10084",
730
+ }
731
+ ```
732
+
733
+ #### TripletLoss
734
+ ```bibtex
735
+ @misc{hermans2017defense,
736
+ title={In Defense of the Triplet Loss for Person Re-Identification},
737
+ author={Alexander Hermans and Lucas Beyer and Bastian Leibe},
738
+ year={2017},
739
+ eprint={1703.07737},
740
+ archivePrefix={arXiv},
741
+ primaryClass={cs.CV}
742
+ }
743
+ ```
744
+
745
+ <!--
746
+ ## Glossary
747
+
748
+ *Clearly define terms in order to be accessible across audiences.*
749
+ -->
750
+
751
+ <!--
752
+ ## Model Card Authors
753
+
754
+ *Lists the people who create the model card, providing recognition and accountability for the detailed work that goes into its construction.*
755
+ -->
756
+
757
+ <!--
758
+ ## Model Card Contact
759
+
760
+ *Provides a way for people who have updates to the Model Card, suggestions, or questions, to contact the Model Card authors.*
761
+ -->
config.json ADDED
@@ -0,0 +1,31 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "_name_or_path": "allenai/specter2_aug2023refresh_base",
3
+ "adapters": {
4
+ "adapters": {},
5
+ "config_map": {},
6
+ "fusion_config_map": {},
7
+ "fusions": {}
8
+ },
9
+ "architectures": [
10
+ "BertModel"
11
+ ],
12
+ "attention_probs_dropout_prob": 0.1,
13
+ "classifier_dropout": null,
14
+ "hidden_act": "gelu",
15
+ "hidden_dropout_prob": 0.1,
16
+ "hidden_size": 768,
17
+ "initializer_range": 0.02,
18
+ "intermediate_size": 3072,
19
+ "layer_norm_eps": 1e-12,
20
+ "max_position_embeddings": 512,
21
+ "model_type": "bert",
22
+ "num_attention_heads": 12,
23
+ "num_hidden_layers": 12,
24
+ "pad_token_id": 0,
25
+ "position_embedding_type": "absolute",
26
+ "torch_dtype": "float32",
27
+ "transformers_version": "4.49.0.dev0",
28
+ "type_vocab_size": 2,
29
+ "use_cache": true,
30
+ "vocab_size": 31090
31
+ }
config_sentence_transformers.json ADDED
@@ -0,0 +1,10 @@
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "__version__": {
3
+ "sentence_transformers": "3.3.1",
4
+ "transformers": "4.49.0.dev0",
5
+ "pytorch": "2.5.1+cu121"
6
+ },
7
+ "prompts": {},
8
+ "default_prompt_name": null,
9
+ "similarity_fn_name": "cosine"
10
+ }
model.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:e1e87e50a5c0ffdf4f75a8da96988caebff10ff4e6691f6fdfcadbdc7d452256
3
+ size 439696224
modules.json ADDED
@@ -0,0 +1,20 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ [
2
+ {
3
+ "idx": 0,
4
+ "name": "0",
5
+ "path": "",
6
+ "type": "sentence_transformers.models.Transformer"
7
+ },
8
+ {
9
+ "idx": 1,
10
+ "name": "1",
11
+ "path": "1_Pooling",
12
+ "type": "sentence_transformers.models.Pooling"
13
+ },
14
+ {
15
+ "idx": 2,
16
+ "name": "2",
17
+ "path": "2_Normalize",
18
+ "type": "sentence_transformers.models.Normalize"
19
+ }
20
+ ]
sentence_bert_config.json ADDED
@@ -0,0 +1,4 @@
 
 
 
 
 
1
+ {
2
+ "max_seq_length": 512,
3
+ "do_lower_case": false
4
+ }
special_tokens_map.json ADDED
@@ -0,0 +1,37 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "cls_token": {
3
+ "content": "[CLS]",
4
+ "lstrip": false,
5
+ "normalized": false,
6
+ "rstrip": false,
7
+ "single_word": false
8
+ },
9
+ "mask_token": {
10
+ "content": "[MASK]",
11
+ "lstrip": false,
12
+ "normalized": false,
13
+ "rstrip": false,
14
+ "single_word": false
15
+ },
16
+ "pad_token": {
17
+ "content": "[PAD]",
18
+ "lstrip": false,
19
+ "normalized": false,
20
+ "rstrip": false,
21
+ "single_word": false
22
+ },
23
+ "sep_token": {
24
+ "content": "[SEP]",
25
+ "lstrip": false,
26
+ "normalized": false,
27
+ "rstrip": false,
28
+ "single_word": false
29
+ },
30
+ "unk_token": {
31
+ "content": "[UNK]",
32
+ "lstrip": false,
33
+ "normalized": false,
34
+ "rstrip": false,
35
+ "single_word": false
36
+ }
37
+ }
tokenizer.json ADDED
The diff for this file is too large to render. See raw diff
 
tokenizer_config.json ADDED
@@ -0,0 +1,58 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "added_tokens_decoder": {
3
+ "0": {
4
+ "content": "[PAD]",
5
+ "lstrip": false,
6
+ "normalized": false,
7
+ "rstrip": false,
8
+ "single_word": false,
9
+ "special": true
10
+ },
11
+ "101": {
12
+ "content": "[UNK]",
13
+ "lstrip": false,
14
+ "normalized": false,
15
+ "rstrip": false,
16
+ "single_word": false,
17
+ "special": true
18
+ },
19
+ "102": {
20
+ "content": "[CLS]",
21
+ "lstrip": false,
22
+ "normalized": false,
23
+ "rstrip": false,
24
+ "single_word": false,
25
+ "special": true
26
+ },
27
+ "103": {
28
+ "content": "[SEP]",
29
+ "lstrip": false,
30
+ "normalized": false,
31
+ "rstrip": false,
32
+ "single_word": false,
33
+ "special": true
34
+ },
35
+ "104": {
36
+ "content": "[MASK]",
37
+ "lstrip": false,
38
+ "normalized": false,
39
+ "rstrip": false,
40
+ "single_word": false,
41
+ "special": true
42
+ }
43
+ },
44
+ "clean_up_tokenization_spaces": true,
45
+ "cls_token": "[CLS]",
46
+ "do_basic_tokenize": true,
47
+ "do_lower_case": true,
48
+ "extra_special_tokens": {},
49
+ "mask_token": "[MASK]",
50
+ "model_max_length": 512,
51
+ "never_split": null,
52
+ "pad_token": "[PAD]",
53
+ "sep_token": "[SEP]",
54
+ "strip_accents": null,
55
+ "tokenize_chinese_chars": true,
56
+ "tokenizer_class": "BertTokenizer",
57
+ "unk_token": "[UNK]"
58
+ }
vocab.txt ADDED
The diff for this file is too large to render. See raw diff