yaniseuranova
/

setfit-rag-hybrid-search-query-router-test

@@ -10,15 +10,14 @@ tags:
 - text-classification
 - generated_from_setfit_trainer
 widget:
-- text: What are the key components involved in developing a deep learning model for
-    handwritten digit recognition?
 - text: What is the purpose of the message posted by the CR?
-- text: How can researchers create and maintain public repositories for reproducible
-    research?
-- text: What are the key components involved in developing a deep learning model for
-    handwritten digit recognition?
-- text: How do you prioritize and delegate tasks to ensure efficient collaboration
-    and feedback?
 inference: true
 model-index:
 - name: SetFit with sentence-transformers/all-MiniLM-L6-v2
@@ -32,7 +31,7 @@ model-index:
       split: test
     metrics:
     - type: accuracy
-      value: 0.5
       name: Accuracy
 ---
@@ -64,19 +63,19 @@ The model has been trained using an efficient few-shot learning technique that i
 - **Blogpost:** [SetFit: Efficient Few-Shot Learning Without Prompts](https://huggingface.co/blog/setfit)
 ### Model Labels
-| Label         | Examples                                                                                                                                                                                                                                                                                                                                                                       |
-|:--------------|:-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
-| lexical       | <ul><li>'What are the key considerations when choosing an optimization method for a complex problem?'</li><li>'What are the challenges of being a remote mentor or sponsor?'</li><li>'How do researchers typically obtain information on the ranking of machine learning conferences?'</li></ul>                                                                               |
-| semantic      | <ul><li>'What are common issues that users may encounter when accessing a platform that uses JumpCloud for authentication?'</li><li>'What are the key components involved in developing a deep learning model for handwritten digit recognition?'</li><li>'How can machine learning and data enrichment be used to improve business outcomes in various industries?'</li></ul> |
-| very_semantic | <ul><li>"What are people's opinions on a particular topic?"</li><li>'What are the key considerations when proposing names for a project or initiative?'</li><li>'What are the key considerations for successful collaboration between industry and academia in research and development projects?'</li></ul>                                                                   |
-| very_lexical  | <ul><li>'How can one track and store keys in a Flink operator?'</li><li>'What role do companies like Solvay play in addressing key societal challenges through their business strategies and operations?'</li><li>'What is the purpose of the scoring methodology in determining RAI maturity?'</li></ul>                                                                      |
 ## Evaluation
 ### Metrics
 | Label   | Accuracy |
 |:--------|:---------|
-| **all** | 0.5      |
 ## Uses
@@ -128,14 +127,14 @@ preds = model("What is the purpose of the message posted by the CR?")
 ### Training Set Metrics
 | Training set | Min | Median  | Max |
 |:-------------|:----|:--------|:----|
-| Word count   | 8   | 14.4138 | 24  |
 | Label         | Training Sample Count |
 |:--------------|:----------------------|
-| lexical       | 32                    |
-| semantic      | 21                    |
-| very_lexical  | 10                    |
-| very_semantic | 24                    |
 ### Training Hyperparameters
 - batch_size: (8, 8)
@@ -157,50 +156,81 @@ preds = model("What is the purpose of the message posted by the CR?")
 ### Training Results
 | Epoch   | Step     | Training Loss | Validation Loss |
 |:-------:|:--------:|:-------------:|:---------------:|
-| 0.0015  | 1        | 0.268         | -               |
-| 0.0736  | 50       | 0.2649        | -               |
-| 0.1473  | 100      | 0.3352        | -               |
-| 0.2209  | 150      | 0.2516        | -               |
-| 0.2946  | 200      | 0.2438        | -               |
-| 0.3682  | 250      | 0.1808        | -               |
-| 0.4418  | 300      | 0.2365        | -               |
-| 0.5155  | 350      | 0.1337        | -               |
-| 0.5891  | 400      | 0.2263        | -               |
-| 0.6627  | 450      | 0.1936        | -               |
-| 0.7364  | 500      | 0.0612        | -               |
-| 0.8100  | 550      | 0.1664        | -               |
-| 0.8837  | 600      | 0.0987        | -               |
-| 0.9573  | 650      | 0.0736        | -               |
-| 1.0     | 679      | -             | 0.2288          |
-| 1.0309  | 700      | 0.0568        | -               |
-| 1.1046  | 750      | 0.0765        | -               |
-| 1.1782  | 800      | 0.1193        | -               |
-| 1.2518  | 850      | 0.199         | -               |
-| 1.3255  | 900      | 0.2734        | -               |
-| 1.3991  | 950      | 0.194         | -               |
-| 1.4728  | 1000     | 0.1085        | -               |
-| 1.5464  | 1050     | 0.1496        | -               |
-| 1.6200  | 1100     | 0.1673        | -               |
-| 1.6937  | 1150     | 0.2225        | -               |
-| 1.7673  | 1200     | 0.0503        | -               |
-| 1.8409  | 1250     | 0.1531        | -               |
-| 1.9146  | 1300     | 0.2287        | -               |
-| 1.9882  | 1350     | 0.1187        | -               |
-| **2.0** | **1358** | **-**         | **0.2055**      |
-| 2.0619  | 1400     | 0.0546        | -               |
-| 2.1355  | 1450     | 0.2072        | -               |
-| 2.2091  | 1500     | 0.1208        | -               |
-| 2.2828  | 1550     | 0.0837        | -               |
-| 2.3564  | 1600     | 0.0405        | -               |
-| 2.4300  | 1650     | 0.1334        | -               |
-| 2.5037  | 1700     | 0.1458        | -               |
-| 2.5773  | 1750     | 0.2189        | -               |
-| 2.6510  | 1800     | 0.0561        | -               |
-| 2.7246  | 1850     | 0.1656        | -               |
-| 2.7982  | 1900     | 0.1351        | -               |
-| 2.8719  | 1950     | 0.1826        | -               |
-| 2.9455  | 2000     | 0.1905        | -               |
-| 3.0     | 2037     | -             | 0.2273          |
 * The bold row denotes the saved checkpoint.
 ### Framework Versions

 - text-classification
 - generated_from_setfit_trainer
 widget:
+- text: What are the key situations that require the preparation of a mission order?
+- text: How can audio data be used to improve speaker identification using neural
+    networks?
+- text: How can organizations balance the need for data privacy with the benefits
+    of involving interns in data-related projects?
 - text: What is the purpose of the message posted by the CR?
+- text: What are the consequences of adopting a 'if not broken, don't fix' attitude
+    towards data monitoring?
 inference: true
 model-index:
 - name: SetFit with sentence-transformers/all-MiniLM-L6-v2
       split: test
     metrics:
     - type: accuracy
+      value: 0.3076923076923077
       name: Accuracy
 ---
 - **Blogpost:** [SetFit: Efficient Few-Shot Learning Without Prompts](https://huggingface.co/blog/setfit)
 ### Model Labels
+| Label         | Examples                                                                                                                                                                                                                                                                                                                                              |
+|:--------------|:------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
+| very_semantic | <ul><li>'What are the key considerations when proposing names for a project or initiative?'</li><li>'What are the key aspects of team life and events in a company?'</li><li>'What is being asked for or sought in this conversation?'</li></ul>                                                                                                      |
+| lexical       | <ul><li>'Who is responsible for reviewing and signing documents related to conference submissions?'</li><li>'How do data architecture and management systems enable digital transformation and address its associated challenges?'</li><li>'How do keys or access credentials get shared or transferred among team members in a workplace?'</li></ul> |
+| very_lexical  | <ul><li>'What are some of the key challenges associated with handling and storing large amounts of genomic data?'</li><li>"What is the focus of Eurobiomed's partnership with Digital113?"</li><li>'What are the key considerations for generating well-formatted JSON instances that conform to a given schema?'</li></ul>                           |
+| semantic      | <ul><li>'How can visualizations be used to enhance documentation and collaboration in software development?'</li><li>'What are the key considerations when choosing a distance metric for a vector database?'</li><li>'How can AI be leveraged to support HR departments in detecting and addressing gender bias?'</li></ul>                          |
 ## Evaluation
 ### Metrics
 | Label   | Accuracy |
 |:--------|:---------|
+| **all** | 0.3077   |
 ## Uses
 ### Training Set Metrics
 | Training set | Min | Median  | Max |
 |:-------------|:----|:--------|:----|
+| Word count   | 7   | 14.1913 | 24  |
 | Label         | Training Sample Count |
 |:--------------|:----------------------|
+| lexical       | 41                    |
+| semantic      | 24                    |
+| very_lexical  | 17                    |
+| very_semantic | 33                    |
 ### Training Hyperparameters
 - batch_size: (8, 8)
 ### Training Results
 | Epoch   | Step     | Training Loss | Validation Loss |
 |:-------:|:--------:|:-------------:|:---------------:|
+| 0.0008  | 1        | 0.4237        | -               |
+| 0.0417  | 50       | 0.2917        | -               |
+| 0.0834  | 100      | 0.1835        | -               |
+| 0.1251  | 150      | 0.3215        | -               |
+| 0.1668  | 200      | 0.2299        | -               |
+| 0.2085  | 250      | 0.2595        | -               |
+| 0.2502  | 300      | 0.3193        | -               |
+| 0.2919  | 350      | 0.2288        | -               |
+| 0.3336  | 400      | 0.2947        | -               |
+| 0.3753  | 450      | 0.1171        | -               |
+| 0.4170  | 500      | 0.1442        | -               |
+| 0.4587  | 550      | 0.1859        | -               |
+| 0.5004  | 600      | 0.1959        | -               |
+| 0.5421  | 650      | 0.2797        | -               |
+| 0.5838  | 700      | 0.2079        | -               |
+| 0.6255  | 750      | 0.2706        | -               |
+| 0.6672  | 800      | 0.1956        | -               |
+| 0.7089  | 850      | 0.0833        | -               |
+| 0.7506  | 900      | 0.1421        | -               |
+| 0.7923  | 950      | 0.2345        | -               |
+| 0.8340  | 1000     | 0.1347        | -               |
+| 0.8757  | 1050     | 0.241         | -               |
+| 0.9174  | 1100     | 0.133         | -               |
+| 0.9591  | 1150     | 0.1041        | -               |
+| **1.0** | **1199** | **-**         | **0.3562**      |
+| 1.0008  | 1200     | 0.0837        | -               |
+| 1.0425  | 1250     | 0.1566        | -               |
+| 1.0842  | 1300     | 0.2101        | -               |
+| 1.1259  | 1350     | 0.0496        | -               |
+| 1.1676  | 1400     | 0.063         | -               |
+| 1.2093  | 1450     | 0.149         | -               |
+| 1.2510  | 1500     | 0.038         | -               |
+| 1.2927  | 1550     | 0.0504        | -               |
+| 1.3344  | 1600     | 0.0679        | -               |
+| 1.3761  | 1650     | 0.1699        | -               |
+| 1.4178  | 1700     | 0.1293        | -               |
+| 1.4595  | 1750     | 0.1083        | -               |
+| 1.5013  | 1800     | 0.2044        | -               |
+| 1.5430  | 1850     | 0.1267        | -               |
+| 1.5847  | 1900     | 0.0842        | -               |
+| 1.6264  | 1950     | 0.1126        | -               |
+| 1.6681  | 2000     | 0.0544        | -               |
+| 1.7098  | 2050     | 0.143         | -               |
+| 1.7515  | 2100     | 0.08          | -               |
+| 1.7932  | 2150     | 0.1103        | -               |
+| 1.8349  | 2200     | 0.1768        | -               |
+| 1.8766  | 2250     | 0.1639        | -               |
+| 1.9183  | 2300     | 0.1637        | -               |
+| 1.9600  | 2350     | 0.1637        | -               |
+| 2.0     | 2398     | -             | 0.3682          |
+| 2.0017  | 2400     | 0.2938        | -               |
+| 2.0434  | 2450     | 0.0808        | -               |
+| 2.0851  | 2500     | 0.0788        | -               |
+| 2.1268  | 2550     | 0.2187        | -               |
+| 2.1685  | 2600     | 0.0701        | -               |
+| 2.2102  | 2650     | 0.0385        | -               |
+| 2.2519  | 2700     | 0.135         | -               |
+| 2.2936  | 2750     | 0.2276        | -               |
+| 2.3353  | 2800     | 0.2203        | -               |
+| 2.3770  | 2850     | 0.0029        | -               |
+| 2.4187  | 2900     | 0.1855        | -               |
+| 2.4604  | 2950     | 0.1278        | -               |
+| 2.5021  | 3000     | 0.0487        | -               |
+| 2.5438  | 3050     | 0.0404        | -               |
+| 2.5855  | 3100     | 0.1158        | -               |
+| 2.6272  | 3150     | 0.1354        | -               |
+| 2.6689  | 3200     | 0.1633        | -               |
+| 2.7106  | 3250     | 0.1484        | -               |
+| 2.7523  | 3300     | 0.1146        | -               |
+| 2.7940  | 3350     | 0.1437        | -               |
+| 2.8357  | 3400     | 0.0948        | -               |
+| 2.8774  | 3450     | 0.0833        | -               |
+| 2.9191  | 3500     | 0.0668        | -               |
+| 2.9608  | 3550     | 0.1687        | -               |
+| 3.0     | 3597     | -             | 0.3651          |
 * The bold row denotes the saved checkpoint.
 ### Framework Versions

config.json CHANGED Viewed

@@ -1,5 +1,5 @@
 {
-  "_name_or_path": "checkpoints/step_1358",
   "architectures": [
     "BertModel"
   ],

 {
+  "_name_or_path": "checkpoints/step_1199",
   "architectures": [
     "BertModel"
   ],

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:7fac62744a83855a95a3e80c70bf8a4648a3c5a1cd0053760fa1ff330790c771
 size 90864192

 version https://git-lfs.github.com/spec/v1
+oid sha256:54e1f2ecdb4d01b6727aa7c7082233ca8c6ed1ad2689e8555996d2feeeeb4e57
 size 90864192

model_head.pkl CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:a5a2800b0ffabd217138abf7b9e4a3321ce002b79f4c83251f28a4f0a7a58788
 size 13367

 version https://git-lfs.github.com/spec/v1
+oid sha256:178c77f056f085ee38b4a6c61f668e1dedcdd7a98a79ec3d6055de6ef300abf3
 size 13367