Add SetFit model

Files changed (5)

README.md +26 -18
config.json +1 -1
config_setfit.json +2 -2
model.safetensors +1 -1
model_head.pkl +1 -1

README.md CHANGED

@@ -9,11 +9,13 @@ base_model: sentence-transformers/all-MiniLM-L6-v2
 metrics:
 - accuracy
 widget:
-- text: For the expression "(a AND (b OR c))", which of the following test-cases is
-    Multiple Condition Coverage (MCC)?
-- text: What is software testing?
-- text: What is a dog look like using gpt4
 - text: What are the benefits of using cloud storage?
 pipeline_tag: text-classification
 inference: true
 model-index:
@@ -28,7 +30,7 @@ model-index:
       split: test
     metrics:
     - type: accuracy
-      value: 0.75
       name: Accuracy
 ---
@@ -60,17 +62,17 @@ The model has been trained using an efficient few-shot learning technique that i
 - **Blogpost:** [SetFit: Efficient Few-Shot Learning Without Prompts](https://huggingface.co/blog/setfit)
 ### Model Labels
-| Label | Examples                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                |
-|:------|:--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
-| 0     | <ul><li>'Briefly describe the concept of photosynthesis.'</li><li>'Thủ đô của nước Pháp là gì?'</li><li>'I have this math problem: Solve for x in the equation 2x + 5 = 11. Show the steps involved.'</li></ul>                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                         |
-| 1     | <ul><li>'For the expression "(a AND (b OR c))", which of the following test-cases is Multiple Condition Coverage (MCC)?\nCâu hỏi 8Trả lời\n\na.\n04 test cases in (a,b,c) format: "(true,true,true)", "(true,true,false)", "(true,false,true)" and "(false,true,true)"\n\nb.\n02 test cases in (a,b,c) format: "(true,true,true)" and "(false,true,false)"\n\nc.\n06 test cases in (a,b,c)format: "(true,true,true)", "(true,true,false)", "(true,false,true)", "(true,false,false)", "(false,true,true)", and "(false,false,false)"\n\nd.\n08 test cases for all combination of a=true/false, b=true/false, c=true/false'</li><li>'Xác suất để trúng giải thưởng khi bạn mua một tờ vé số là 0.05%. Giả sử mỗi ngày bạn mua 1 tờ vé số, vậy\nchúng ta cần bao nhiêu ngày (trung bình) để có 98% cơ hội trúng?'</li><li>'Giải thích sự khác biệt giữa kiểm thử hộp đen và kiểm thử hộp trắng. Cung cấp ví dụ cho từng loại. (ít nhất 150 từ)'</li></ul> |
 ## Evaluation
 ### Metrics
 | Label   | Accuracy |
 |:--------|:---------|
-| **all** | 0.75     |
 ## Uses
@@ -90,7 +92,7 @@ from setfit import SetFitModel
 # Download from the 🤗 Hub
 model = SetFitModel.from_pretrained("chibao24/model_routing_few_shot")
 # Run inference
-preds = model("What is software testing?")
 ```
 <!--
@@ -122,7 +124,7 @@ preds = model("What is software testing?")
 ### Training Set Metrics
 | Training set | Min | Median  | Max |
 |:-------------|:----|:--------|:----|
-| Word count   | 4   | 28.4286 | 115 |
 | Label | Training Sample Count |
 |:------|:----------------------|
@@ -131,7 +133,7 @@ preds = model("What is software testing?")
 ### Training Hyperparameters
 - batch_size: (4, 4)
-- num_epochs: (1, 1)
 - max_steps: -1
 - sampling_strategy: oversampling
 - body_learning_rate: (2e-05, 1e-05)
@@ -147,11 +149,17 @@ preds = model("What is software testing?")
 - load_best_model_at_end: True
 ### Training Results
-| Epoch   | Step   | Training Loss | Validation Loss |
-|:-------:|:------:|:-------------:|:---------------:|
-| 0.0164  | 1      | 0.4642        | -               |
-| 0.8197  | 50     | 0.3562        | -               |
-| **1.0** | **61** | **-**         | **0.1247**      |
 * The bold row denotes the saved checkpoint.
 ### Framework Versions

 metrics:
 - accuracy
 widget:
+- text: 'Xác suất để trúng giải thưởng khi bạn mua một tờ vé số là 0.05%. Giả sử mỗi
+    ngày bạn mua 1 tờ vé số, vậy
+    chúng ta cần bao nhiêu ngày (trung bình) để có 98% cơ hội trúng?'
+- text: Briefly describe the concept of photosynthesis.
 - text: What are the benefits of using cloud storage?
+- text: Write a Python function that checks if a given number is prime.
 pipeline_tag: text-classification
 inference: true
 model-index:
       split: test
     metrics:
     - type: accuracy
+      value: 0.25
       name: Accuracy
 ---
 - **Blogpost:** [SetFit: Efficient Few-Shot Learning Without Prompts](https://huggingface.co/blog/setfit)
 ### Model Labels
+| Label | Examples                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                          |
+|:------|:----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
+| 1     | <ul><li>'Which of the following is a Code-Based Test Coverage Metrics(E. F. Miller, 1977 dissertation)?\nCâu hỏi 1Trả lời\n\na.\nC1c: Every condition outcome\n\nb.\nMMCC: Multiple Module condition coverage\n\nc.\nCx - Every "x" statement ("x" can be single, double, triple)\n\nd.\nC2: C0 coverage + loop coverage'</li><li>'Analyze the time complexity of the merge sort algorithm.'</li><li>'For the expression "(a AND (b OR c))", which of the following test-cases is Multiple Condition Coverage (MCC)?\nCâu hỏi 8Trả lời\n\na.\n04 test cases in (a,b,c) format: "(true,true,true)", "(true,true,false)", "(true,false,true)" and "(false,true,true)"\n\nb.\n02 test cases in (a,b,c) format: "(true,true,true)" and "(false,true,false)"\n\nc.\n06 test cases in (a,b,c)format: "(true,true,true)", "(true,true,false)", "(true,false,true)", "(true,false,false)", "(false,true,true)", and "(false,false,false)"\n\nd.\n08 test cases for all combination of a=true/false, b=true/false, c=true/false'</li></ul> |
+| 0     | <ul><li>'Viết một hàm Python tính giai thừa của một số.'</li><li>'I have this math problem: Solve for x in the equation 2x + 5 = 11. Show the steps involved.'</li><li>'Nêu ngắn gọn về quá trình quang hợp.'</li></ul>                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                           |
 ## Evaluation
 ### Metrics
 | Label   | Accuracy |
 |:--------|:---------|
+| **all** | 0.25     |
 ## Uses
 # Download from the 🤗 Hub
 model = SetFitModel.from_pretrained("chibao24/model_routing_few_shot")
 # Run inference
+preds = model("What are the benefits of using cloud storage?")
 ```
 <!--
 ### Training Set Metrics
 | Training set | Min | Median  | Max |
 |:-------------|:----|:--------|:----|
+| Word count   | 4   | 26.7143 | 115 |
 | Label | Training Sample Count |
 |:------|:----------------------|
 ### Training Hyperparameters
 - batch_size: (4, 4)
+- num_epochs: (4, 4)
 - max_steps: -1
 - sampling_strategy: oversampling
 - body_learning_rate: (2e-05, 1e-05)
 - load_best_model_at_end: True
 ### Training Results
+| Epoch   | Step    | Training Loss | Validation Loss |
+|:-------:|:-------:|:-------------:|:---------------:|
+| 0.0164  | 1       | 0.353         | -               |
+| 0.8197  | 50      | 0.2404        | -               |
+| 1.0     | 61      | -             | 0.0838          |
+| 1.6393  | 100     | 0.0044        | -               |
+| 2.0     | 122     | -             | 0.0572          |
+| 2.4590  | 150     | 0.0017        | -               |
+| **3.0** | **183** | **-**         | **0.0523**      |
+| 3.2787  | 200     | 0.0055        | -               |
+| 4.0     | 244     | -             | 0.0541          |
 * The bold row denotes the saved checkpoint.
 ### Framework Versions
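
The new Training Results table can be sanity-checked with a little arithmetic. The sketch below assumes one epoch is 61 steps (inferred from step 61 landing at epoch 1.0) and that `load_best_model_at_end: True` keeps the evaluation with the lowest validation loss. The variable names are illustrative only and are not part of SetFit.

```python
# Rows copied from the new Training Results table: (epoch, step, validation loss).
# Only the rows that report a validation loss are listed.
eval_rows = [
    (1.0, 61, 0.0838),
    (2.0, 122, 0.0572),
    (3.0, 183, 0.0523),
    (4.0, 244, 0.0541),
]

# Assumption: one epoch is 61 steps, inferred from step 61 appearing at epoch 1.0.
STEPS_PER_EPOCH = 61

# Steps at which a training loss was logged, mapped to the epoch printed in the table.
logged = {1: 0.0164, 50: 0.8197, 100: 1.6393, 150: 2.4590, 200: 3.2787}
for step, epoch in logged.items():
    # The fractional epoch should be step / steps-per-epoch, rounded to 4 places.
    assert round(step / STEPS_PER_EPOCH, 4) == epoch

# Assumption: the "best" checkpoint is the one with the lowest validation loss.
best_epoch, best_step, best_loss = min(eval_rows, key=lambda row: row[2])
checkpoint_dir = f"checkpoints/step_{best_step}"
print(best_epoch, best_step, best_loss, checkpoint_dir)
```

Under these assumptions the selected row is epoch 3.0 at step 183, which agrees with the bold row in the table and with the `_name_or_path` value in the config.json change.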

config.json CHANGED

@@ -1,5 +1,5 @@
 {
-  "_name_or_path": "checkpoints/step_61",
   "architectures": [
     "BertModel"
   ],

 {
+  "_name_or_path": "checkpoints/step_183",
   "architectures": [
     "BertModel"
   ],

config_setfit.json CHANGED

@@ -1,7 +1,7 @@
 {
-  "normalize_embeddings": false,
   "labels": [
     0,
     1
-  ]
 }

 {
   "labels": [
     0,
     1
+  ],
+  "normalize_embeddings": false
 }
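
The config_setfit.json change only moves `normalize_embeddings` after `labels`. A minimal sketch, assuming the file is read with a standard JSON parser (where object key order carries no meaning), shows that both versions parse to the same content. The two strings below are condensed copies of the old and new file bodies.

```python
import json

# Condensed copies of the two versions shown in the diff.
old_config = '{"normalize_embeddings": false, "labels": [0, 1]}'
new_config = '{"labels": [0, 1], "normalize_embeddings": false}'

old_parsed = json.loads(old_config)
new_parsed = json.loads(new_config)

# Python dict equality ignores key order, so a pure reordering compares equal.
same_content = old_parsed == new_parsed
print(same_content)
```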

model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:bed8c5ef06d3250eb7dd5bcb133fd4b2eb98d1439e93df30090213805e6a9a43
 size 90864192

 version https://git-lfs.github.com/spec/v1
+oid sha256:418ffc704b439c53c1f05f00dadf247b8136a6cd9b2445a3a3a5fa0d76ca913d
 size 90864192

model_head.pkl CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:6068c2da39dd24c84f1490e62c838bca7dd4554f54e793ccdd8763d1e564742e
 size 3935

 version https://git-lfs.github.com/spec/v1
+oid sha256:1a9488d8a6934272a8d3056e003576e8aeb95f784f01720be30d7ade10e85fbc
 size 3935
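
Both binary files appear in the diff as Git LFS pointer files, so the only visible change is the `oid` hash. The sketch below uses a hypothetical helper, `parse_lfs_pointer`, to split a pointer into its key/value lines and compare the old and new model_head.pkl pointers. It is an illustration of the pointer layout shown above, not a Git LFS API.

```python
def parse_lfs_pointer(text: str) -> dict:
    """Split a Git LFS pointer file into a dict of its 'key value' lines."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    return fields

# Pointer contents copied from the model_head.pkl diff.
old_pointer = """version https://git-lfs.github.com/spec/v1
oid sha256:6068c2da39dd24c84f1490e62c838bca7dd4554f54e793ccdd8763d1e564742e
size 3935
"""
new_pointer = """version https://git-lfs.github.com/spec/v1
oid sha256:1a9488d8a6934272a8d3056e003576e8aeb95f784f01720be30d7ade10e85fbc
size 3935
"""

old = parse_lfs_pointer(old_pointer)
new = parse_lfs_pointer(new_pointer)

# A different oid means different file contents; the byte size is unchanged.
content_changed = old["oid"] != new["oid"]
size_changed = old["size"] != new["size"]
print(content_changed, size_changed)
```

The same comparison applies to model.safetensors, whose pointer also changes only in its `oid` while the size stays at 90864192 bytes.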