chibao24
/

model_routing_few_shot

@@ -9,14 +9,34 @@ base_model: sentence-transformers/all-MiniLM-L6-v2
 metrics:
 - accuracy
 widget:
-- text: >-
-    Xác suất để trúng giải thưởng khi bạn mua một tờ vé số là 0.05%. Giả sử mỗi
-    ngày bạn mua 1 tờ vé số, vậy
-    chúng ta cần bao nhiêu ngày (trung bình) để có 98% cơ hội trúng?
-- text: Briefly describe the concept of photosynthesis.
-- text: What are the benefits of using cloud storage?
-- text: Write a Python function that checks if a given number is prime.
 pipeline_tag: text-classification
 inference: true
 model-index:
@@ -31,26 +51,12 @@ model-index:
       split: test
     metrics:
     - type: accuracy
-      value: 0.25
       name: Accuracy
-license: mit
-datasets:
-- chibao24/gpt_routing
-language:
-- vi
-- en
 ---
 # SetFit with sentence-transformers/all-MiniLM-L6-v2
-This model is gpt routing between gpt.5 and gpt-4o based on my prompt (to reduce cost). You can take a look at the dataset for more information.
-I got the idea from this [LLM classifier](https://github.com/lamini-ai/llm-classifier)
-The model utilizes Few-Shot Learning techniques through SetFit, requiring only 8 examples per class. It can be trained in less than 1 minute on an RTX 3060 graphics card.
-This method provides an efficient solution for developing lightweight models suitable for real-world applications.
-The source code can be found in my repo [mrzaizai2k/LLM-with-RAG](https://github.com/mrzaizai2k/LLM-with-RAG)
 This is a [SetFit](https://github.com/huggingface/setfit) model that can be used for Text Classification. This SetFit model uses [sentence-transformers/all-MiniLM-L6-v2](https://huggingface.co/sentence-transformers/all-MiniLM-L6-v2) as the Sentence Transformer embedding model. A [LogisticRegression](https://scikit-learn.org/stable/modules/generated/sklearn.linear_model.LogisticRegression.html) instance is used for classification.
 The model has been trained using an efficient few-shot learning technique that involves:
@@ -77,17 +83,17 @@ The model has been trained using an efficient few-shot learning technique that i
 - **Blogpost:** [SetFit: Efficient Few-Shot Learning Without Prompts](https://huggingface.co/blog/setfit)
 ### Model Labels
-| Label | Examples                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                          |
-|:------|:----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
-| 1     | <ul><li>'Which of the following is a Code-Based Test Coverage Metrics(E. F. Miller, 1977 dissertation)?\nCâu hỏi 1Trả lời\n\na.\nC1c: Every condition outcome\n\nb.\nMMCC: Multiple Module condition coverage\n\nc.\nCx - Every "x" statement ("x" can be single, double, triple)\n\nd.\nC2: C0 coverage + loop coverage'</li><li>'Analyze the time complexity of the merge sort algorithm.'</li><li>'For the expression "(a AND (b OR c))", which of the following test-cases is Multiple Condition Coverage (MCC)?\nCâu hỏi 8Trả lời\n\na.\n04 test cases in (a,b,c) format: "(true,true,true)", "(true,true,false)", "(true,false,true)" and "(false,true,true)"\n\nb.\n02 test cases in (a,b,c) format: "(true,true,true)" and "(false,true,false)"\n\nc.\n06 test cases in (a,b,c)format: "(true,true,true)", "(true,true,false)", "(true,false,true)", "(true,false,false)", "(false,true,true)", and "(false,false,false)"\n\nd.\n08 test cases for all combination of a=true/false, b=true/false, c=true/false'</li></ul> |
-| 0     | <ul><li>'Viết một hàm Python tính giai thừa của một số.'</li><li>'I have this math problem: Solve for x in the equation 2x + 5 = 11. Show the steps involved.'</li><li>'Nêu ngắn gọn về quá trình quang hợp.'</li></ul>                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                           |
 ## Evaluation
 ### Metrics
 | Label   | Accuracy |
 |:--------|:---------|
-| **all** | 0.25     |
 ## Uses
@@ -107,7 +113,7 @@ from setfit import SetFitModel
 # Download from the 🤗 Hub
 model = SetFitModel.from_pretrained("chibao24/model_routing_few_shot")
 # Run inference
-preds = model("What are the benefits of using cloud storage?")
 ```
 <!--
@@ -139,7 +145,7 @@ preds = model("What are the benefits of using cloud storage?")
 ### Training Set Metrics
 | Training set | Min | Median  | Max |
 |:-------------|:----|:--------|:----|
-| Word count   | 4   | 26.7143 | 115 |
 | Label | Training Sample Count |
 |:------|:----------------------|
@@ -166,15 +172,15 @@ preds = model("What are the benefits of using cloud storage?")
 ### Training Results
 | Epoch   | Step    | Training Loss | Validation Loss |
 |:-------:|:-------:|:-------------:|:---------------:|
-| 0.0164  | 1       | 0.353         | -               |
-| 0.8197  | 50      | 0.2404        | -               |
-| 1.0     | 61      | -             | 0.0838          |
-| 1.6393  | 100     | 0.0044        | -               |
-| 2.0     | 122     | -             | 0.0572          |
-| 2.4590  | 150     | 0.0017        | -               |
-| **3.0** | **183** | **-**         | **0.0523**      |
-| 3.2787  | 200     | 0.0055        | -               |
-| 4.0     | 244     | -             | 0.0541          |
 * The bold row denotes the saved checkpoint.
 ### Framework Versions

 metrics:
 - accuracy
 widget:
+- text: 'Which of the following is a Code-Based Test Coverage Metrics(E. F. Miller,
+    1977 dissertation)?
+    Câu hỏi 1Trả lời
+    a.
+    C1c: Every condition outcome
+    b.
+    MMCC: Multiple Module condition coverage
+    c.
+    Cx - Every "x" statement ("x" can be single, double, triple)
+    d.
+    C2: C0 coverage + loop coverage'
+- text: Phần mềm kiểm thử là gì?
+- text: Giải thích sự khác biệt giữa kiểm thử hộp đen và kiểm thử hộp trắng. Cung
+    cấp ví dụ cho từng loại. (ít nhất 150 từ)
+- text: Thủ đô của nước Pháp là gì?
 pipeline_tag: text-classification
 inference: true
 model-index:
       split: test
     metrics:
     - type: accuracy
+      value: 0.5
       name: Accuracy
 ---
 # SetFit with sentence-transformers/all-MiniLM-L6-v2
 This is a [SetFit](https://github.com/huggingface/setfit) model that can be used for Text Classification. This SetFit model uses [sentence-transformers/all-MiniLM-L6-v2](https://huggingface.co/sentence-transformers/all-MiniLM-L6-v2) as the Sentence Transformer embedding model. A [LogisticRegression](https://scikit-learn.org/stable/modules/generated/sklearn.linear_model.LogisticRegression.html) instance is used for classification.
 The model has been trained using an efficient few-shot learning technique that involves:
 - **Blogpost:** [SetFit: Efficient Few-Shot Learning Without Prompts](https://huggingface.co/blog/setfit)
 ### Model Labels
+| Label | Examples                                                                                                                                                                                                                                                                                                                                                                                           |
+|:------|:---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
+| 1     | <ul><li>'Giải thích sự khác biệt giữa mô hình học có giám sát và không giám sát. Cung cấp ví dụ cho từng loại. (ít nhất 150 từ)'</li><li>'Analyze the time complexity of the merge sort algorithm.'</li><li>'Xác suất để trúng giải thưởng khi bạn mua một tờ vé số là 0.05%. Giả sử mỗi ngày bạn mua 1 tờ vé số, vậy\nchúng ta cần bao nhiêu ngày (trung bình) để có 98% cơ hội trúng?'</li></ul> |
+| 0     | <ul><li>'Nêu ngắn gọn về quá trình quang hợp.'</li><li>'Viết một hàm Python tính giai thừa của một số.'</li><li>'Briefly describe the concept of photosynthesis.'</li></ul>                                                                                                                                                                                                                        |
 ## Evaluation
 ### Metrics
 | Label   | Accuracy |
 |:--------|:---------|
+| **all** | 0.5      |
 ## Uses
 # Download from the 🤗 Hub
 model = SetFitModel.from_pretrained("chibao24/model_routing_few_shot")
 # Run inference
+preds = model("Phần mềm kiểm thử là gì?")
 ```
 <!--
 ### Training Set Metrics
 | Training set | Min | Median  | Max |
 |:-------------|:----|:--------|:----|
+| Word count   | 4   | 24.7619 | 115 |
 | Label | Training Sample Count |
 |:------|:----------------------|
 ### Training Results
 | Epoch   | Step    | Training Loss | Validation Loss |
 |:-------:|:-------:|:-------------:|:---------------:|
+| 0.0164  | 1       | 0.1956        | -               |
+| 0.8197  | 50      | 0.1926        | -               |
+| 1.0     | 61      | -             | 0.1463          |
+| 1.6393  | 100     | 0.0228        | -               |
+| **2.0** | **122** | **-**         | **0.0374**      |
+| 2.4590  | 150     | 0.017         | -               |
+| 3.0     | 183     | -             | 0.0507          |
+| 3.2787  | 200     | 0.003         | -               |
+| 4.0     | 244     | -             | 0.0443          |
 * The bold row denotes the saved checkpoint.
 ### Framework Versions

config.json CHANGED Viewed

@@ -1,5 +1,5 @@
 {
-  "_name_or_path": "checkpoints/step_183",
   "architectures": [
     "BertModel"
   ],

 {
+  "_name_or_path": "checkpoints/step_122",
   "architectures": [
     "BertModel"
   ],

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:418ffc704b439c53c1f05f00dadf247b8136a6cd9b2445a3a3a5fa0d76ca913d
 size 90864192

 version https://git-lfs.github.com/spec/v1
+oid sha256:2d69287a4f05099e3df3bcaec8f2156239fae83be7bb0c746d805ed7d0badafe
 size 90864192

model_head.pkl CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:1a9488d8a6934272a8d3056e003576e8aeb95f784f01720be30d7ade10e85fbc
 size 3935

 version https://git-lfs.github.com/spec/v1
+oid sha256:a0a75388482e120e36096e869ba84a8cfc7f08ec8bc417ed76de308380402661
 size 3935