yefo-ufpe
/

bert-base-uncased-swag

Generated from Trainer

Model card Files Files and versions Metrics Training metrics Community

yefo-ufpe commited on Aug 26

Commit

79f635d

•

1 Parent(s): 0d4b002

lora info

Files changed (1) hide show

README.md +4 -3

README.md CHANGED Viewed

@@ -9,14 +9,14 @@ tags:
 - sft
 - generated_from_trainer
 model-index:
-- name: output
   results: []
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
-# output
 This model is a fine-tuned version of [google-bert/bert-base-uncased](https://huggingface.co/google-bert/bert-base-uncased) on [SWAG](https://huggingface.co/datasets/allenai/swag) dataset.
 It achieves the following results on the evaluation set:
@@ -25,7 +25,6 @@ It achieves the following results on the evaluation set:
 ## Model description
-More information needed
 ## Intended uses & limitations
@@ -41,6 +40,8 @@ dataset = load_dataset("swag")
 ## Training procedure
 ### Training hyperparameters
 The following hyperparameters were used during training:

 - sft
 - generated_from_trainer
 model-index:
+- name: bert-base-uncased-swag
   results: []
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
+# bert-base-uncased-swag
 This model is a fine-tuned version of [google-bert/bert-base-uncased](https://huggingface.co/google-bert/bert-base-uncased) on [SWAG](https://huggingface.co/datasets/allenai/swag) dataset.
 It achieves the following results on the evaluation set:
 ## Model description
 ## Intended uses & limitations
 ## Training procedure
+Our approach focuses explicitly on adapting the Transformers weights' Wq (query) and Wv (value) in the attention module for parameter efficiency.
 ### Training hyperparameters
 The following hyperparameters were used during training: