AmelieSchreiber commited on
Commit
210a373
·
1 Parent(s): 7cf9c57

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -37,7 +37,7 @@ comprehensive as it could be (see [this report for more details](https://api.wan
37
  This model is a finetuned version of the 35M parameter `esm2_t12_35M_UR50D` ([see here](https://huggingface.co/facebook/esm2_t12_35M_UR50D)
38
  and [here](https://huggingface.co/docs/transformers/model_doc/esm) for more details). The model was finetuned with LoRA for
39
  the binay token classification task of predicting binding sites (and active sites) of protein sequences based on sequence alone.
40
- The model may be underfit and undertrained, however it still achieved better performance on the test set in terms of loss, accuracy,
41
  precision, recall, F1 score, ROC_AUC, and Matthews Correlation Coefficient (MCC) compared to the models trained on the smaller
42
  dataset [found here](https://huggingface.co/datasets/AmelieSchreiber/binding_sites_random_split_by_family) of ~209K protein sequences. Note,
43
  this model has a high recall, meaning it is likely to detect binding sites, but it has a low precision, meaning the model will likely return
 
37
  This model is a finetuned version of the 35M parameter `esm2_t12_35M_UR50D` ([see here](https://huggingface.co/facebook/esm2_t12_35M_UR50D)
38
  and [here](https://huggingface.co/docs/transformers/model_doc/esm) for more details). The model was finetuned with LoRA for
39
  the binay token classification task of predicting binding sites (and active sites) of protein sequences based on sequence alone.
40
+ The model may need more training, however it still achieves better performance on the test set in terms of loss, accuracy,
41
  precision, recall, F1 score, ROC_AUC, and Matthews Correlation Coefficient (MCC) compared to the models trained on the smaller
42
  dataset [found here](https://huggingface.co/datasets/AmelieSchreiber/binding_sites_random_split_by_family) of ~209K protein sequences. Note,
43
  this model has a high recall, meaning it is likely to detect binding sites, but it has a low precision, meaning the model will likely return