Updated ReadMe
README.md
CHANGED
This is a text-summarization model fine-tuned on medical-summary data using BART.
Instance Details:

- Kaggle Notebook with a TPUx2 instance

Why BART:

Transformer language models come in three types: encoder-only, encoder-decoder, and decoder-only. Each type is suited to different tasks.
Text summarization is a seq2seq task, so an encoder-decoder architecture is needed. BART has the top score for summarization on Hugging Face, so I chose it.
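The type-to-task reasoning above can be sketched as a simple lookup. The mapping and example models below are illustrative common knowledge, not part of this repo:

```python
# Illustrative mapping of transformer architecture types to typical tasks.
# The example models are well-known public checkpoints, not from this repo.
ARCHITECTURES = {
    "encoder-only": {"example_model": "BERT", "typical_task": "classification"},
    "encoder-decoder": {"example_model": "BART", "typical_task": "summarization"},
    "decoder-only": {"example_model": "GPT-2", "typical_task": "text generation"},
}

def architecture_for(task):
    """Return the architecture type whose typical task matches `task`."""
    for arch, info in ARCHITECTURES.items():
        if info["typical_task"] == task:
            return arch
    return None
```

`architecture_for("summarization")` returns `"encoder-decoder"`, which matches the choice of BART here.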
Training Code - [Notebook](https://www.kaggle.com/code/ramjib/llm-finetuning-for-text-summarization?scriptVersionId=197771806)

**How to use:**

```python
from transformers import pipeline

# ... (pipeline construction and the ARTICLE text are not shown in this diff)

print(summarizer(ARTICLE, max_length=130, min_length=30, do_sample=False))
>>> [{'summary_text': 'Liana Barrientos, 39, is charged with two counts of "offering a false instrument for filing in the first degree" In total, she has been married 10 times, with nine of her marriages occurring between 1999 and 2002. She is believed to still be married to four men.'}]
```
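BART accepts a limited input length (1,024 tokens for `bart-large` variants), so very long clinical notes get truncated. A minimal sketch of word-based chunking for such inputs (the helper name, chunk size, and overlap are assumptions; word count is only a rough proxy for tokens):

```python
def chunk_text(text, max_words=700, overlap=50):
    """Split text into overlapping word-based chunks so each chunk stays
    comfortably under the model's token limit (words approximate tokens)."""
    words = text.split()
    if len(words) <= max_words:
        return [text]
    chunks = []
    step = max_words - overlap
    for start in range(0, len(words), step):
        chunks.append(" ".join(words[start:start + max_words]))
        if start + max_words >= len(words):
            break
    return chunks
```

Each chunk can then be passed through `summarizer(...)` separately and the partial summaries joined or re-summarized.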
**Deployment**

- APP:
  - [Streamlit](https://huggingface.co/spaces/Ramji/Bart-Medical-summary)
  - [Flask APP](https://huggingface.co/spaces/Ramji/Bart-CNN-Medical-summary-Flask)
- Code Repo:
  - [Streamlit](https://huggingface.co/spaces/Ramji/Bart-Medical-summary/tree/main)
  - [Flask APP](https://huggingface.co/spaces/Ramji/Bart-CNN-Medical-summary-Flask/tree/main)
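The Flask Space linked above serves the model behind an HTTP endpoint. A minimal sketch of such a route (the route name, JSON fields, and the placeholder `summarize` function are assumptions, not the Space's actual code):

```python
from flask import Flask, jsonify, request

app = Flask(__name__)

def summarize(text):
    # Placeholder standing in for the BART summarization pipeline,
    # which the real app would load once at startup.
    return text[:130]

@app.route("/summarize", methods=["POST"])
def summarize_route():
    # Accept {"text": "..."} and return {"summary_text": "..."}.
    payload = request.get_json(force=True)
    return jsonify({"summary_text": summarize(payload.get("text", ""))})
```

Loading the pipeline once at module import (rather than per request) is what keeps per-request latency down to the model's inference time.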
**Limitations**

- Deployed on a 16-CPU instance, so inference **time** is high (GPU-based deployment preferred)
- The model needs longer training (currently trained for only 1 epoch)
- Summary features are not captured as the data output expects (Subjective, Objective, Assessment and Plan - SOAP style)
- The Instruction column from the data should be used for better accuracy and abstraction understanding
- Scalability needs to be handled