CAMeL-Lab/arat5-coda-did


balhafni committed on Jul 6

Commit c50f580 (1 parent: b0bf55b)

Update README.md

Files changed (1)

README.md +3 -3

@@ -5,12 +5,12 @@ language:
 ---
-# AraT5+DA Phrase CODAfication Model
 ## Model description
-**AraT5+DA Phrase** is a text normalization model that normalizes dialectal Arabic text into the Conventional Orthography for Dialectal Arabic (CODA).
 The model was built by fine-tuning [AraT5-v2](https://huggingface.co/UBC-NLP/AraT5v2-base-1024) on the [MADAR CODA](https://camel.abudhabi.nyu.edu/madar-coda-corpus/) dataset.
-This model was trained with the DA Phrase dialect identification control token as we describe in our [paper](https://arxiv.org/abs/2407.03020).
 Our fine-tuning procedure and the hyperparameters we used can be found in our paper *"[Exploiting Dialect Identification
 in Automatic Dialectal Text Normalization](https://arxiv.org/abs/2407.03020)."* Our fine-tuning code and data can be found [here](https://github.com/CAMeL-Lab/codafication).

 ---
+# AraT5+DID CODAfication Model
 ## Model description
+**AraT5 CODA + DID** is a text normalization model that normalizes dialectal Arabic text into the Conventional Orthography for Dialectal Arabic (CODA).
 The model was built by fine-tuning [AraT5-v2](https://huggingface.co/UBC-NLP/AraT5v2-base-1024) on the [MADAR CODA](https://camel.abudhabi.nyu.edu/madar-coda-corpus/) dataset.
+This model was trained with the DA Phrase Dialect Identification (DID) control token as we describe in our [paper](https://arxiv.org/abs/2407.03020).
 Our fine-tuning procedure and the hyperparameters we used can be found in our paper *"[Exploiting Dialect Identification
 in Automatic Dialectal Text Normalization](https://arxiv.org/abs/2407.03020)."* Our fine-tuning code and data can be found [here](https://github.com/CAMeL-Lab/codafication).
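
The updated README says the model was trained with a DA Phrase dialect identification (DID) control token. The following is a minimal sketch of what inference with such a token might look like, not the authors' documented usage. It assumes the control token is a short dialect phrase prepended to the input and separated by a space; the exact phrases and format are not given in this diff and should be taken from the paper or the fine-tuning repository. The helper names `build_input` and `codafy` are hypothetical, and the repository id `CAMeL-Lab/arat5-coda-did` is taken from the page header.

```python
def build_input(text: str, dialect_phrase: str) -> str:
    """Prepend a dialect control phrase to the raw sentence.

    ASSUMPTION: the control token is a plain-text phrase placed before the
    sentence and separated by one space. Check the paper for the real format.
    """
    return f"{dialect_phrase.strip()} {text.strip()}"


def codafy(text: str, dialect_phrase: str,
           model_name: str = "CAMeL-Lab/arat5-coda-did",
           max_new_tokens: int = 128) -> str:
    """Normalize one dialectal Arabic sentence into CODA (hypothetical helper)."""
    # Imported lazily so build_input can be used without transformers installed.
    from transformers import AutoModelForSeq2SeqLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_name)
    model = AutoModelForSeq2SeqLM.from_pretrained(model_name)

    # Tokenize the sentence with the control phrase attached.
    inputs = tokenizer(build_input(text, dialect_phrase), return_tensors="pt")

    # Generate the normalized sentence and decode it back to text.
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

Calling `codafy(sentence, phrase)` downloads the model weights on first use. In practice the dialect phrase would come from a dialect identification step run on the sentence beforehand, which this sketch leaves out.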