balhafni committed on
Commit
c50f580
1 Parent(s): b0bf55b

Update README.md

Files changed (1)
  1. README.md +3 -3
README.md CHANGED
@@ -5,12 +5,12 @@ language:
  ---


- # AraT5+DA Phrase CODAfication Model
+ # AraT5+DID CODAfication Model

  ## Model description
- **AraT5+DA Phrase** is a text normalization model that normalizes dialectal Arabic text into the Conventional Orthography for Dialectal Arabic (CODA).
+ **AraT5 CODA + DID** is a text normalization model that normalizes dialectal Arabic text into the Conventional Orthography for Dialectal Arabic (CODA).
  The model was built by fine-tuning [AraT5-v2](https://huggingface.co/UBC-NLP/AraT5v2-base-1024) on the [MADAR CODA](https://camel.abudhabi.nyu.edu/madar-coda-corpus/) dataset.
- This model was trained with the DA Phrase dialect identification control token as we describe in our [paper](https://arxiv.org/abs/2407.03020).
+ This model was trained with the DA Phrase Dialect Identification (DID) control token as we describe in our [paper](https://arxiv.org/abs/2407.03020).
  Our fine-tuning procedure and the hyperparameters we used can be found in our paper *"[Exploiting Dialect Identification
  in Automatic Dialectal Text Normalization](https://arxiv.org/abs/2407.03020)."* Our fine-tuning code and data can be found [here](https://github.com/CAMeL-Lab/codafication).