alakxender
/

dhivehi-gpt2-base

Model card Files Files and versions Community

alakxender commited on Dec 11, 2024

Commit

5c75f1b

·

verified ·

1 Parent(s): afe0cbc

Update README.md

Files changed (1) hide show

README.md +3 -13

README.md CHANGED Viewed

@@ -3,6 +3,8 @@ language:
 - dv
 base_model:
 - openai-community/gpt2
 ---
 # GPT 2 DV base
@@ -15,7 +17,6 @@ This is a GPT-2 model fine-tuned on Dhivehi language texts. The model was traine
 - **Language:** Dhivehi (ދިވެހި)
 - **Training Data:** Dhivehi Wikipedia articles
 - **Last Updated:** 2024-11-25
-- **License:** MIT
 ## Performance Metrics
@@ -58,7 +59,6 @@ print(generated_text)
 The model was trained using the following configuration:
 - Base model: GPT-2
 - Training type: Full fine-tuning
-- Hardware: NVIDIA A40 GPU
 - Mixed precision: FP16
 - Gradient checkpointing: Enabled
@@ -88,14 +88,4 @@ This model is suitable for:
 Not intended for:
 - Critical or production systems
 - Decision-making applications
-- Tasks requiring factual accuracy
-## Citation
-```bibtex
-@misc{dhivehi-gpt2,
-  title = {Dhivehi GPT-2: A Language Model for Dhivehi Text Generation},
-  year = {2024},
-  publisher = {Hugging Face},
-}
-```

 - dv
 base_model:
 - openai-community/gpt2
+datasets:
+- wikimedia/wikipedia
 ---
 # GPT 2 DV base
 - **Language:** Dhivehi (ދިވެހި)
 - **Training Data:** Dhivehi Wikipedia articles
 - **Last Updated:** 2024-11-25
 ## Performance Metrics
 The model was trained using the following configuration:
 - Base model: GPT-2
 - Training type: Full fine-tuning
 - Mixed precision: FP16
 - Gradient checkpointing: Enabled
 Not intended for:
 - Critical or production systems
 - Decision-making applications
+- Tasks requiring factual accuracy