alakxender committed
Commit 5c75f1b · verified · 1 Parent(s): afe0cbc

Update README.md

Files changed (1)
  1. README.md +3 -13
README.md CHANGED
@@ -3,6 +3,8 @@ language:
  - dv
  base_model:
  - openai-community/gpt2
+ datasets:
+ - wikimedia/wikipedia
  ---
 
  # GPT 2 DV base
@@ -15,7 +17,6 @@ This is a GPT-2 model fine-tuned on Dhivehi language texts. The model was traine
  - **Language:** Dhivehi (ދިވެހި)
  - **Training Data:** Dhivehi Wikipedia articles
  - **Last Updated:** 2024-11-25
- - **License:** MIT
 
  ## Performance Metrics
 
@@ -58,7 +59,6 @@ print(generated_text)
  The model was trained using the following configuration:
  - Base model: GPT-2
  - Training type: Full fine-tuning
- - Hardware: NVIDIA A40 GPU
  - Mixed precision: FP16
  - Gradient checkpointing: Enabled
 
@@ -88,14 +88,4 @@ This model is suitable for:
  Not intended for:
  - Critical or production systems
  - Decision-making applications
- - Tasks requiring factual accuracy
-
- ## Citation
-
- ```bibtex
- @misc{dhivehi-gpt2,
- title = {Dhivehi GPT-2: A Language Model for Dhivehi Text Generation},
- year = {2024},
- publisher = {Hugging Face},
- }
- ```
+ - Tasks requiring factual accuracy
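
As a quick sanity check on the `+3 -13` summary, the hunk headers in the diff above can be cross-checked against it: each header `@@ -a,b +c,d @@` records the old length `b` and new length `d`, so the differences `d - b` should sum to `3 - 13 = -10`. The following is a minimal sketch, not part of the commit; the helper name `net_line_change` is hypothetical, and the headers are copied by hand from the diff.

```python
import re

# Hunk headers copied by hand from the diff above (trailing context text omitted).
HUNK_HEADERS = [
    "@@ -3,6 +3,8 @@",
    "@@ -15,7 +17,6 @@",
    "@@ -58,7 +59,6 @@",
    "@@ -88,14 +88,4 @@",
]

# Unified-diff hunk header: @@ -old_start[,old_len] +new_start[,new_len] @@
HUNK_RE = re.compile(r"@@ -(\d+)(?:,(\d+))? \+(\d+)(?:,(\d+))? @@")


def net_line_change(header: str) -> int:
    """Return new_len - old_len for one hunk header (hypothetical helper)."""
    match = HUNK_RE.match(header)
    if match is None:
        raise ValueError(f"not a hunk header: {header!r}")
    # An omitted length defaults to 1 in the unified diff format.
    old_len = int(match.group(2) or 1)
    new_len = int(match.group(4) or 1)
    return new_len - old_len


per_hunk = [net_line_change(h) for h in HUNK_HEADERS]
total = sum(per_hunk)
print(per_hunk, total)
```

The per-hunk differences only give the net change; the split into 3 additions and 13 deletions still comes from counting the `+` and `-` lines themselves.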