File size: 1,855 Bytes
36d938d
 
 
 
 
 
 
 
 
 
 
 
 
 
 
fd62566
36d938d
fd62566
36d938d
fd62566
36d938d
fd62566
36d938d
fd62566
36d938d
fd62566
36d938d
fd62566
36d938d
fd62566
 
36d938d
fd62566
36d938d
fd62566
 
 
 
 
 
 
 
36d938d
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
---
license: apache-2.0
base_model: Helsinki-NLP/opus-mt-de-en
tags:
- generated_from_trainer
metrics:
- bleu
model-index:
- name: spark-name-de-to-en
  results: []
---

<!-- This model card has been generated automatically according to the information the Trainer had access to. You
should probably proofread and complete it, then remove this comment. -->

# German Names to English Translation Model

## Model Overview

This translation model is specifically designed to accurately and fluently translate German names and surnames into English.

## Intended Uses and Limitations

This model is built for Spark IT enterprise looking to automate the translation process of German names and surnames into English.

## Training and Evaluation Data

This model has been trained on a diverse dataset consisting of over 68,493 lines of data, encompassing a wide range of Hindi names and surnames along with their English counterparts. Evaluation data has been carefully selected to ensure reliable and accurate translation performance.

## Training Procedure
- 1 days of training

### Hardware Environment:

- Azure Studio 
- Standard_DS12_v2
- 4 cores, 28GB RAM, 56GB storage
- Data manipulation and training on medium-sized datasets (1-10GB)
- 6 cores
- Loss: 0.4618
- Bleu: 70.7674
- Gen Len: 10.2548


### Training results

| Training Loss | Epoch | Step | Validation Loss | Bleu    | Gen Len |
|:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|
| 1.0715        | 1.0   | 1600 | 0.9902          | 41.5519 | 5.8328  |
| 0.8185        | 2.0   | 3200 | 0.9547          | 53.8222 | 5.7988  |
| 0.6909        | 3.0   | 4800 | 0.9527          | 54.7846 | 5.8169  |
| 0.6038        | 4.0   | 6400 | 0.9496          | 55.6009 | 5.8406  |


### Framework versions

- Transformers 4.39.1
- Pytorch 2.2.2+cpu
- Datasets 2.15.0
- Tokenizers 0.15.2