---
pipeline_tag: object-detection
tags:
- code
---
# language-levels-yolov10m
This repository contains the fine-tuning script and the weights of YOLOv10m fine-tuned on the language levels dataset.

### The language levels dataset :
- The dataset contains about 50000 images (some corrupt images were removed).

### The training :
- We fine-tuned YOLOv10m with this configuration :
  - Model: `yolov10m.pt`
  - Epochs: 10
  - Batch: 12
  - Device: `[0, 1]` (2 x T4 GPUs)
  - Size: `{'width': 799, 'height': 151}`

- Training time (wall time) : 3h 24min 42s
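
The configuration above could be passed to the `ultralytics` training API roughly as sketched below. This is not the repository's actual script. It assumes a recent `ultralytics` release with YOLOv10 support, a hypothetical dataset file `language_levels.yaml`, and that `imgsz` is the longest image side rounded up to a multiple of the stride (32); the card does not state which `imgsz` was used.

```python
import math

# Values copied from the configuration listed above.
CONFIG = {
    "model": "yolov10m.pt",
    "epochs": 10,
    "batch": 12,
    "device": [0, 1],
    "size": {"width": 799, "height": 151},
}


def build_train_args(config, data_yaml, stride=32):
    """Translate the listed configuration into keyword arguments for YOLO.train()."""
    longest_side = max(config["size"]["width"], config["size"]["height"])
    # Assumption: imgsz is the longest side rounded up to a stride multiple.
    imgsz = math.ceil(longest_side / stride) * stride
    return {
        "data": data_yaml,
        "epochs": config["epochs"],
        "batch": config["batch"],
        "device": config["device"],
        "imgsz": imgsz,
    }


def train(data_yaml="language_levels.yaml"):  # hypothetical dataset file name
    # Imported lazily so build_train_args works without ultralytics installed.
    from ultralytics import YOLO

    model = YOLO(CONFIG["model"])
    return model.train(**build_train_args(CONFIG, data_yaml))
```

Calling `train()` starts the run; `build_train_args` can be inspected on its own to check the arguments first.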

### The results :

##### Confusion matrix :


![image/png](https://cdn-uploads.huggingface.co/production/uploads/6682bf7c0b72be1367b0a69b/fLDLyFvBAeR8fxmcFUlvH.png)


![image/png](https://cdn-uploads.huggingface.co/production/uploads/6682bf7c0b72be1367b0a69b/FXMHDRp_UBx3glHj-mwUx.png)

##### Training loss :

- Box loss : best training 2.0493 (epoch 10), best validation 1.7033 (epoch 10)
- Cls loss : best training 1.1808 (epoch 10), best validation 0.84118 (epoch 10)
- DFL loss : best training 1.7253 (epoch 10), best validation 1.6617 (epoch 10)
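
Best-loss figures like these can be recovered from the `results.csv` file that `ultralytics` writes during training. The helper below is a minimal sketch, not the script used here; `best_losses` is a hypothetical name, and the column names (`train/box_loss`, `val/box_loss`, and so on) are assumed to follow the usual `ultralytics` layout.

```python
import csv
import io


def best_losses(csv_text, columns):
    """Return {column: (lowest_value, epoch)} for each requested loss column."""
    reader = csv.DictReader(io.StringIO(csv_text))
    best = {}
    for row in reader:
        # Some ultralytics versions pad header names and values with spaces.
        row = {key.strip(): value.strip() for key, value in row.items()}
        epoch = int(float(row["epoch"]))
        for col in columns:
            value = float(row[col])
            # Keep the lowest value seen so far and the epoch it occurred on.
            if col not in best or value < best[col][0]:
                best[col] = (value, epoch)
    return best
```

For a real run, read the file contents first (for example from a path such as `runs/detect/train/results.csv`, which depends on the local setup) and pass the text together with the six loss column names.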

##### Precision, F1 and recall :


![image/png](https://cdn-uploads.huggingface.co/production/uploads/6682bf7c0b72be1367b0a69b/HSCdXds_wtmbmD57to8g2.png)    

![image/png](https://cdn-uploads.huggingface.co/production/uploads/6682bf7c0b72be1367b0a69b/XfPWbopRLdTXrnvv7e64W.png)

![image/png](https://cdn-uploads.huggingface.co/production/uploads/6682bf7c0b72be1367b0a69b/uTe0h65KUaLh_iv0kRS46.png)
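
For reading the curves above, F1 is the harmonic mean of precision and recall at a given confidence threshold. A minimal sketch of that relationship (the function name is illustrative and not part of this repository):

```python
def f1_score(precision, recall):
    """Harmonic mean of precision and recall; 0.0 when both are zero."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)
```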