--- pipeline_tag: object-detection tags: - code --- # language-levels-yolov10m This repository contains the fine-tuning script and weights of YOLOv10m on languge levels dataset. ### The language levels dataset : - The dataset contains about 50000 images (some corrupt images were removed). ### The training : - We fine-tuned YOLOv10m with this configuration : Model: yolov10m.pt Epochs: 10 Batch: 12 device: [0,1] (GPU T4 x 2) size : {'width': 799, 'height': 151} - Training time : Wall time: 3h 24min 42s ### The results : ##### Confusion matrix : ![image/png](https://cdn-uploads.huggingface.co/production/uploads/6682bf7c0b72be1367b0a69b/fLDLyFvBAeR8fxmcFUlvH.png) ![image/png](https://cdn-uploads.huggingface.co/production/uploads/6682bf7c0b72be1367b0a69b/FXMHDRp_UBx3glHj-mwUx.png) ##### Training loss : Best Training Box loss: 2.0493 , on epoch: 10 Best Validation Box loss: 1.7033 , on epoch: 10 ================================================== Best Training Cls loss: 1.1808 , on epoch: 10 Best Validation Cls loss: 0.84118 , on epoch: 10 ================================================== Best Training DFL loss: 1.7253 , on epoch: 10 Best Validation DFL loss: 1.6617 , on epoch: 10 ##### Precision, F1 and recall : ![image/png](https://cdn-uploads.huggingface.co/production/uploads/6682bf7c0b72be1367b0a69b/HSCdXds_wtmbmD57to8g2.png) ![image/png](https://cdn-uploads.huggingface.co/production/uploads/6682bf7c0b72be1367b0a69b/XfPWbopRLdTXrnvv7e64W.png) ![image/png](https://cdn-uploads.huggingface.co/production/uploads/6682bf7c0b72be1367b0a69b/uTe0h65KUaLh_iv0kRS46.png)