|
--- |
|
license: cc-by-nc-4.0 |
|
base_model: |
|
- Organika/sdxl-detector |
|
library_name: transformers |
|
tags: |
|
- image-classification |
|
--- |
|
# AI-image-detector |
|
The purpose of this model is to classify images as AI generated or Real. |
|
### Dataset |
|
This model was created by fine-tuning the [Organika/sdxl-detector] on dataset of AI generated and real images from reddit, kaggle and real art from public domain with their text description. |
|
Dataset was balanced to have similar number of real and generated images in each class (e.g. art, photos ...). |
|
Art images from public domain were paired with generated equivalent created from their text descriptions with style transfer (sdxl with ip-adapter) from original piece. |
|
The final dataset consisted of more than 50k images. |
|
### Testing |
|
The testing dataset consisted of 20% split of our base dataset and images outside the training domain from specific popular (as of 2024) image generation models. |
|
Finetuning vastly improved performance over Organika/sdxl-detector during testing, especially on images created by newer models. |
|
|
|
|
|
Test split evaluation |
|
| Accuracy | Precision | Recall | F1 | |
|
|:-------------:|:---------------:|:--------:|:--------:| |
|
| 0.9818 | 0.9829 | 0.9810 | 0.9819 | |
|
|
|
Out of domain evaluation |
|
| Generative Model Family | Accuracy | |
|
|:-------------:|:---------------:| |
|
| DALL-E | 0.9076 | |
|
| FluxAi | 0.8333 | |
|
| Imagen | 0.7563 | |
|
| StableDiffusion | 0.8754 | |
|
|
|
|
|
### License |
|
The data used to fine-tune this model was scraped from image dedicated subreddits, some of which may be copyrighted. For this reason, this model should be considered appropriate only for non-commercial use only. |