---
library_name: transformers
tags:
- diffusion
- style_similarity
- CSD
language:
- en
pipeline_tag: text2text-generation
license: cc-by-4.0
---

# Quick Links

- **GitHub Repository**: https://github.com/learn2phoenix/CSD
- **arXiv**: https://arxiv.org/abs/2404.01292

# Description

We present a framework for understanding and extracting style descriptors from images. Our framework comprises a new dataset, curated using the insight that style is a subjective property of an image that captures complex yet meaningful interactions of factors including, but not limited to, colors, textures, and shapes. We also propose a method to extract style descriptors that can be used to attribute the style of a generated image to the images in the training dataset of a text-to-image model.

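Once style descriptors have been extracted, attributing the style of a generated image reduces to nearest-neighbor search in the descriptor space. As a minimal sketch (the random arrays below are stand-ins for real CSD embeddings, and the 1024-dimensional size is an assumption based on the ViT-Large backbone), cosine similarity ranks candidate training images by style:

```python
import numpy as np

def cosine_similarity_matrix(queries: np.ndarray, gallery: np.ndarray) -> np.ndarray:
    """Row-wise cosine similarity between query and gallery descriptors."""
    q = queries / np.linalg.norm(queries, axis=1, keepdims=True)
    g = gallery / np.linalg.norm(gallery, axis=1, keepdims=True)
    return q @ g.T

def attribute_style(query: np.ndarray, gallery: np.ndarray, k: int = 5) -> np.ndarray:
    """Indices of the k gallery images most style-similar to the query."""
    sims = cosine_similarity_matrix(query[None, :], gallery)[0]
    return np.argsort(-sims)[:k]

# Stand-in descriptors; in practice these would come from the CSD encoder.
rng = np.random.default_rng(0)
gallery = rng.standard_normal((100, 1024))
query = gallery[42] + 0.01 * rng.standard_normal(1024)  # near-duplicate style

print(attribute_style(query, gallery, k=3))  # gallery index 42 ranks first
```

The descriptors are L2-normalized before the dot product, so ranking by cosine similarity is equivalent to ranking by Euclidean distance on the unit sphere.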
# Technical Specification

This checkpoint is for the ViT-Large variant of the model.

# Cite our work

If you find our model, codebase, or dataset beneficial, please consider citing our work:

```bibtex
@article{somepalli2024measuring,
  title={Measuring Style Similarity in Diffusion Models},
  author={Somepalli, Gowthami and Gupta, Anubhav and Gupta, Kamal and Palta, Shramay and Goldblum, Micah and Geiping, Jonas and Shrivastava, Abhinav and Goldstein, Tom},
  journal={arXiv preprint arXiv:2404.01292},
  year={2024}
}
```