arxiv:2410.18538

SMITE: Segment Me In TimE

Published on Oct 24

Submitted by Amirhossein-Alimohammadi on Oct 25

Authors:

Amirhossein Alimohammadi, Sauradip Nag, …

Abstract

Segmenting an object in a video presents significant challenges. Each pixel must be accurately labelled, and these labels must remain consistent across frames. The difficulty increases when the segmentation has arbitrary granularity, meaning that the number of segments can vary arbitrarily, and when masks are defined from only one or a few sample images. In this paper, we address this issue by employing a pre-trained text-to-image diffusion model supplemented with an additional tracking mechanism. We demonstrate that our approach can effectively manage various segmentation scenarios and outperforms state-of-the-art alternatives.


Community

Paper author 12 days ago

A novel diffusion-based video segmentation model that can segment a subject in a video at any granularity, given reference annotations for only a few frames of the subject.

Amirhossein-Alimohammadi

Paper author Paper submitter 12 days ago

https://segment-me-in-time.github.io/
