VideoGrain: Modulating Space-Time Attention for Multi-grained Video Editing
Paper
•
2502.17258
•
Published
•
58
This is related to one of my papers. If you are interested, we can discuss the details via email. 😄
"When using the standalone GemmaTokenizerFast make sure to pass padding="max_length" and max_length=64 as that’s how the model was trained." Does Siglip2 support longer text input? If the max_length is set to 256 or 512, will text exceeding 64 be truncated?