Support for Flash Attention?
#15
by
arnaudstiegler
- opened
Any particular reason why paligemma doesn't support flash attention? Looks like Siglip doesn't have flash-attention support, but Gemma has it so it could be enabled on the LM side
Hi @arnaudstiegler , Please have a look at the similar issue, might be helpful in this. Thank you.