TokenFormer: Rethinking Transformer Scaling with Tokenized Model Parameters Paper • 2410.23168 • Published 7 days ago • 17
Falcon Mamba: The First Competitive Attention-free 7B Language Model Paper • 2410.05355 • Published 30 days ago • 27
RoCoTex: A Robust Method for Consistent Texture Synthesis with Diffusion Models Paper • 2409.19989 • Published Sep 30 • 17