VLOGGER: Multimodal Diffusion for Embodied Avatar Synthesis
Paper • 2403.08764 • Published Mar 13, 2024
Just released moondream2 - a small 1.8B parameter vision language model. Now fully open source (Apache 2.0) so you can use it without restrictions on commercial use! vikhyatk/moondream2