MM LLM Papers OK-Robot: What Really Matters in Integrating Open-Knowledge Models for Robotics Paper • 2401.12202 • Published Jan 22 • 9 Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs Paper • 2401.11708 • Published Jan 22 • 29
OK-Robot: What Really Matters in Integrating Open-Knowledge Models for Robotics Paper • 2401.12202 • Published Jan 22 • 9
Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs Paper • 2401.11708 • Published Jan 22 • 29
Interesting Papers to Read StreamVoice: Streamable Context-Aware Language Modeling for Real-time Zero-Shot Voice Conversion Paper • 2401.11053 • Published Jan 19 • 9
StreamVoice: Streamable Context-Aware Language Modeling for Real-time Zero-Shot Voice Conversion Paper • 2401.11053 • Published Jan 19 • 9
TheSeriousProgrammer/spoken_words_en_ml_commons_filtered_split Viewer • Updated Jan 27, 2023 • 355k • 115