Image Gen to Image with No Background to 3D Model in GLB and OBJ formats for AR/VR

#5
by awacke1 - opened
Owner

One of the new techniques in stable video generation is to impose 3D models in the render pipeline to maintain consistency of the subject.

This AI pipeline demonstrates, step by step, how to create the input assets for a 3D-driven pipeline from an image, using a technique known as photogrammetry or 2D-to-3D AI model generation.

Prompt:
An elderly man engages in a virtual reality physical therapy session, guided by a compassionate AI therapist that adapts the exercises to his abilities and provides encouragement, all from the comfort of his own home.

Image Gen, Background Removal:
image.png

Six Images from Perspective Gen, 3D Object:

image.png

image.png
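
For anyone wondering what the last step (3D object to OBJ and GLB) looks like at the file level, here is a minimal sketch in pure Python. It is not the code this pipeline runs; it only shows how a triangle mesh can be serialized to both formats. The function names `mesh_to_obj` and `mesh_to_glb` and the square test mesh are hypothetical, and the GLB writer covers positions and indices only (no normals, textures, or materials).

```python
import json
import struct


def mesh_to_obj(vertices, faces):
    """Serialize a triangle mesh as Wavefront OBJ text (OBJ indices are 1-based)."""
    lines = [f"v {x} {y} {z}" for x, y, z in vertices]
    lines += [f"f {a + 1} {b + 1} {c + 1}" for a, b, c in faces]
    return "\n".join(lines) + "\n"


def _pad(data, fill):
    """Pad bytes to a 4-byte boundary, as the GLB container requires."""
    return data + fill * (-len(data) % 4)


def mesh_to_glb(vertices, faces):
    """Serialize the same mesh as a binary glTF 2.0 (GLB) container.

    Assumption: fewer than 65536 vertices, so indices fit in unsigned shorts.
    """
    index_bytes = b"".join(struct.pack("<3H", *f) for f in faces)
    index_padded = _pad(index_bytes, b"\x00")
    vertex_bytes = b"".join(struct.pack("<3f", *v) for v in vertices)
    bin_chunk = _pad(index_padded + vertex_bytes, b"\x00")

    xs, ys, zs = zip(*vertices)
    gltf = {
        "asset": {"version": "2.0"},
        "scene": 0,
        "scenes": [{"nodes": [0]}],
        "nodes": [{"mesh": 0}],
        "meshes": [{"primitives": [{"attributes": {"POSITION": 1}, "indices": 0}]}],
        "buffers": [{"byteLength": len(bin_chunk)}],
        "bufferViews": [
            # 34963 = ELEMENT_ARRAY_BUFFER, 34962 = ARRAY_BUFFER
            {"buffer": 0, "byteOffset": 0,
             "byteLength": len(index_bytes), "target": 34963},
            {"buffer": 0, "byteOffset": len(index_padded),
             "byteLength": len(vertex_bytes), "target": 34962},
        ],
        "accessors": [
            # 5123 = UNSIGNED_SHORT, 5126 = FLOAT
            {"bufferView": 0, "componentType": 5123,
             "count": len(faces) * 3, "type": "SCALAR"},
            {"bufferView": 1, "componentType": 5126,
             "count": len(vertices), "type": "VEC3",
             "min": [min(xs), min(ys), min(zs)],
             "max": [max(xs), max(ys), max(zs)]},
        ],
    }
    json_chunk = _pad(json.dumps(gltf).encode("utf-8"), b" ")
    total = 12 + 8 + len(json_chunk) + 8 + len(bin_chunk)
    return b"".join([
        struct.pack("<4sII", b"glTF", 2, total),            # 12-byte header
        struct.pack("<I4s", len(json_chunk), b"JSON"), json_chunk,
        struct.pack("<I4s", len(bin_chunk), b"BIN\x00"), bin_chunk,
    ])


# Hypothetical stand-in for the reconstructed mesh: a unit square, two triangles.
vertices = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0), (1.0, 1.0, 0.0), (0.0, 1.0, 0.0)]
faces = [(0, 1, 2), (0, 2, 3)]
obj_text = mesh_to_obj(vertices, faces)
glb_bytes = mesh_to_glb(vertices, faces)
```

Writing `obj_text` to a `.obj` file in text mode and `glb_bytes` to a `.glb` file in binary mode should give files that AR/VR viewers can open. In practice a mesh library would handle normals, UVs, and textures as well.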

Owner

I believe this technology, combining multimodal AI with the ability to composite 3D objects, will impact the Top 10 AI Innovation Areas in 2024:

  1. Multi Agent Systems (MAS)
  2. Voice Assistants
  3. Multi-Modality (Text, Image, Audio, Video, 3D)
  4. Personalization and Memory
  5. Reasoning and Reliability
  6. Custom Trained Models
  7. Healthcare and AI (91% MedQA, Multimodal)
  8. Autonomous Expansion
  9. Customer Service AI
  10. Better Robots
