File size: 4,843 Bytes

f4b48c7
 
 
 
 
 
 
 
 
 
 
 
 
305e25f
f4b48c7
 
2ab3b80
e09f03c
 
 
 
 
5ad73d9
305e25f
 
 
 
 
 
 
2ab3b80
305e25f
 
 
 
 
 
 
 
 
2ab3b80
4cf2d51
f4b48c7
 
 
 
 
 
 
2b739c3
62e73c7
 
f4b48c7
 
 
2417849
f4b48c7
 
e4952b9
 
f4b48c7
324e481
fcbf454
324e481
 
f4b48c7
3953a3a
f4b48c7
 
2ab3b80

---
license: other
license_name: nvidia-open-model-license
license_link: >-
  https://www.nvidia.com/en-us/agreements/enterprise-software/nvidia-open-model-license/
language:
- en
base_model:
- nvidia/Cosmos-1.0-Diffusion-7B-Video2World
- nvidia/Cosmos-1.0-Diffusion-7B-Text2World
pipeline_tag: image-to-video
tags:
- text-to-video
- video-to-video
- nvidia
- gguf-node
widget:
- text: "A crystalline waterfall stands partially frozen, its edges draped with translucent ice that catches the sunlight in prisms of blue and silver. Below, a half-frozen pool spreads out, bordered by delicate ice formations. Through the fresh snow, a red fox moves gracefully, its russet coat vibrant against the white landscape, leaving perfect star-shaped prints behind as steam rises from its breath in the crisp winter air. The scene is wrapped in snow-muffled silence, broken only by the gentle murmur of water still flowing beneath the ice.
"
  parameters:
    negative_prompt: "The video captures a series of frames showing ugly scenes, static with no motion, motion blur, over-saturation, shaky footage, low resolution, grainy texture, pixelated images, poorly lit areas, underexposed and overexposed scenes, poor color balance, washed out colors, choppy sequences, jerky movements, low frame rate, artifacting, color banding, unnatural transitions, outdated special effects, fake elements, unconvincing visuals, poorly edited content, jump cuts, visual noise, and flickering. Overall, the video is of poor quality."
  output:
    url: samples\ComfyUI_00002_.webp
- text: >-
    anime style anime girl with massive fennec ears and one big fluffy tail, she
    has blonde long hair blue eyes wearing a maid outfit with a long black gold
    leaf pattern dress, walking slowly to the front with sweetie smile, holding
    a fancy black forest cake with candles on top in the kitchen of an old dark
    Victorian mansion lit by candlelight with a bright window to the foggy
    forest
  parameters:
    negative_prompt: >-
      The video captures a series of frames showing ugly scenes, static with no
      motion, motion blur, over-saturation, shaky footage, low resolution,
      grainy texture, pixelated images, poorly lit areas, underexposed and
      overexposed scenes, poor color balance, washed out colors, choppy
      sequences, jerky movements, low frame rate, artifacting, color banding,
      unnatural transitions, outdated special effects, fake elements,
      unconvincing visuals, poorly edited content, jump cuts, visual noise, and
      flickering. Overall, the video is of poor quality.
  output:
    url: samples\ComfyUI_00001_.webp
---

# **fp8 scaled version of video2world and text2world**

![screenshot](https://raw.githubusercontent.com/calcuis/comfy/master/cosmos.gif)

## **setup (once)**
- drag Cosmos-1_0-Diffusion-7B-Video2World_fp8_e4m3fn.safetensors [[7.24GB](https://huggingface.co./calcuis/cosmos/blob/main/Cosmos-1_0-Diffusion-7B-Video2World_fp8_e4m3fn.safetensors)] or/and Cosmos-1_0-Diffusion-7B-Text2World_fp8_e4m3fn.safetensors [[7.24GB](https://huggingface.co./calcuis/cosmos/blob/main/Cosmos-1_0-Diffusion-7B-Text2World_fp8_e4m3fn.safetensors)] to > ./ComfyUI/models/diffusion_models
- drag oldt5_xxl_fp8_e4m3fn.safetensors [[4.9GB](https://huggingface.co./calcuis/cosmos/blob/main/oldt5_xxl_fp8_e4m3fn.safetensors)] to > ./ComfyUI/models/text_encoders
- drag cosmos_cv8x8x8_1.0_vae_bf16.safetensors [[211MB](https://huggingface.co./calcuis/cosmos/blob/main/cosmos_cv8x8x8_1.0_vae_bf16.safetensors)] to > ./ComfyUI/models/vae

## **run it straight (no installation needed way)**
- run the .bat file in the main directory (assuming you are using the gguf-node [pack](https://github.com/calcuis/gguf/releases) below)
- drag the workflow json file (below), or the sample webp file, to > your browser

### **workflow**
- example workflow for [text2world](https://huggingface.co./calcuis/cosmos/blob/main/workflow-text2world.json)
- example workflow for [video2world](https://huggingface.co./calcuis/cosmos/blob/main/workflow-video2world.json)

### **review**
- working roughly; but not very stable/consistent for the time being
- gguf file might not work; if so, please wait for the code update

### **reference**
- base model from [nvidia](https://huggingface.co./nvidia) (text2world:[7b](https://huggingface.co./nvidia/Cosmos-1.0-Diffusion-7B-Text2World)|[14b](https://huggingface.co./nvidia/Cosmos-1.0-Diffusion-14B-Text2World) & video2world:[7b](https://huggingface.co./nvidia/Cosmos-1.0-Diffusion-7B-Video2World)|[14b](https://huggingface.co./nvidia/Cosmos-1.0-Diffusion-14B-Video2World))
- comfyui from [comfyanonymous](https://github.com/comfyanonymous/ComfyUI)
- gguf-node ([pypi](https://pypi.org/project/gguf-node)|[repo](https://github.com/calcuis/gguf)|[pack](https://github.com/calcuis/gguf/releases))

<Gallery />