File size: 2,668 Bytes
712ecd8
ded1f00
 
712ecd8
ded1f00
 
 
 
 
 
 
 
be005ad
ded1f00
 
 
 
 
 
 
712ecd8
 
ded1f00
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
---
base_model: THUDM/CogVideoX-5b
datasets: modal-labs/dissolve
library_name: diffusers
license: other
license_link: https://huggingface.co./THUDM/CogVideoX-5b/blob/main/LICENSE
instance_prompt: PIKA DISSOLVE A pristine snowglobe featuring a winter scene sits peacefully. The
  globe violently explodes, sending glass, water, and glittering fake snow in all
  directions. The scene is captured with high-speed photography.
widget:
- text: PIKA_DISSOLVE A meticulously detailed, tea cup, sits centrally on a dark brown circular pedestal. The cup, seemingly made of clay, begins to dissolve from the bottom up. The disintegration process is rapid but not explosive, with a cloud of fine, light tan dust forming and rising in a swirling, almost ethereal column that expands outwards before slowly descending. The dust particles are individually visible as they float, and the overall effect is one of delicate disintegration rather than shattering. Finally, only the empty pedestal and the intricately patterned marble floor remain.
  output:
    url: https://huggingface.co./sayakpaul/pika-dissolve-v0/resolve/main/assets/output_cup.mp4
tags:
- text-to-video
- diffusers-training
- diffusers
- cogvideox
- cogvideox-diffusers
- template:sd-lora
---

This is a fine-tune of the [THUDM/CogVideoX-5b](https://huggingface.co./THUDM/CogVideoX-5b) model on the [modal-labs/dissolve](https://huggingface.co./datasets/modal-labs/dissolve) dataset.

Code: https://github.com/a-r-r-o-w/finetrainers

Inference code:

```py
from diffusers import CogVideoXTransformer3DModel, DiffusionPipeline 
from diffusers.utils import export_to_video
import torch 

transformer = CogVideoXTransformer3DModel.from_pretrained(
    "sayakpaul/pika-dissolve-v0", torch_dtype=torch.bfloat16
)
pipeline = DiffusionPipeline.from_pretrained(
    "THUDM/CogVideoX-5b", transformer=transformer, torch_dtype=torch.bfloat16
).to("cuda")

prompt = """
PIKA DISSOLVE A slender glass vase, brimming with tiny white pebbles, stands centered on a polished ebony dais. Without warning, the glass begins to dissolve from the edges inward. Wisps of translucent dust swirl upward in an elegant spiral, illuminating each pebble as they drop onto the dais. The gently drifting dust eventually settles, leaving only the scattered stones and faint traces of shimmering powder on the stage.
"""
negative_prompt = "inconsistent motion, blurry motion, worse quality, degenerate outputs, deformed outputs"

video = pipeline(
    prompt=prompt, 
    negative_prompt=negative_prompt, 
    num_frames=81, 
    height=512,
    width=768,
    num_inference_steps=50
).frames[0]
export_to_video(video, "output_vase.mp4", fps=25)
```