FernandoGuerraVA
commited on
Update README.md
Browse files
README.md
CHANGED
@@ -1,3 +1,30 @@
|
|
1 |
---
|
2 |
license: apache-2.0
|
3 |
---
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
---
|
2 |
license: apache-2.0
|
3 |
---
|
4 |
+
Motion LoRA Model Overview for Vertigo Effect Simulation
|
5 |
+
This document provides a comprehensive overview of a specialized Motion LoRA model designed for simulating dynamic scenes with an emphasis on the vertigo effect using a dolly zoom technique. This model is built for researchers and developers with interests in advanced motion models, particularly for cinematic effects like the one famously used in "Jaws".
|
6 |
+
|
7 |
+
License and User Agreement
|
8 |
+
Apache License 2.0: This model is released under the Apache License 2.0, and users must comply with its terms.
|
9 |
+
Commercial Usage Disclaimer: Users assume responsibility for any commercial legal disputes related to the use of this model.
|
10 |
+
Consent by Installation: By installing this model, users agree to the stated terms and conditions.
|
11 |
+
Model Demonstrations
|
12 |
+
Showcase: Demonstrations of the model's capabilities are available on YouTube and Twitter.
|
13 |
+
Specific Configuration for the Vertigo Effect
|
14 |
+
Model Training Setup:
|
15 |
+
|
16 |
+
Resolution: 512x384, optimized for capturing the intricate details of the vertigo effect.
|
17 |
+
LoRA Rank: Utilizes a 64 LoRA rank to enhance learning efficiency and model adaptability.
|
18 |
+
Training Duration: Implements 16 frames across 5 sequences with a progression marked by 200 and 300-step checkpoints for detailed refinement.
|
19 |
+
Video Input: Trained with a singular clip, focused on accurately reproducing the vertigo effect through detailed visual inputs.
|
20 |
+
Training Configuration:
|
21 |
+
|
22 |
+
Training Model: realisticvisionv51, a highly advanced base model known for generating realistic and detailed visuals.
|
23 |
+
Training Prompt: "A man, vertigo effect, dolly zoom," directing the model to specifically learn and replicate the dynamic dolly zoom effect associated with the vertigo sensation.
|
24 |
+
Text Configuration: Utilizes tailored text configurations to precisely guide the model in generating footage that showcases the vertigo effect with a dolly zoom on a man.
|
25 |
+
Model Specifications and Optimization:
|
26 |
+
|
27 |
+
Inference Model: Consistent with the training model to ensure high fidelity in output.
|
28 |
+
Additional Parameters: use_offset_noise is activated to introduce realistic variations and enhance the authenticity of the vertigo effect.
|
29 |
+
Learning Rate Adjustments: The learning rate, spatial learning rate, and Adam weight decay are dynamically adjusted between 5x to 20x the original rate based on the dataset size, ensuring optimal learning efficiency.
|
30 |
+
Workflow Compatibility: Designed to integrate seamlessly with the Animatediff workflow, with recommended weights set between 0.5 and 1 to balance between visual quality and computational demand.
|