Spaces: jadechoghari/ferret-demo

jadechoghari committed 13 days ago

Commit ee59173 • 1 Parent(s): 35f0b0b

remove users images and update readme

Files changed (6)

Screenshot 2024-10-24 at 19.50.06.png +0 -3
Screenshot 2024-10-24 at 19.52.12.png +0 -3
app.py +10 -1
clipboard.png +0 -3
Новый проект 1.png +0 -3
Новый проект 4.png +0 -3

Screenshot 2024-10-24 at 19.50.06.png DELETED

Git LFS Details

SHA256: eb801c2ac0507516e531bbb8db454cb2687f45deef50a9762263dd81e89b6e40
Pointer size: 131 Bytes
Size of remote file: 164 kB
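
The "Pointer size" figure refers to the small text file that Git LFS stores in the repository in place of the image. A minimal sketch of that pointer layout, following the Git LFS v1 pointer format (version line, `oid` line, `size` line), is below. The helper name `lfs_pointer` is hypothetical, and the byte count `164_000` is a stand-in: the listing only shows "164 kB", not the exact size.

```python
def lfs_pointer(sha256_hex: str, size_bytes: int) -> str:
    """Build the text of a Git LFS pointer file (spec v1 layout)."""
    return (
        "version https://git-lfs.github.com/spec/v1\n"
        f"oid sha256:{sha256_hex}\n"
        f"size {size_bytes}\n"
    )


# SHA256 copied from the listing above. The exact byte count is NOT shown
# there (only "164 kB"), so 164_000 is an assumed six-digit placeholder.
SHA = "eb801c2ac0507516e531bbb8db454cb2687f45deef50a9762263dd81e89b6e40"
pointer = lfs_pointer(SHA, 164_000)

# The pointer length depends only on the number of digits in the size,
# because the version line and the 64-character hash are fixed-width.
print(len(pointer.encode("utf-8")))
```

Under this layout a six-digit size gives a 131-byte pointer and a seven-digit size gives 132 bytes, which is consistent with the pointer sizes listed for the kB-range files and the 1.86 MB file in this commit.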

Screenshot 2024-10-24 at 19.52.12.png DELETED

Git LFS Details

SHA256: e4f8f93116263d90b912bd6012780d6503f416121026585f28e996f2010fd463
Pointer size: 131 Bytes
Size of remote file: 147 kB

app.py CHANGED

@@ -72,7 +72,7 @@ html = f"""
                 style='width: 80px; height: 80px; border-radius: 20px; box-shadow: 0px 8px 16px rgba(0, 0, 0, 0.2);'/>
             <div style="margin-left: 15px;">
                 <h1 style="font-size: 2.8em; font-family: -apple-system, BlinkMacSystemFont, sans-serif; color: #1D1D1F;
-                font-weight: bold; margin-bottom: 0;"> {model_name}</h1>
+                font-weight: bold; margin-bottom: 0;"> {model_name}</h1>
                 <p style="font-size: 1.2em; color: #6e6e73; font-family: -apple-system, BlinkMacSystemFont, sans-serif; margin-top: 5px;">
                     📱 Grounded Mobile UI Understanding with Multimodal LLMs.<br>
                     A new MLLM tailored for enhanced understanding of mobile UI screens, equipped with referring, grounding, and reasoning capabilities.
@@ -123,6 +123,15 @@ with gr.Blocks(title=model_name) as demo:
             temperature_input = gr.Slider(minimum=0.1, maximum=2.0, step=0.1, value=0.2, label="Temperature")
             top_p_input = gr.Slider(minimum=0.0, maximum=1.0, step=0.05, value=0.7, label="Top P")
             max_new_tokens_input = gr.Slider(minimum=1, maximum=1024, step=1, value=512, label="Max New Tokens")
+            gr.Examples(
+                examples=[
+                    [{"image": "appstore_reminders.png"}, "Describe the contents inside the box"],
+                    [{"image": "appstore_reminders.png"}, "What is the text shown inside the highlighted area"]
+                ],
+                inputs=[image_input, text_input],
+                label="Try these examples"
+            )
         with gr.Column(scale=7):
             chatbot = gr.Chatbot(
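
Because this commit deletes several images while the new `gr.Examples` rows reference `appstore_reminders.png`, it can be useful to confirm that every example image still exists in the repository. The sketch below is one way to do that in plain Python; `missing_example_images` is a hypothetical helper that is not part of `app.py` or Gradio, and it assumes the image cell is either a dict with an `"image"` key (as in the diff) or a bare path string.

```python
from pathlib import Path

# Example rows copied from the diff above: [image cell, prompt text].
EXAMPLES = [
    [{"image": "appstore_reminders.png"}, "Describe the contents inside the box"],
    [{"image": "appstore_reminders.png"}, "What is the text shown inside the highlighted area"],
]


def missing_example_images(examples, root="."):
    """Return the image paths referenced by example rows that are absent under root.

    Hypothetical helper: assumes the first cell of each row is either a dict
    with an "image" key or a plain path string.
    """
    missing = []
    for row in examples:
        cell = row[0]
        name = cell["image"] if isinstance(cell, dict) else cell
        # Report each missing file once, in first-seen order.
        if not (Path(root) / name).is_file() and name not in missing:
            missing.append(name)
    return missing
```

Calling `missing_example_images(EXAMPLES, ".")` from the repository root returns an empty list when every referenced image is present.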

clipboard.png DELETED

Git LFS Details

SHA256: f3fc080eeb0ae164b8eea11cd10c04675c42c7730d65bee8332114a76c846c3d
Pointer size: 131 Bytes
Size of remote file: 454 kB

Новый проект 1.png DELETED

Git LFS Details

SHA256: c5f149f1f5df58f11ca930e4b6450a680e31b93235ad4d3a753da36e74dba7df
Pointer size: 132 Bytes
Size of remote file: 1.86 MB

Новый проект 4.png DELETED

Git LFS Details

SHA256: dd8d80280fadf1fb650840217c59e0b7e5029a183ee4aff87b924034bdde0b10
Pointer size: 131 Bytes
Size of remote file: 801 kB