Tomi Toivio

Ukuli

TomiToivio

AI & ML interests

Stable Diffusion, NLP, OpenCV etc.

Ukuli's activity

liked a model 1 day ago

microsoft/Phi-4-multimodal-instruct

Automatic Speech Recognition • Updated about 3 hours ago • 7.35k • 513

liked 2 models about 1 month ago

deepseek-ai/Janus-Pro-7B

Any-to-Any • Updated 27 days ago • 476k • 3.15k

MCG-NJU/videomae-base

Video Classification • Updated Mar 29, 2024 • 443k • 40

liked a model 2 months ago

Qwen/QVQ-72B-Preview

Image-Text-to-Text • Updated Jan 12 • 164k • 549

liked 2 models 3 months ago

kadirnar/Llama3.3-70b-Vision

Text Generation • Updated Dec 7, 2024 • 79 • 6

mistralai/Pixtral-12B-2409

Image-Text-to-Text • Updated Dec 26, 2024 • 614

liked 3 models 4 months ago

liked 2 Spaces 4 months ago

Sym

🖼

Generate images from text prompts

Aa

🌍

liked 2 models 4 months ago

lmms-lab/LLaVA-NeXT-Video-7B

Video-Text-to-Text • Updated 7 days ago • 908 • 43

lmms-lab/llava-onevision-qwen2-72b-ov-chat

Image-Text-to-Text • Updated Oct 9, 2024 • 1.33k • 8

liked 2 Spaces 4 months ago

MEGA-Bench Leaderboard

🥇

A leaderboard for multimodal models

Llava Video

🌋

Interact with videos!

liked a model 4 months ago

OpenGVLab/InternVL2-8B

Image-Text-to-Text • Updated 24 days ago • 61.6k • 167

liked a Space 4 months ago

543

Vision Arena (Testing VLMs side-by-side)

🖼

Analyze images to detect and label objects

liked 3 models 4 months ago

meta-llama/Llama-3.2-3B-Instruct

Text Generation • Updated Oct 24, 2024 • 2.36M • 1.13k

THUDM/CogVideoX-5b

Text-to-Video • Updated Nov 23, 2024 • 91k • 587

stabilityai/stable-diffusion-3.5-large

Text-to-Image • Updated Oct 22, 2024 • 157k • 2.35k