Konstantinos Kakkavas

kkakkavas

AI & ML interests

- NLP
- CV
- docVQA

kkakkavas's activity

upvoted 3 papers about 2 months ago

Learn-by-interact: A Data-Centric Framework for Self-Adaptive Agents in Realistic Environments

Paper • 2501.10893 • Published Jan 18 • 24

Mobile-Agent-E: Self-Evolving Mobile Assistant for Complex Tasks

Paper • 2501.11733 • Published Jan 20 • 28

UI-TARS: Pioneering Automated GUI Interaction with Native Agents

Paper • 2501.12326 • Published Jan 21 • 53

upvoted 2 papers 2 months ago

2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining

Paper • 2501.00958 • Published Jan 1 • 100

LLaVA-Mini: Efficient Image and Video Large Multimodal Models with One Vision Token

Paper • 2501.03895 • Published Jan 7 • 50

liked a model 4 months ago

Xkev/Llama-3.2V-11B-cot

Image-Text-to-Text • Updated Dec 16, 2024 • 4.19k • 147

liked a Space 6 months ago

KIE Engines Comparison

📉

updated a Space 6 months ago

KIE Engines Comparison

📉

liked 2 models 8 months ago

bartowski/Meta-Llama-3-8B-Instruct-GGUF

Text Generation • Updated Apr 29, 2024 • 5.31k • 94

naver-clova-ocr/bros-base-uncased

Feature Extraction • Updated Apr 5, 2022 • 45.2k • 18

liked a dataset 8 months ago

lmms-lab/DocVQA

Viewer • Updated Apr 18, 2024 • 16.6k • 10.3k • 32

liked a Space 8 months ago

Groq-LLaMA3.x

📚

Groq & Llama3.x updated

updated a Space 8 months ago

Sennodipoi LayoutLMv3 KleisterNDA

🌍

liked a Space 8 months ago

159

DocOwl

📚

upvoted a collection 9 months ago

Table Transformer

Collection

The Table Transformer (TATR) is a series of object detection models useful for table extraction from PDF images. • 5 items • Updated Jan 8 • 23

liked 2 models 9 months ago

mPLUG/DocOwl1.5

Updated Apr 10, 2024 • 71 • 26

JinghuiLuAstronaut/DocLLM_baichuan2_7b

Text Generation • Updated Feb 29, 2024 • 125 • 5

updated a Space 10 months ago

README

🏃