Konstantinos Kakkavas's picture

6 9

Konstantinos Kakkavas

kkakkavas

kkakkavas

AI & ML interests

- NLP - CV - docVQA

Recent Activity

upvoted a paper 3 days ago

Learn-by-interact: A Data-Centric Framework for Self-Adaptive Agents in Realistic Environments

upvoted a paper 3 days ago

Mobile-Agent-E: Self-Evolving Mobile Assistant for Complex Tasks

upvoted a paper 3 days ago

UI-TARS: Pioneering Automated GUI Interaction with Native Agents

View all activity

Organizations

kkakkavas's activity

upvoted 3 papers 3 days ago

Learn-by-interact: A Data-Centric Framework for Self-Adaptive Agents in Realistic Environments

Paper • 2501.10893 • Published 7 days ago • 22

Mobile-Agent-E: Self-Evolving Mobile Assistant for Complex Tasks

Paper • 2501.11733 • Published 5 days ago • 25

UI-TARS: Pioneering Automated GUI Interaction with Native Agents

Paper • 2501.12326 • Published 4 days ago • 45

upvoted 2 papers 17 days ago

2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining

Paper • 2501.00958 • Published 24 days ago • 98

LLaVA-Mini: Efficient Image and Video Large Multimodal Models with One Vision Token

Paper • 2501.03895 • Published 18 days ago • 48

upvoted a collection 7 months ago

Table Transformer

The Table Transformer (TATR) is a series of object detection models useful for table extraction from PDF images. • 5 items • Updated 17 days ago • 20