Federico Cocchi's picture

2 8

Federico Cocchi

fede97

·

https://federico1-creator.github.io/Federico_Cocchi/

federico1-creator

AI & ML interests

Multimodal LLM - Computer Vision

Recent Activity

new activity 5 days ago

itserr/LatinGPT_alpha-01:Update app.py

updated a Space 11 days ago

itserr/LatinGPT_alpha-01

updated a collection 11 days ago

View all activity

Organizations

fede97's activity

New activity in itserr/LatinGPT_alpha-01 5 days ago

Update app.py

#1 opened 5 days ago by

updated a Space 11 days ago

LatinGPT

LatinGPT

updated a collection 11 days ago

ReflectiVA

Models and data for ReflectiVA: Augmenting Multimodal LLMs with Self-Reflective Tokens for Knowledge-based Visual Question Answering [CVPR 2025] • 2 items • Updated 11 days ago

updated a dataset 11 days ago

aimagelab/ReflectiVA-Data

Preview • Updated 11 days ago • 43

published a dataset 11 days ago

aimagelab/ReflectiVA-Data

Preview • Updated 11 days ago • 43

liked a Space 11 days ago

AI Deadlines

Schedule tasks efficiently using AI-generated deadlines

authored a paper 12 days ago

LLaVA-MORE: A Comparative Study of LLMs and Visual Backbones for Enhanced Visual Instruction Tuning

Paper • 2503.15621 • Published 17 days ago

updated a model 4 months ago

aimagelab/CoDE

Image Feature Extraction • Updated Dec 12, 2024 • 1.36k • 2

authored a paper 4 months ago

Augmenting Multimodal LLMs with Self-Reflective Tokens for Knowledge-based Visual Question Answering

Paper • 2411.16863 • Published Nov 25, 2024

updated a model 4 months ago

aimagelab/ReflectiVA

Image-Text-to-Text • Updated Nov 27, 2024 • 48 • 2

updated a collection 4 months ago

ELSA EU Project

Dataset and models created inside the ELSA – European Lighthouse on Secure and Safe AI project on Multimedia use case. • 4 items • Updated Nov 25, 2024

updated a collection 7 months ago

LLaVA-MORE

LLaVA-MORE: Enhancing Visual Instruction Tuning with LLaMA 3.1 • 2 items • Updated Aug 31, 2024

New activity in aimagelab/LLaVA_MORE-llama_3_1-8B-S2-siglip-finetuning 7 months ago

checkpoint you are trying to load has model type `llava_llama` but Transformers does not recognize this architecture

#1 opened 8 months ago by

updated a model 8 months ago

aimagelab/LLaVA_MORE-llama_3_1-8B-S2-siglip-finetuning

Image-Text-to-Text • Updated Aug 16, 2024 • 6 • 2

updated a collection 8 months ago

LLaVA-MORE

LLaVA-MORE: A Comparative Study of LLMs and Visual Backbones for Enhanced Visual Instruction Tuning • 8 items • Updated 10 days ago • 2