Federico Minutoli

DiTo97

DiTo97

AI & ML interests

anything machine learning. I am strongly passionate in computer vision and robotics, and how machine learning will help achieve autonomous behavior, perception and continuous learning.

Recent Activity

upvoted an article 13 days ago

Deriving DPO's Loss

upvoted a paper 21 days ago

Apollo: An Exploration of Video Understanding in Large Multimodal Models

new activity about 2 months ago

scrapegraphai/AQL-v1-QA:[bot] Conversion to Parquet

View all activity

Organizations

DiTo97's activity

upvoted an article 13 days ago

Article

Deriving DPO's Loss

•

13 days ago

• 24

upvoted a paper 21 days ago

Apollo: An Exploration of Video Understanding in Large Multimodal Models

Paper • 2412.10360 • Published 24 days ago • 136

New activity in scrapegraphai/AQL-v1-QA about 2 months ago

[bot] Conversion to Parquet

#1 opened 3 months ago by

parquet-converter

upvoted a paper 2 months ago

WebRL: Training LLM Web Agents via Self-Evolving Online Curriculum Reinforcement Learning

Paper • 2411.02337 • Published Nov 4, 2024 • 35

upvoted a paper 5 months ago

xGen-MM (BLIP-3): A Family of Open Large Multimodal Models

Paper • 2408.08872 • Published Aug 16, 2024 • 98

updated 4 models 5 months ago

updated 2 models 6 months ago

scrapegraphai/SERP-reranker-0.5B-4k-GGUF

Updated Jun 25, 2024 • 143

scrapegraphai/SERP-reranker-0.5B-4k-adapter

Updated Jun 25, 2024

updated a dataset 7 months ago

scrapegraphai/AQL-v1-QA

Viewer • Updated Jun 25, 2024 • 8.76k • 29

upvoted an article 8 months ago

Article

License to Call: Introducing Transformers Agents 2.0

May 13, 2024

• 122

upvoted a paper 8 months ago

LEGENT: Open Platform for Embodied Agents

Paper • 2404.18243 • Published Apr 28, 2024 • 21

upvoted 6 papers 11 months ago

FiT: Flexible Vision Transformer for Diffusion Model

Paper • 2402.12376 • Published Feb 19, 2024 • 48

Mixtures of Experts Unlock Parameter Scaling for Deep RL

Paper • 2402.08609 • Published Feb 13, 2024 • 34

BlackMamba: Mixture of Experts for State-Space Models

Paper • 2402.01771 • Published Feb 1, 2024 • 23

SERL: A Software Suite for Sample-Efficient Robotic Reinforcement Learning

Paper • 2401.16013 • Published Jan 29, 2024 • 23

Proactive Detection of Voice Cloning with Localized Watermarking

Paper • 2401.17264 • Published Jan 30, 2024 • 17

From GPT-4 to Gemini and Beyond: Assessing the Landscape of MLLMs on Generalizability, Trustworthiness and Causality through Four Modalities

Paper • 2401.15071 • Published Jan 26, 2024 • 35