Alvaro Bartolome PRO

alvarobartt

https://alvarobartt.me

AI & ML interests

machine learning @huggingface

Recent Activity

liked a model 5 days ago

agentica-org/DeepCoder-14B-Preview

new activity 6 days ago

jinaai/jina-reranker-v1-tiny-en:Update `config.json` to make it compatible with TEI

new activity 7 days ago

Alibaba-NLP/gte-multilingual-reranker-base:not being able to run in TEI

Posts 6

Post

3043

🔥 Agents can do anything! @microsoft Research just announced the release of Magma 8B!

Magma is a new 8B-parameter Visual Language Model (VLM) for multi-modal agents, designed to handle complex interactions across virtual and real environments, and it's MIT licensed!

Magma's highlights:
- Introduces the Set-of-Mark and Trace-of-Mark techniques for fine-tuning
- Leverages a large amount of unlabeled video data to learn spatial-temporal grounding and planning
- Generalizes well and can be fine-tuned for other agentic tasks
- Achieves SOTA results on multi-modal benchmarks spanning UI navigation, robotics manipulation, image/video understanding, and spatial understanding and reasoning
- Generates goal-driven visual plans and actions for agentic use cases

Model: microsoft/Magma-8B
Technical Report: Magma: A Foundation Model for Multimodal AI Agents (2502.13130)

Articles 9

Article

4

🤗 Serve any model with Inference Endpoints + Custom Handlers

Collections 8

spaces 1

Running on Zero

FLUX.1 Studio Ghibli LoRA

Generate Studio Ghibli-style images from text prompts

models 23

alvarobartt/safetensors

alvarobartt/paligemma-2-ft-vqa

alvarobartt/SmolVLM-Instruct-Handler

Image-Text-to-Text • Updated Dec 4, 2024 • 5

alvarobartt/NVLM-D-72B-IE-compatible

Image-Text-to-Text • Updated Nov 19, 2024 • 5

alvarobartt/ghibli-characters-flux-lora

Text-to-Image • Updated Nov 19, 2024 • 1.58k • 54

alvarobartt/ghibli-characters-sd3.5-lora

Text-to-Image • Updated Nov 19, 2024 • 79 • 10

alvarobartt/bert-base-multilingual-cased-ner-spanish

Token Classification • Updated Sep 2, 2024 • 56 • 2

alvarobartt/mistral-7b-orpo-airoboros-pref-10k

Text Generation • Updated Mar 28, 2024 • 6

alvarobartt/mistral-7b-orpo-alignment-handbook

Text Generation • Updated Mar 27, 2024 • 3

alvarobartt/mistral-orpo-mix-b0.05-l1024-pl512-lr5e-7-cosine

Text Generation • Updated Mar 26, 2024 • 3

datasets 55

alvarobartt/Magicoder-Vicuna-1.0

Viewer • Updated Nov 20, 2024 • 75.2k • 21

alvarobartt/SQL-OAI

Viewer • Updated Sep 26, 2024 • 106k • 28

alvarobartt/Magicoder-OAI

Viewer • Updated Sep 25, 2024 • 75.2k • 21

alvarobartt/ghibli-characters

Viewer • Updated Sep 1, 2024 • 9 • 273 • 7

alvarobartt/Capybara-Preferences-Tiny

Viewer • Updated May 14, 2024 • 10 • 28

alvarobartt/replacing-judges-with-juries-distilabel

Viewer • Updated May 8, 2024 • 100 • 83 • 3

alvarobartt/prometheus-eval-distilabel-default

Viewer • Updated May 7, 2024 • 2 • 64

alvarobartt/prometheus-eval-distilabel-ratings

Viewer • Updated May 7, 2024 • 2 • 30

alvarobartt/prometheus-eval-distilabel-generation

Viewer • Updated May 7, 2024 • 2 • 45

alvarobartt/prometheus-eval-distilabel-index

Viewer • Updated May 7, 2024 • 2 • 76