Pedro Cuenca

pcuenq

pcuenca

AI & ML interests

None yet

Articles

PaliGemma – Google's Cutting-Edge Open Vision Language Model

6 days ago

• 95

License to Call: Introducing Transformers Agents 2.0

7 days ago

• 64

Powerful ASR + diarization + speculative decoding with Hugging Face Inference Endpoints

19 days ago

• 50

pcuenq's activity

upvoted 2 collections 3 days ago

PaliGemma Release

Collection

Pretrained and mix checkpoints for PaliGemma • 11 items • Updated 3 days ago • 90

PaliGemma FT Models

Collection

108 items • Updated 5 days ago • 10

upvoted an article 11 days ago

Article

SeeMoE: Implementing a MoE Vision Language Model from Scratch

13 days ago

• 24

upvoted a paper 13 days ago

End-to-End Object Detection with Transformers

Paper • 2005.12872 • Published May 26, 2020 • 3

upvoted a collection 17 days ago

Depth Anything Release

Collection

Depth Anything models, foundation models for monocular depth estimation, trained on 1.5 million labeled images and 62 million unlabeled images • 8 items • Updated Jan 26 • 6

upvoted an article 20 days ago

Article

🦙⚗️ Using Llama3 and distilabel to build fine-tuning datasets

23 days ago

• 54

upvoted 2 collections 26 days ago

OpenELM Instruct Models

Collection

4 items • Updated Apr 12 • 96

OpenELM Pretrained Models

Collection

4 items • Updated 26 days ago • 36

upvoted 3 articles about 1 month ago

Article

Fine-tune Llama 3 with ORPO

27 days ago

• 178

Article

Design choices for Vision Language Models in 2024

Apr 16

• 18

Article

Custom architectures with HuggingFace 🤗

28 days ago

• 20

upvoted 2 collections about 1 month ago

fuck quadratic attention

Collection

11 items • Updated 25 days ago • 19

CodeGemma Release

Collection

16 items • Updated 5 days ago • 58

upvoted 2 collections 3 months ago

Gemma release

Collection

Groups the Gemma models released by the Google team. • 40 items • Updated 5 days ago • 303

MobiLlama

Collection

Collection of MobiLlama Language Models. • 6 items • Updated 24 days ago • 14

upvoted a paper 3 months ago

Specialized Language Models with Cheap Inference from Limited Domain Data

Paper • 2402.01093 • Published Feb 2 • 45

upvoted 2 collections 4 months ago

Canonical models

Collection

This collection lists all the historical (pre-"Hub") canonical model checkpoints, i.e. repos that were not under an org or user namespace • 68 items • Updated Feb 13 • 13

AIM

Collection

AIM: Autoregressive Image Models • 5 items • Updated Jan 29 • 43

upvoted 3 papers 4 months ago

Scalable Pre-training of Large Autoregressive Image Models

Paper • 2401.08541 • Published Jan 16 • 35

CTRL: A Conditional Transformer Language Model for Controllable Generation

Paper • 1909.05858 • Published Sep 11, 2019 • 4

PIXART-δ: Fast and Controllable Image Generation with Latent Consistency Models

Paper • 2401.05252 • Published Jan 10 • 43

upvoted 2 collections 4 months ago

Zeroshot Classifiers

Collection

These are my current best zeroshot classifiers. Some of my older models are downloaded more often, but the models in this collection are newer/better. • 11 items • Updated Apr 3 • 77

SigLIP

Collection

Contrastive (sigmoid) image-text models from https://arxiv.org/abs/2303.15343 • 8 items • Updated 5 days ago • 24

upvoted a collection 5 months ago

Model Merging

Collection

Model Merging is a very popular technique nowadays in LLM. Here is a chronological list of papers on the space that will help you get started with it! • 28 items • Updated Mar 23 • 180

upvoted 4 papers 5 months ago

MobileVLM : A Fast, Reproducible and Strong Vision Language Assistant for Mobile Devices

Paper • 2312.16886 • Published Dec 28, 2023 • 18

LLM in a flash: Efficient Large Language Model Inference with Limited Memory

Paper • 2312.11514 • Published Dec 12, 2023 • 253

MobileSAMv2: Faster Segment Anything to Everything

Paper • 2312.09579 • Published Dec 15, 2023 • 20

QLoRA: Efficient Finetuning of Quantized LLMs

Paper • 2305.14314 • Published May 23, 2023 • 41

upvoted a collection 5 months ago

MoE

Collection

131 items • Updated 14 days ago • 16

upvoted a collection 6 months ago

Latent Consistency Models LoRAs

Collection

Latent Consistency Models for Stable Diffusion - LoRAs and full fine-tuned weights • 4 items • Updated Nov 10, 2023 • 95

upvoted a paper 6 months ago

Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference

Paper • 2310.04378 • Published Oct 6, 2023 • 19

upvoted 5 papers 7 months ago

Language Model Beats Diffusion -- Tokenizer is Key to Visual Generation

Paper • 2310.05737 • Published Oct 9, 2023 • 4

Matryoshka Diffusion Models

Paper • 2310.15111 • Published Oct 23, 2023 • 39

Data Filtering Networks

Paper • 2309.17425 • Published Sep 29, 2023 • 6

Zephyr: Direct Distillation of LM Alignment

Paper • 2310.16944 • Published Oct 25, 2023 • 116

DPM-Solver-v3: Improved Diffusion ODE Solver with Empirical Model Statistics

Paper • 2310.13268 • Published Oct 20, 2023 • 15

upvoted 2 collections 7 months ago

Historical - Spaces of the Week

Collection

All Spaces of the Week...from all weeks • 636 items • Updated Jan 17 • 19

LLM Leaderboard best models ❤️‍🔥

Collection

A daily uploaded list of models with best evaluations on the LLM leaderboard: • 70 items • Updated 3 days ago • 307

upvoted 3 collections 8 months ago

OS Week Highlights - Sept 25 - Oct 1

Collection

8 items • Updated Jan 17 • 4

⚖️ Showing Biases in ML Systems

Collection

9 items • Updated Feb 9 • 4

Recent models: last 100 repos, sorted by creation date

Collection

The last 100 repos I have created. Sorted by creation date descending, so the most recently created repos appear at the top. • 121 items • Updated Jan 31 • 446

upvoted 3 papers 9 months ago

High-Resolution Image Synthesis with Latent Diffusion Models

Paper • 2112.10752 • Published Dec 20, 2021 • 7

MVDream: Multi-view Diffusion for 3D Generation

Paper • 2308.16512 • Published Aug 31, 2023 • 99

Stay on topic with Classifier-Free Guidance

Paper • 2306.17806 • Published Jun 30, 2023 • 26

upvoted 4 papers 10 months ago

Pick-a-Pic: An Open Dataset of User Preferences for Text-to-Image Generation

Paper • 2305.01569 • Published May 2, 2023 • 2

HyperDreamBooth: HyperNetworks for Fast Personalization of Text-to-Image Models

Paper • 2307.06949 • Published Jul 13, 2023 • 49

SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis

Paper • 2307.01952 • Published Jul 4, 2023 • 73

Llama 2: Open Foundation and Fine-Tuned Chat Models

Paper • 2307.09288 • Published Jul 18, 2023 • 235

upvoted a paper 11 months ago

LEDITS: Real Image Editing with DDPM Inversion and Semantic Guidance

Paper • 2307.00522 • Published Jul 2, 2023 • 27

Articles

PaliGemma – Google's Cutting-Edge Open Vision Language Model

License to Call: Introducing Transformers Agents 2.0

Powerful ASR + diarization + speculative decoding with Hugging Face Inference Endpoints

Welcome Llama 3 - Meta's new open LLM

CodeGemma - an official Google release for code LLMs

Welcome Gemma - Google's new open LLM

Welcome Mixtral - a SOTA Mixture of Experts on Hugging Face

Mixture of Experts Explained

SDXL in 4 steps with Latent Consistency LoRAs

Accelerating Stable Diffusion XL Inference with JAX on Cloud TPU v5e

Inference for PROs

Introducing Würstchen: Fast Diffusion for Image Generation

Spread Your Wings: Falcon 180B is here

Code Llama: Llama 2 learns to code

Releasing Swift Transformers: Run On-Device LLMs in Apple Devices

Stable Diffusion XL on Mac with Advanced Core ML Quantization

Happy 1st anniversary 🤗 Diffusers!

Llama 2 is here - get it on Hugging Face

Faster Stable Diffusion with Core ML on iPhone, iPad, and Mac

The Falcon has landed in the Hugging Face ecosystem

Train your ControlNet with diffusers

Swift Diffusers: Fast Stable Diffusion for Mac

Using LoRA for Efficient Stable Diffusion Fine-Tuning

Using Stable Diffusion with Core ML on Apple Silicon

Hugging Face Machine Learning Demos on arXiv

Training Stable Diffusion with Dreambooth using 🧨 Diffusers

Stable Diffusion in JAX/Flax 🚀

Stable Diffusion with 🧨 Diffusers
