1608 543 1457

Julien Chaumond PRO

julien-c

https://huggingface.co

julien-c

AI & ML interests

<3 ML/AI for everyone, building products to propel communities fwd

Articles

Hugging Face partners with Wiz Research to Improve AI Security

Apr 4

• 11

Introducing Storage Regions on the HF Hub

Nov 3, 2023

Hugging Face Selected for the French Data Protection Agency Enhanced Support Program

May 15, 2023

How to train a new language model from scratch using Transformers and Tokenizers

Feb 14, 2020

• 9

Organizations

julien-c's activity

upvoted a paper 3 days ago

Phased Consistency Model

Paper • 2405.18407 • Published 4 days ago • 33

upvoted a paper 4 days ago

NV-Embed: Improved Techniques for Training LLMs as Generalist Embedding Models

Paper • 2405.17428 • Published 5 days ago • 12

upvoted 2 articles 5 days ago

Article

Build AI on premise with Dell Enterprise Hub

12 days ago

• 13

Article

Falcon 2: An 11B parameter pretrained language model and VLM, trained on over 5000B tokens tokens and 11 languages

9 days ago

• 12

upvoted an article 10 days ago

Article

Deploy models on AWS Inferentia2 from Hugging Face

11 days ago

• 12

upvoted a paper 10 days ago

Diffusion for World Modeling: Visual Details Matter in Atari

Paper • 2405.12399 • Published 12 days ago • 25

upvoted an article 11 days ago

Article

From cloud to developers: Hugging Face and Microsoft Deepen Collaboration

12 days ago

• 8

upvoted a paper 12 days ago

The Hallucinations Leaderboard -- An Open Effort to Measure Hallucinations in Large Language Models

Paper • 2404.05904 • Published Apr 8 • 3

upvoted a paper 17 days ago

Hunyuan-DiT: A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding

Paper • 2405.08748 • Published 18 days ago • 17

upvoted a collection 17 days ago

Embedding Model Datasets

Collection

A curated subset of the datasets that work out of the box with Sentence Transformers: https://huggingface.co/datasets?other=sentence-transformers • 51 items • Updated 7 days ago • 24

upvoted an article 18 days ago

Article

Introducing the Open Arabic LLM Leaderboard

19 days ago

• 47

upvoted a collection 18 days ago

PaliGemma Release

Collection

Pretrained and mix checkpoints for PaliGemma • 11 items • Updated 15 days ago • 103

upvoted an article 18 days ago

Article

PaliGemma – Google's Cutting-Edge Open Vision Language Model

19 days ago

• 131

upvoted an article 19 days ago

Article

License to Call: Introducing Transformers Agents 2.0

20 days ago

• 88

upvoted an article 22 days ago

Article

Subscribe to Enterprise Hub with your AWS Account

24 days ago

• 6

upvoted 4 collections 23 days ago

upvoted 2 papers 25 days ago

CommonCanvas: An Open Diffusion Model Trained with Creative-Commons Images

Paper • 2310.16825 • Published Oct 25, 2023 • 28

KAN: Kolmogorov-Arnold Networks

Paper • 2404.19756 • Published Apr 30 • 96

upvoted 3 papers 26 days ago

Octopus v4: Graph of language models

Paper • 2404.19296 • Published Apr 30 • 97

LoRA Land: 310 Fine-tuned LLMs that Rival GPT-4, A Technical Report

Paper • 2405.00732 • Published Apr 29 • 115

Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models

Paper • 2405.01535 • Published about 1 month ago • 102

upvoted a collection 29 days ago

MetaAI's CodeLlama - Coding Assistant LLM

Collection

Fast, small, and capable coding model you can run locally on your computer! Requires 8GB+ of RAM. • 4 items • Updated Sep 8, 2023 • 5

upvoted 3 papers about 1 month ago

CatLIP: CLIP-level Visual Recognition Accuracy with 2.7x Faster Pre-training on Web-scale Image-Text Data

Paper • 2404.15653 • Published Apr 24 • 24

A Careful Examination of Large Language Model Performance on Grade School Arithmetic

Paper • 2405.00332 • Published May 1 • 24

Personalize Segment Anything Model with One Shot

Paper • 2305.03048 • Published May 4, 2023 • 6

upvoted 2 collections about 1 month ago

git-theta

Collection

Playing with git-theta: https://github.com/r-three/git-theta • 2 items • Updated Apr 30 • 1

Pile-T5

Collection

T5 trained on the Pile with Llama Tokenizer • 4 items • Updated Apr 15 • 16

upvoted an article about 1 month ago

Article

Jupyter X Hugging Face

Mar 23, 2023

• 2

upvoted 3 collections about 1 month ago

Albert

Collection

Les différents modèles à jour dans la famille Albert, les modèles archivés n'apparaissent pas dans cette collection. The various models behind Albert • 5 items • Updated 3 days ago • 6

OpenELM Pretrained Models

Collection

4 items • Updated Apr 23 • 38

OpenELM Instruct Models

Collection

4 items • Updated Apr 12 • 99

upvoted 2 papers about 1 month ago

OpenELM: An Efficient Language Model Family with Open-source Training and Inference Framework

Paper • 2404.14619 • Published Apr 22 • 122

Jack of All Trades, Master of Some, a Multi-Purpose Transformer Agent

Paper • 2402.09844 • Published Feb 15 • 19

upvoted a collection about 1 month ago

〽️MistralAI

Collection

A collection of MistralAI models that you can trust in production! • 10 items • Updated 7 days ago • 7

upvoted 5 papers about 1 month ago

AutoCrawler: A Progressive Understanding Web Agent for Web Crawler Generation

Paper • 2404.12753 • Published Apr 19 • 38

Groma: Localized Visual Tokenization for Grounding Multimodal Large Language Models

Paper • 2404.13013 • Published Apr 19 • 26

TextSquare: Scaling up Text-Centric Visual Instruction Tuning

Paper • 2404.12803 • Published Apr 19 • 27

Dynamic Typography: Bringing Words to Life

Paper • 2404.11614 • Published Apr 17 • 40

Toward Self-Improvement of LLMs via Imagination, Searching, and Criticizing

Paper • 2404.12253 • Published Apr 18 • 51

upvoted 2 articles about 1 month ago

Article

Custom architectures with HuggingFace 🤗

•

Apr 22

• 20

Article

Welcome Llama 3 - Meta's new open LLM

Apr 18

• 245

upvoted 2 collections about 1 month ago

Meta Llama 3

Collection

This collection hosts the transformers and original repos of the Meta Llama 3 and Llama Guard 2 releases • 5 items • Updated Apr 18 • 557

A little guide to building Large Language Models in 2024

Collection

Resources mentioned by @thomwolf in https://x.com/Thom_Wolf/status/1773340316835131757 • 19 items • Updated Apr 1 • 14

upvoted 2 articles about 1 month ago

Article

Releasing Youtube-Commons: a massive open corpus for conversational and multimodal data

•

Apr 18

• 20

Article

AI Apps in a Flash with Gradio's Reload Mode

Apr 16

• 16

upvoted a collection about 1 month ago

Merges

Collection

Experimental LLM merging • 1292 items • Updated 7 days ago • 7

upvoted 2 articles about 2 months ago

Article

Orchestration of Experts: The First-Principle Multi-Model System

•

2 days ago

• 13

Article

How to train a new language model from scratch using Transformers and Tokenizers

Feb 14, 2020

• 9

upvoted a collection about 2 months ago

Idefics2 🐶

Collection

Idefics2-8B is a foundation vision-language model. In this collection, you will find the models, datasets and demo related to its creation. • 11 items • Updated 26 days ago • 83

upvoted 2 articles about 2 months ago

Article

Vision Language Models Explained

Apr 11

• 92

Article

Mergoo: Efficiently Build Your Own MoE LLM

•

26 days ago

• 32

upvoted 2 collections about 2 months ago

WizardLM

Collection

0 items • Updated 24 days ago • 99

[lecture artifacts] aligning open language models

Collection

artifacts referenced in the talk timeline! Slides: https://docs.google.com/presentation/d/1quMyI4BAx4rvcDfk8jjv063bmHg4RxZd9mhQloXpMn0/edit?usp=sharin • 63 items • Updated Apr 17 • 47

upvoted 3 articles about 2 months ago

Article

Bringing serverless GPU inference to Hugging Face users

Apr 2

• 9

Article

History of State Space Models (SSM) in 2022

•

Apr 11

• 6

Article

DS-MoE: Making MoE Models More Efficient and Less Memory-Intensive

•

Apr 9

• 26

upvoted a paper about 2 months ago

Rho-1: Not All Tokens Are What You Need

Paper • 2404.07965 • Published Apr 11 • 80