Julien Chaumond PRO

julien-c

https://huggingface.co

AI & ML interests

<3 ML/AI for everyone, building products to propel communities fwd

Articles

Hugging Face partners with Wiz Research to Improve AI Security

Apr 4

• 10

Introducing Storage Regions on the HF Hub

Nov 3, 2023

Hugging Face Selected for the French Data Protection Agency Enhanced Support Program

May 15, 2023

How to train a new language model from scratch using Transformers and Tokenizers

Feb 14, 2020

• 8

julien-c's activity

upvoted a paper 2 days ago

Hunyuan-DiT: A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding

Paper • 2405.08748 • Published 3 days ago • 13

upvoted a collection 2 days ago

Embedding Model Datasets

Collection

A curated subset of the datasets that work out of the box with Sentence Transformers: https://huggingface.co/datasets?other=sentence-transformers • 49 items • Updated 2 days ago • 10

upvoted an article 3 days ago

Article

Introducing the Open Arabic LLM Leaderboard

4 days ago

• 37

upvoted a collection 3 days ago

PaliGemma Release

Collection

Pretrained and mix checkpoints for PaliGemma • 11 items • Updated about 16 hours ago • 85

upvoted an article 3 days ago

Article

PaliGemma – Google's Cutting-Edge Open Vision Language Model

4 days ago

• 91

upvoted an article 4 days ago

Article

License to Call: Introducing Transformers Agents 2.0

5 days ago

• 63

upvoted an article 8 days ago

Article

Subscribe to Enterprise Hub with your AWS Account

9 days ago

• 4

upvoted 4 collections 8 days ago

upvoted a paper 10 days ago

CommonCanvas: An Open Diffusion Model Trained with Creative-Commons Images

Paper • 2310.16825 • Published Oct 25, 2023 • 27

upvoted a paper 11 days ago

KAN: Kolmogorov-Arnold Networks

Paper • 2404.19756 • Published 17 days ago • 90

upvoted 3 papers 12 days ago

Octopus v4: Graph of language models

Paper • 2404.19296 • Published 18 days ago • 89

LoRA Land: 310 Fine-tuned LLMs that Rival GPT-4, A Technical Report

Paper • 2405.00732 • Published 19 days ago • 107

Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models

Paper • 2405.01535 • Published 15 days ago • 92

upvoted a collection 14 days ago

MetaAI's CodeLlama - Coding Assistant LLM

Collection

Fast, small, and capable coding model you can run locally on your computer! Requires 8GB+ of RAM. • 4 items • Updated Sep 8, 2023 • 5

upvoted 3 papers 15 days ago

CatLIP: CLIP-level Visual Recognition Accuracy with 2.7x Faster Pre-training on Web-scale Image-Text Data

Paper • 2404.15653 • Published 24 days ago • 24

A Careful Examination of Large Language Model Performance on Grade School Arithmetic

Paper • 2405.00332 • Published 17 days ago • 24

Personalize Segment Anything Model with One Shot

Paper • 2305.03048 • Published May 4, 2023 • 6

upvoted 2 collections 18 days ago

git-theta

Collection

Playing with git-theta: https://github.com/r-three/git-theta • 2 items • Updated 18 days ago • 1

Pile-T5

Collection

T5 trained on the Pile with Llama Tokenizer • 4 items • Updated Apr 15 • 16

upvoted an article 18 days ago

Article

Jupyter X Hugging Face

Mar 23, 2023

• 2

upvoted a collection 22 days ago

Albert

Collection

The up-to-date models in the Albert family; archived models do not appear in this collection. The various models behind Albert • 4 items • Updated 5 days ago • 6

upvoted a collection 23 days ago

OpenELM Pretrained Models

Collection

4 items • Updated 24 days ago • 36

upvoted a collection 24 days ago

OpenELM Instruct Models

Collection

4 items • Updated Apr 12 • 96

upvoted a paper 24 days ago

OpenELM: An Efficient Language Model Family with Open-source Training and Inference Framework

Paper • 2404.14619 • Published 25 days ago • 120

upvoted a paper 25 days ago

Jack of All Trades, Master of Some, a Multi-Purpose Transformer Agent

Paper • 2402.09844 • Published Feb 15 • 18

upvoted a collection 25 days ago

〽️MistralAI

Collection

A collection of MistralAI models that you can trust in production! • 7 items • Updated 8 days ago • 7

upvoted 5 papers 25 days ago

AutoCrawler: A Progressive Understanding Web Agent for Web Crawler Generation

Paper • 2404.12753 • Published 29 days ago • 38

Groma: Localized Visual Tokenization for Grounding Multimodal Large Language Models

Paper • 2404.13013 • Published 28 days ago • 26

TextSquare: Scaling up Text-Centric Visual Instruction Tuning

Paper • 2404.12803 • Published 29 days ago • 27

Dynamic Typography: Bringing Words to Life

Paper • 2404.11614 • Published about 1 month ago • 40

Toward Self-Improvement of LLMs via Imagination, Searching, and Criticizing

Paper • 2404.12253 • Published 29 days ago • 50

upvoted 2 articles 26 days ago

Article

Custom architectures with HuggingFace 🤗

26 days ago

• 20

Article

Welcome Llama 3 - Meta's new open LLM

about 1 month ago

• 238

upvoted 2 collections 29 days ago

Meta Llama 3

Collection

This collection hosts the transformers and original repos of the Meta Llama 3 and Llama Guard 2 releases • 5 items • Updated 29 days ago • 520

A little guide to building Large Language Models in 2024

Collection

Resources mentioned by @thomwolf in https://x.com/Thom_Wolf/status/1773340316835131757 • 19 items • Updated Apr 1 • 13

upvoted 2 articles 30 days ago

Article

Releasing Youtube-Commons: a massive open corpus for conversational and multimodal data

30 days ago

• 20

Article

AI Apps in a Flash with Gradio's Reload Mode

Apr 16

• 16

upvoted a collection 30 days ago

Merges

Collection

Experimental LLM merging • 1292 items • Updated 8 days ago • 7

upvoted 2 articles about 1 month ago

Article

Orchestration of Experts: The First-Principle Multi-Model System

Apr 16

• 8

Article

How to train a new language model from scratch using Transformers and Tokenizers

Feb 14, 2020

• 8

upvoted a collection about 1 month ago

Idefics2 🐶

Collection

Idefics2-8B is a foundation vision-language model. In this collection, you will find the models, datasets and demo related to its creation. • 11 items • Updated 12 days ago • 76

upvoted 2 articles about 1 month ago

Article

Vision Language Models Explained

Apr 11

• 82

Article

Mergoo: Efficiently Build Your Own MoE LLM

11 days ago

• 32

upvoted 2 collections about 1 month ago

WizardLM

Collection

0 items • Updated 9 days ago • 95

[lecture artifacts] aligning open language models

Collection

artifacts referenced in the talk timeline! Slides: https://docs.google.com/presentation/d/1quMyI4BAx4rvcDfk8jjv063bmHg4RxZd9mhQloXpMn0/edit?usp=sharin • 63 items • Updated about 1 month ago • 41

upvoted 3 articles about 1 month ago

Article

Bringing serverless GPU inference to Hugging Face users

Apr 2

• 9

Article

History of State Space Models (SSM) in 2022

Apr 11

• 6

Article

DS-MoE: Making MoE Models More Efficient and Less Memory-Intensive

Apr 9

• 26

upvoted 5 papers about 1 month ago

Rho-1: Not All Tokens Are What You Need

Paper • 2404.07965 • Published Apr 11 • 79

RecurrentGemma: Moving Past Transformers for Efficient Open Language Models

Paper • 2404.07839 • Published Apr 11 • 37

OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments

Paper • 2404.07972 • Published Apr 11 • 40

ControlNet++: Improving Conditional Controls with Efficient Consistency Feedback

Paper • 2404.07987 • Published Apr 11 • 45

DreamScene360: Unconstrained Text-to-3D Scene Generation with Panoramic Gaussian Splatting

Paper • 2404.06903 • Published Apr 10 • 14

upvoted 4 articles about 1 month ago

Article

Mixture of Depth is Vibe

26 days ago

• 35

Article

Text2SQL using Hugging Face Dataset Viewer API and Motherduck DuckDB-NSQL-7B

Apr 4

• 20

Article

Making thousands of open LLMs bloom in the Vertex AI Model Garden

Apr 10

• 16

Article

CodeGemma - an official Google release for code LLMs

Apr 9

• 95