clem (Clem 🤗)

upvoted 4 collections about 21 hours ago

upvoted an article about 21 hours ago

Article

PaliGemma – Google's Cutting-Edge Open Vision Language Model

1 day ago

• 58

upvoted an article about 22 hours ago

Article

Hugging Face x LangChain : A new partner package in LangChain

1 day ago

• 34

upvoted a paper 4 days ago

LoRA Land: 310 Fine-tuned LLMs that Rival GPT-4, A Technical Report

Paper • 2405.00732 • Published 17 days ago • 103

upvoted 13 papers 5 days ago

Customizing Text-to-Image Models with a Single Image Pair

Paper • 2405.01536 • Published 13 days ago • 17

LLM-AD: Large Language Model based Audio Description System

Paper • 2405.00983 • Published 14 days ago • 13

FLAME: Factuality-Aware Alignment for Large Language Models

Paper • 2405.01525 • Published 13 days ago • 19

NeMo-Aligner: Scalable Toolkit for Efficient Model Alignment

Paper • 2405.01481 • Published 13 days ago • 20

WildChat: 1M ChatGPT Interaction Logs in the Wild

Paper • 2405.01470 • Published 13 days ago • 52

StoryDiffusion: Consistent Self-Attention for Long-Range Image and Video Generation

Paper • 2405.01434 • Published 13 days ago • 44

Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models

Paper • 2405.01535 • Published 13 days ago • 88

Replacing Judges with Juries: Evaluating LLM Generations with a Panel of Diverse Models

Paper • 2404.18796 • Published 16 days ago • 62

Kangaroo: Lossless Self-Speculative Decoding via Double Early Exiting

Paper • 2404.18911 • Published 16 days ago • 25

PLLaVA : Parameter-free LLaVA Extension from Images to Videos for Video Dense Captioning

Paper • 2404.16994 • Published 20 days ago • 30

AdvPrompter: Fast Adaptive Adversarial Prompting for LLMs

Paper • 2404.16873 • Published 24 days ago • 25

Layer Skip: Enabling Early Exit Inference and Self-Speculative Decoding

Paper • 2404.16710 • Published 20 days ago • 54

Make Your LLM Fully Utilize the Context

Paper • 2404.16811 • Published 20 days ago • 50

upvoted a paper 9 days ago

What matters when building vision-language models?

Paper • 2405.02246 • Published 12 days ago • 64

upvoted a collection 12 days ago

Llama3-ChatQA-1.5

Collection

Llama3-ChatQA-1.5 models excel at conversational question answering (QA) and retrieval-augmented generation (RAG). • 6 items • Updated 12 days ago • 34

upvoted an article 12 days ago

Article

Bringing the Artificial Analysis LLM Performance Leaderboard to Hugging Face

13 days ago

• 12

upvoted an article 13 days ago

Article

Improving Prompt Consistency with Structured Generations

16 days ago

• 35

upvoted 2 papers 20 days ago

CatLIP: CLIP-level Visual Recognition Accuracy with 2.7x Faster Pre-training on Web-scale Image-Text Data

Paper • 2404.15653 • Published 21 days ago • 24

PoSE: Efficient Context Window Extension of LLMs via Positional Skip-wise Training

Paper • 2309.10400 • Published Sep 19, 2023 • 22

upvoted a paper 21 days ago

OpenELM: An Efficient Language Model Family with Open-source Training and Inference Framework

Paper • 2404.14619 • Published 23 days ago • 120

upvoted a collection 21 days ago

OpenELM Instruct Models

Collection

4 items • Updated Apr 12 • 96

upvoted a paper 22 days ago

Music Consistency Models

Paper • 2404.13358 • Published 25 days ago • 12

upvoted 2 articles 22 days ago

Article

Introducing the LiveCodeBench Leaderboard - Holistic and Contamination-Free Evaluation of Code LLMs

30 days ago

• 11

Article

Introducing the Open Chain of Thought Leaderboard

23 days ago

• 20

upvoted a collection 22 days ago

Phi-3

Collection

Phi-3 family of models • 6 items • Updated 2 days ago • 195

upvoted a paper 23 days ago

Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

Paper • 2404.14219 • Published 23 days ago • 229

upvoted an article 23 days ago

Article

Fine-tune Llama 3 with ORPO

23 days ago

• 174

upvoted 2 collections 23 days ago

LLaVA-NeXT-Video

Collection

Some powerful video models. • 5 items • Updated 25 days ago • 11

🔒☂️🧑‍🤝‍🧑 Privacy and AI

Collection

8 items • Updated Apr 4 • 5

upvoted an article 23 days ago

Article

LLM Comparison/Test: Llama 3 Instruct 70B + 8B HF/GGUF/EXL2 (20 versions tested and compared!)

21 days ago

• 37

upvoted an article 25 days ago

Article

Releasing Youtube-Commons: a massive open corpus for conversational and multimodal data

27 days ago

• 20

upvoted a collection 27 days ago

Meta Llama 3

Collection

This collection hosts the transformers and original repos of the Meta Llama 3 and Llama Guard 2 releases • 5 items • Updated 27 days ago • 512

upvoted an article 29 days ago

Article

Custom architectures with HuggingFace 🤗

24 days ago

• 20

upvoted a collection 30 days ago

Eurus

Collection

Advancing LLM Reasoning Generalists with Preference Trees • 11 items • Updated about 1 month ago • 22

upvoted an article 30 days ago

Article

Introducing Idefics2: A Powerful 8B Vision-Language Model for the community

about 1 month ago

• 124

upvoted 2 collections 30 days ago

WizardLM

Collection

0 items • Updated 7 days ago • 95

Idefics2 🐶

Collection

Idefics2-8B is a foundation vision-language model. In this collection, you will find the models, datasets and demo related to its creation. • 11 items • Updated 9 days ago • 73

upvoted a paper about 1 month ago

Direct Nash Optimization: Teaching Language Models to Self-Improve with General Preferences

Paper • 2404.03715 • Published Apr 4 • 57

upvoted an article about 1 month ago

Article

AI Watermarking 101: Tools and Techniques

Feb 26

• 5

upvoted 5 collections about 1 month ago

C4AI Command R Plus

Collection

C4AI Command R+ is an open weights research release of a 104 billion parameter model with highly advanced capabilities. • 3 items • Updated Apr 5 • 12

Zeroshot Classifiers

Collection

These are my current best zeroshot classifiers. Some of my older models are downloaded more often, but the models in this collection are newer/better. • 11 items • Updated Apr 3 • 76

PDF Document / OCR Datasets

Collection

Document datasets with .pdf files that are usable with pixparse libraries and tools. • 2 items • Updated Mar 30 • 36

HyperGraph Datasets

Collection

Collection of HyperGraph Datasets • 17 items • Updated Apr 4 • 7

Moirai-1.0-R models

Collection

3 items • Updated 5 days ago • 20

upvoted a paper about 1 month ago

Jamba: A Hybrid Transformer-Mamba Language Model

Paper • 2403.19887 • Published Mar 28 • 98

upvoted 2 collections about 2 months ago

DBRX

Collection

DBRX is a mixture-of-experts (MoE) large language model trained from scratch by Databricks. • 3 items • Updated Mar 27 • 88

Recent Mamba Papers

Collection

[NB: Notes are from TuringPost] • 3 items • Updated Mar 26 • 9

upvoted 6 papers about 2 months ago

Mora: Enabling Generalist Video Generation via A Multi-Agent Framework

Paper • 2403.13248 • Published Mar 20 • 71

Gemma: Open Models Based on Gemini Research and Technology

Paper • 2403.08295 • Published Mar 13 • 42

Simple and Scalable Strategies to Continually Pre-train Large Language Models

Paper • 2403.08763 • Published Mar 13 • 48

Unlocking the conversion of Web Screenshots into HTML Code with the WebSight Dataset

Paper • 2403.09029 • Published Mar 14 • 52

MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training

Paper • 2403.09611 • Published Mar 14 • 119

RAFT: Adapting Language Model to Domain Specific RAG

Paper • 2403.10131 • Published Mar 15 • 58
