clem (Clem 🤗)

upvoted 3 papers 3 days ago

Phased Consistency Model

Paper • 2405.18407 • Published 4 days ago • 33

MultiLegalPile: A 689GB Multilingual Legal Corpus

Paper • 2306.02069 • Published Jun 3, 2023 • 1

LegalBench: A Collaboratively Built Benchmark for Measuring Legal Reasoning in Large Language Models

Paper • 2308.11462 • Published Aug 20, 2023 • 2

upvoted a paper 4 days ago

In-Context Prompt Editing For Conditional Audio Generation

Paper • 2311.00895 • Published Nov 1, 2023 • 8

upvoted 2 articles 5 days ago

Article

AI has a problem with objectifying women

By

•

8 days ago

• 52

Article

Introducing Spaces Dev Mode for a seamless developer experience

12 days ago

• 10

upvoted an article 8 days ago

Article

CyberSecEval 2 - A Comprehensive Evaluation Framework for Cybersecurity Risks and Capabilities of Large Language Models

9 days ago

• 17

upvoted 7 papers 10 days ago

Face Adapter for Pre-Trained Diffusion Models with Fine-Grained ID and Attribute Control

Paper • 2405.12970 • Published 11 days ago • 20

Diffusion for World Modeling: Visual Details Matter in Atari

Paper • 2405.12399 • Published 12 days ago • 25

Your Transformer is Secretly Linear

Paper • 2405.12250 • Published 13 days ago • 134

upvoted an article 10 days ago

Article

Let's talk about LLM evaluation

By

•

9 days ago

• 82

upvoted a collection 11 days ago

🚀GGUF

Collection

Llama.cpp compatible models, can be used on CPUs and GPUs! • 664 items • Updated 2 days ago • 23

upvoted an article 11 days ago

Article

Gradio joins Hugging Face!

Dec 21, 2021

• 1

upvoted a paper 12 days ago

INDUS: Effective and Efficient Language Models for Scientific Applications

Paper • 2405.10725 • Published 15 days ago • 20

upvoted an article 16 days ago

Article

2024-04-22 - Hub Incident Post Mortem

By

•

16 days ago

• 15

upvoted 4 collections 18 days ago

ZeroGPU Spaces

Collection

ZeroGPU Spaces made by the community • 16 items • Updated 15 days ago • 182

PaliGemma Release

Collection

Pretrained and mix checkpoints for PaliGemma • 11 items • Updated 15 days ago • 103

Granite Code Models

Collection

A series of code models trained by IBM licensed under Apache 2.0 license. We release both the base pretrained and instruct models. • 18 items • Updated 2 days ago • 135

Yi-1.5 (2024/05)

Collection

10 items • Updated 13 days ago • 76

upvoted 2 articles 18 days ago

Article

PaliGemma – Google's Cutting-Edge Open Vision Language Model

19 days ago

• 131

Article

Hugging Face x LangChain : A new partner package in LangChain

19 days ago

• 70

upvoted a paper 21 days ago

LoRA Land: 310 Fine-tuned LLMs that Rival GPT-4, A Technical Report

Paper • 2405.00732 • Published Apr 29 • 115

upvoted 13 papers 22 days ago

Customizing Text-to-Image Models with a Single Image Pair

Paper • 2405.01536 • Published about 1 month ago • 17

LLM-AD: Large Language Model based Audio Description System

Paper • 2405.00983 • Published about 1 month ago • 13

FLAME: Factuality-Aware Alignment for Large Language Models

Paper • 2405.01525 • Published about 1 month ago • 21

NeMo-Aligner: Scalable Toolkit for Efficient Model Alignment

Paper • 2405.01481 • Published about 1 month ago • 20

WildChat: 1M ChatGPT Interaction Logs in the Wild

Paper • 2405.01470 • Published about 1 month ago • 53

StoryDiffusion: Consistent Self-Attention for Long-Range Image and Video Generation

Paper • 2405.01434 • Published about 1 month ago • 44

Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models

Paper • 2405.01535 • Published about 1 month ago • 102

Replacing Judges with Juries: Evaluating LLM Generations with a Panel of Diverse Models

Paper • 2404.18796 • Published Apr 29 • 63

Kangaroo: Lossless Self-Speculative Decoding via Double Early Exiting

Paper • 2404.18911 • Published Apr 29 • 26

PLLaVA : Parameter-free LLaVA Extension from Images to Videos for Video Dense Captioning

Paper • 2404.16994 • Published Apr 25 • 31

AdvPrompter: Fast Adaptive Adversarial Prompting for LLMs

Paper • 2404.16873 • Published Apr 21 • 26

Layer Skip: Enabling Early Exit Inference and Self-Speculative Decoding

Paper • 2404.16710 • Published Apr 25 • 55

Make Your LLM Fully Utilize the Context

Paper • 2404.16811 • Published Apr 25 • 52

upvoted a paper 26 days ago

What matters when building vision-language models?

Paper • 2405.02246 • Published 29 days ago • 87

upvoted a collection 29 days ago

Llama3-ChatQA-1.5

Collection

Llama3-ChatQA-1.5 models excel at conversational question answering (QA) and retrieval-augmented generation (RAG). • 6 items • Updated 29 days ago • 37

upvoted an article 29 days ago

Article

Bringing the Artificial Analysis LLM Performance Leaderboard to Hugging Face

30 days ago

• 13

upvoted an article about 1 month ago

Article

Improving Prompt Consistency with Structured Generations

Apr 30

• 46

upvoted 3 papers about 1 month ago

CatLIP: CLIP-level Visual Recognition Accuracy with 2.7x Faster Pre-training on Web-scale Image-Text Data

Paper • 2404.15653 • Published Apr 24 • 24

PoSE: Efficient Context Window Extension of LLMs via Positional Skip-wise Training

Paper • 2309.10400 • Published Sep 19, 2023 • 22

OpenELM: An Efficient Language Model Family with Open-source Training and Inference Framework

Paper • 2404.14619 • Published Apr 22 • 122

upvoted a collection about 1 month ago

OpenELM Instruct Models

Collection

4 items • Updated Apr 12 • 99

upvoted a paper about 1 month ago

Music Consistency Models

Paper • 2404.13358 • Published Apr 20 • 12

upvoted 2 articles about 1 month ago

Article

Introducing the LiveCodeBench Leaderboard - Holistic and Contamination-Free Evaluation of Code LLMs

Apr 16

• 11

Article

Introducing the Open Chain of Thought Leaderboard

Apr 23

• 20

upvoted a collection about 1 month ago

Phi-3

Collection

Phi-3 family of small language and multi-modal models. Language models are available in short- and long-context lengths. • 22 items • Updated 2 days ago • 299

upvoted a paper about 1 month ago

Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

Paper • 2404.14219 • Published Apr 22 • 238

upvoted an article about 1 month ago

Article

Fine-tune Llama 3 with ORPO

By

•

Apr 22

• 193

upvoted 2 collections about 1 month ago

LLaVA-NeXT-Video

Collection

Some powerful video models. • 5 items • Updated Apr 20 • 15

🔒☂️🧑‍🤝‍🧑 Privacy and AI

Collection

8 items • Updated Apr 4 • 5

upvoted 2 articles about 1 month ago

Article

LLM Comparison/Test: Llama 3 Instruct 70B + 8B HF/GGUF/EXL2 (20 versions tested and compared!)

By

•

Apr 24

• 48

Article

Releasing Youtube-Commons: a massive open corpus for conversational and multimodal data

By

•

Apr 18

• 20

upvoted a collection about 1 month ago

Meta Llama 3

Collection

This collection hosts the transformers and original repos of the Meta Llama 3 and Llama Guard 2 releases • 5 items • Updated Apr 18 • 557

upvoted an article about 2 months ago

Article

Custom architectures with HuggingFace 🤗

By

•

Apr 22

• 20

upvoted a collection about 2 months ago

Eurus

Collection

Advancing LLM Reasoning Generalists with Preference Trees • 11 items • Updated Apr 15 • 22

Clem 🤗 PRO

AI & ML interests

Organizations

clem's activity

AI has a problem with objectifying women

Introducing Spaces Dev Mode for a seamless developer experience

CyberSecEval 2 - A Comprehensive Evaluation Framework for Cybersecurity Risks and Capabilities of Large Language Models

Let's talk about LLM evaluation

Gradio joins Hugging Face!

2024-04-22 - Hub Incident Post Mortem

PaliGemma – Google's Cutting-Edge Open Vision Language Model

Hugging Face x LangChain : A new partner package in LangChain

Bringing the Artificial Analysis LLM Performance Leaderboard to Hugging Face

Improving Prompt Consistency with Structured Generations

Introducing the LiveCodeBench Leaderboard - Holistic and Contamination-Free Evaluation of Code LLMs

Introducing the Open Chain of Thought Leaderboard

Fine-tune Llama 3 with ORPO

LLM Comparison/Test: Llama 3 Instruct 70B + 8B HF/GGUF/EXL2 (20 versions tested and compared!)

Releasing Youtube-Commons: a massive open corpus for conversational and multimodal data

Custom architectures with HuggingFace 🤗