MultiLegalPile: A 689GB Multilingual Legal Corpus Paper β’ 2306.02069 β’ Published Jun 3, 2023 β’ 1
LegalBench: A Collaboratively Built Benchmark for Measuring Legal Reasoning in Large Language Models Paper β’ 2308.11462 β’ Published Aug 20, 2023 β’ 2
In-Context Prompt Editing For Conditional Audio Generation Paper β’ 2311.00895 β’ Published Nov 1, 2023 β’ 8
view article Article Introducing Spaces Dev Mode for a seamless developer experience 12 days ago β’ 10
view article Article CyberSecEval 2 - A Comprehensive Evaluation Framework for Cybersecurity Risks and Capabilities of Large Language Models 9 days ago β’ 17
Chameleon: Mixed-Modal Early-Fusion Foundation Models Paper β’ 2405.09818 β’ Published 17 days ago β’ 95
OmniGlue: Generalizable Feature Matching with Foundation Model Guidance Paper β’ 2405.12979 β’ Published 11 days ago β’ 7
Reducing Transformer Key-Value Cache Size with Cross-Layer Attention Paper β’ 2405.12981 β’ Published 11 days ago β’ 23
Face Adapter for Pre-Trained Diffusion Models with Fine-Grained ID and Attribute Control Paper β’ 2405.12970 β’ Published 11 days ago β’ 20
Diffusion for World Modeling: Visual Details Matter in Atari Paper β’ 2405.12399 β’ Published 12 days ago β’ 25
πGGUF Collection Llama.cpp compatible models, can be used on CPUs and GPUs! β’ 664 items β’ Updated 2 days ago β’ 23
INDUS: Effective and Efficient Language Models for Scientific Applications Paper β’ 2405.10725 β’ Published 15 days ago β’ 20
ZeroGPU Spaces Collection ZeroGPU Spaces made by the community β’ 16 items β’ Updated 15 days ago β’ 182
PaliGemma Release Collection Pretrained and mix checkpoints for PaliGemma β’ 11 items β’ Updated 15 days ago β’ 103
Granite Code Models Collection A series of code models trained by IBM licensed under Apache 2.0 license. We release both the base pretrained and instruct models. β’ 18 items β’ Updated 2 days ago β’ 135
view article Article PaliGemma β Google's Cutting-Edge Open Vision Language Model 19 days ago β’ 131
view article Article Hugging Face x LangChain : A new partner package in LangChain 19 days ago β’ 70
LoRA Land: 310 Fine-tuned LLMs that Rival GPT-4, A Technical Report Paper β’ 2405.00732 β’ Published Apr 29 β’ 115
Customizing Text-to-Image Models with a Single Image Pair Paper β’ 2405.01536 β’ Published about 1 month ago β’ 17
LLM-AD: Large Language Model based Audio Description System Paper β’ 2405.00983 β’ Published about 1 month ago β’ 13
FLAME: Factuality-Aware Alignment for Large Language Models Paper β’ 2405.01525 β’ Published about 1 month ago β’ 21
NeMo-Aligner: Scalable Toolkit for Efficient Model Alignment Paper β’ 2405.01481 β’ Published about 1 month ago β’ 20
WildChat: 1M ChatGPT Interaction Logs in the Wild Paper β’ 2405.01470 β’ Published about 1 month ago β’ 53
StoryDiffusion: Consistent Self-Attention for Long-Range Image and Video Generation Paper β’ 2405.01434 β’ Published about 1 month ago β’ 44
Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models Paper β’ 2405.01535 β’ Published about 1 month ago β’ 102
Replacing Judges with Juries: Evaluating LLM Generations with a Panel of Diverse Models Paper β’ 2404.18796 β’ Published Apr 29 β’ 63
Kangaroo: Lossless Self-Speculative Decoding via Double Early Exiting Paper β’ 2404.18911 β’ Published Apr 29 β’ 26
PLLaVA : Parameter-free LLaVA Extension from Images to Videos for Video Dense Captioning Paper β’ 2404.16994 β’ Published Apr 25 β’ 31
AdvPrompter: Fast Adaptive Adversarial Prompting for LLMs Paper β’ 2404.16873 β’ Published Apr 21 β’ 26
Layer Skip: Enabling Early Exit Inference and Self-Speculative Decoding Paper β’ 2404.16710 β’ Published Apr 25 β’ 55
What matters when building vision-language models? Paper β’ 2405.02246 β’ Published 29 days ago β’ 87
Llama3-ChatQA-1.5 Collection Llama3-ChatQA-1.5 models excel at conversational question answering (QA) and retrieval-augmented generation (RAG). β’ 6 items β’ Updated 29 days ago β’ 37
view article Article Bringing the Artificial Analysis LLM Performance Leaderboard to Hugging Face 30 days ago β’ 13
CatLIP: CLIP-level Visual Recognition Accuracy with 2.7x Faster Pre-training on Web-scale Image-Text Data Paper β’ 2404.15653 β’ Published Apr 24 β’ 24
PoSE: Efficient Context Window Extension of LLMs via Positional Skip-wise Training Paper β’ 2309.10400 β’ Published Sep 19, 2023 β’ 22
OpenELM: An Efficient Language Model Family with Open-source Training and Inference Framework Paper β’ 2404.14619 β’ Published Apr 22 β’ 122
view article Article Introducing the LiveCodeBench Leaderboard - Holistic and Contamination-Free Evaluation of Code LLMs Apr 16 β’ 11
Phi-3 Collection Phi-3 family of small language and multi-modal models. Language models are available in short- and long-context lengths. β’ 22 items β’ Updated 2 days ago β’ 299
Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone Paper β’ 2404.14219 β’ Published Apr 22 β’ 238
view article Article LLM Comparison/Test: Llama 3 Instruct 70B + 8B HF/GGUF/EXL2 (20 versions tested and compared!) By wolfram β’ Apr 24 β’ 48
view article Article Releasing Youtube-Commons: a massive open corpus for conversational and multimodal data By Pclanglais β’ Apr 18 β’ 20
Meta Llama 3 Collection This collection hosts the transformers and original repos of the Meta Llama 3 and Llama Guard 2 releases β’ 5 items β’ Updated Apr 18 β’ 557
Eurus Collection Advancing LLM Reasoning Generalists with Preference Trees β’ 11 items β’ Updated Apr 15 β’ 22