GPTs are GPTs: An Early Look at the Labor Market Impact Potential of Large Language Models Paper • 2303.10130 • Published Mar 17, 2023 • 3
SWE-bench: Can Language Models Resolve Real-World GitHub Issues? Paper • 2310.06770 • Published Oct 10, 2023 • 4
DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models Paper • 2309.03883 • Published Sep 7, 2023 • 31
Article Banque des Territoires (CDC Group) x Polyconseil x Hugging Face: Enhancing a Major French Environmental Program with a Sovereign Data Solution 13 days ago • 3
APIGen: Automated Pipeline for Generating Verifiable and Diverse Function-Calling Datasets Paper • 2406.18518 • Published 26 days ago • 22
Agentless: Demystifying LLM-based Software Engineering Agents Paper • 2407.01489 • Published 21 days ago • 41
Precise Zero-Shot Dense Retrieval without Relevance Labels Paper • 2212.10496 • Published Dec 20, 2022 • 2
Self-RAG: Learning to Retrieve, Generate, and Critique through Self-Reflection Paper • 2310.11511 • Published Oct 17, 2023 • 68
LongRAG: Enhancing Retrieval-Augmented Generation with Long-context LLMs Paper • 2406.15319 • Published about 1 month ago • 57
The SPRIGHT T2I collection Collection This collection contains the datasets, model, paper, and demo associated with the SPRIGHT (SPatially RIGHT) release. • 5 items • Updated Apr 2 • 5
Article Going multimodal: How Prezi is leveraging the Hub and the Expert Support Program to accelerate their ML roadmap Jun 19 • 6
Article Fine-tuning Florence-2 - Microsoft's Cutting-edge Vision Language Models 28 days ago • 142
Article XLSCOUT Unveils ParaEmbed 2.0: a Powerful Embedding Model Tailored for Patents and IP with Expert Support from Hugging Face 27 days ago • 8
Article Build Agentic Workflow using OpenAGI and HuggingFace models By lucifertrj • 26 days ago • 6
Hallucination-Free? Assessing the Reliability of Leading AI Legal Research Tools Paper • 2405.20362 • Published May 30 • 2
DataComp: In search of the next generation of multimodal datasets Paper • 2304.14108 • Published Apr 27, 2023 • 2
Autoregressive Model Beats Diffusion: Llama for Scalable Image Generation Paper • 2406.06525 • Published Jun 10 • 62
MMLU-Pro: A More Robust and Challenging Multi-Task Language Understanding Benchmark Paper • 2406.01574 • Published Jun 3 • 42
Article Extracting Concepts from LLMs: Anthropic’s recent discoveries 📖 By m-ric • Jun 20 • 25
Sleeper Agents: Training Deceptive LLMs that Persist Through Safety Training Paper • 2401.05566 • Published Jan 10 • 25
Article How Sempre Health is leveraging the Expert Acceleration Program to accelerate their ML roadmap May 19, 2022 • 1
Metacognitive Prompting Improves Understanding in Large Language Models Paper • 2308.05342 • Published Aug 10, 2023 • 2
Large Language Models Struggle to Learn Long-Tail Knowledge Paper • 2211.08411 • Published Nov 15, 2022 • 3
Enhancing Retrieval-Augmented Large Language Models with Iterative Retrieval-Generation Synergy Paper • 2305.15294 • Published May 24, 2023 • 1
No Language Left Behind: Scaling Human-Centered Machine Translation Paper • 2207.04672 • Published Jul 11, 2022 • 1
Article Powerful ASR + diarization + speculative decoding with Hugging Face Inference Endpoints May 1 • 60
HotpotQA: A Dataset for Diverse, Explainable Multi-hop Question Answering Paper • 1809.09600 • Published Sep 25, 2018 • 2
Fast Inference from Transformers via Speculative Decoding Paper • 2211.17192 • Published Nov 30, 2022 • 3
Medusa: Simple LLM Inference Acceleration Framework with Multiple Decoding Heads Paper • 2401.10774 • Published Jan 19 • 51
Sequoia: Scalable, Robust, and Hardware-aware Speculative Decoding Paper • 2402.12374 • Published Feb 19 • 3