Article How to build a custom text classifier without days of human labeling By sdiazlor • 13 days ago • 51
Article Taxonomy Completion with Embedding Quantization and an LLM-based Pipeline: A Case Study in Computational Linguistics By dcarpintero • Jul 22 • 4
Article Binary and Scalar Embedding Quantization for Significantly Faster & Cheaper Retrieval Mar 22 • 58
Zeroshot Classifiers Collection These are my current best zeroshot classifiers. Some of my older models are downloaded more often, but the models in this collection are newer/better. • 11 items • Updated Apr 3 • 110
ReLiK: Retrieve, Read and LinK Collection A blazing fast and lightweight Information Extraction model for Entity Linking and Relation Extraction. • 20 items • Updated Aug 8 • 19
Nomic Embed: Training a Reproducible Long Context Text Embedder Paper • 2402.01613 • Published Feb 2 • 14
Mixture-of-Agents Enhances Large Language Model Capabilities Paper • 2406.04692 • Published Jun 7 • 55
Demonstrate-Search-Predict: Composing retrieval and language models for knowledge-intensive NLP Paper • 2212.14024 • Published Dec 28, 2022 • 3
DSPy: Compiling Declarative Language Model Calls into Self-Improving Pipelines Paper • 2310.03714 • Published Oct 5, 2023 • 30
DSPy Assertions: Computational Constraints for Self-Refining Language Model Pipelines Paper • 2312.13382 • Published Dec 20, 2023 • 3
Judging LLM-as-a-judge with MT-Bench and Chatbot Arena Paper • 2306.05685 • Published Jun 9, 2023 • 29
MMLU-Pro: A More Robust and Challenging Multi-Task Language Understanding Benchmark Paper • 2406.01574 • Published Jun 3 • 42
Improving Text Embeddings with Large Language Models Paper • 2401.00368 • Published Dec 31, 2023 • 79
RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval Paper • 2401.18059 • Published Jan 31 • 34
RAGGED: Towards Informed Design of Retrieval Augmented Generation Systems Paper • 2403.09040 • Published Mar 14 • 1