-
Improving Text Embeddings with Large Language Models
Paper • 2401.00368 • Published • 72 -
Piccolo2: General Text Embedding with Multi-task Hybrid Loss Training
Paper • 2405.06932 • Published • 15 -
Gecko: Versatile Text Embeddings Distilled from Large Language Models
Paper • 2403.20327 • Published • 43 -
Multilingual E5 Text Embeddings: A Technical Report
Paper • 2402.05672 • Published • 16
Collections
Discover the best community collections!
Collections including paper arxiv:2402.05672
-
Text Embeddings by Weakly-Supervised Contrastive Pre-training
Paper • 2212.03533 • Published • 1 -
Gecko: Versatile Text Embeddings Distilled from Large Language Models
Paper • 2403.20327 • Published • 43 -
Improving Text Embeddings with Large Language Models
Paper • 2401.00368 • Published • 72 -
Generative Representational Instruction Tuning
Paper • 2402.09906 • Published • 50
-
LongAlign: A Recipe for Long Context Alignment of Large Language Models
Paper • 2401.18058 • Published • 21 -
Efficient Tool Use with Chain-of-Abstraction Reasoning
Paper • 2401.17464 • Published • 15 -
Scavenging Hyena: Distilling Transformers into Long Convolution Models
Paper • 2401.17574 • Published • 14 -
Rethinking Interpretability in the Era of Large Language Models
Paper • 2402.01761 • Published • 18
-
LLaMA Beyond English: An Empirical Study on Language Capability Transfer
Paper • 2401.01055 • Published • 50 -
YAYI 2: Multilingual Open-Source Large Language Models
Paper • 2312.14862 • Published • 11 -
Order Matters in the Presence of Dataset Imbalance for Multilingual Learning
Paper • 2312.06134 • Published • 2 -
TaCo: Enhancing Cross-Lingual Transfer for Low-Resource Languages in LLMs through Translation-Assisted Chain-of-Thought Processes
Paper • 2311.10797 • Published
-
Dense X Retrieval: What Retrieval Granularity Should We Use?
Paper • 2312.06648 • Published • 1 -
Improving Text Embeddings with Large Language Models
Paper • 2401.00368 • Published • 72 -
Text Embeddings Reveal (Almost) As Much As Text
Paper • 2310.06816 • Published • 1 -
RAG vs Fine-tuning: Pipelines, Tradeoffs, and a Case Study on Agriculture
Paper • 2401.08406 • Published • 36
-
Kosmos-2.5: A Multimodal Literate Model
Paper • 2309.11419 • Published • 47 -
Nougat: Neural Optical Understanding for Academic Documents
Paper • 2308.13418 • Published • 33 -
Prometheus: Inducing Fine-grained Evaluation Capability in Language Models
Paper • 2310.08491 • Published • 49 -
The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits
Paper • 2402.17764 • Published • 566