Training Sparse Mixture Of Experts Text Embedding Models Paper โข 2502.07972 โข Published Feb 11 โข 5
LLM-Microscope: Uncovering the Hidden Role of Punctuation in Context Memory of Transformers Paper โข 2502.15007 โข Published 25 days ago โข 163
MLGym: A New Framework and Benchmark for Advancing AI Research Agents Paper โข 2502.14499 โข Published 25 days ago โข 180
rusBEIR-datasets Collection Collection of datasets used in rusBEIR โข 57 items โข Updated 10 days ago โข 4
Russian Q&A datasets Collection Datasets collected from scraping Russian question answering websites โข 4 items โข Updated Mar 15, 2024 โข 1
The Russian-focused embedders' exploration: ruMTEB benchmark and Russian embedding model design Paper โข 2408.12503 โข Published Aug 22, 2024 โข 24