19 42 26

Solomatin Roman

Samoed

AI & ML interests

None yet

Recent Activity

updated a dataset about 1 hour ago

DeepPavlov/hwu64

liked a model about 17 hours ago

sergeyzh/rubert-mini-frida

updated a dataset 1 day ago

DeepPavlov/clinc_oos

View all activity

Organizations

Samoed's activity

upvoted a paper 3 days ago

Feature-Level Insights into Artificial Text Detection with Sparse Autoencoders

Paper • 2503.03601 • Published 9 days ago • 207

upvoted a paper 4 days ago

Breaking the Memory Barrier: Near Infinite Batch Size Scaling for Contrastive Loss

Paper • 2410.17243 • Published Oct 22, 2024 • 90

upvoted a paper 5 days ago

SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features

Paper • 2502.14786 • Published 22 days ago • 129

upvoted a paper 9 days ago

Rank1: Test-Time Compute for Reranking in Information Retrieval

Paper • 2502.18418 • Published 17 days ago • 25

upvoted a paper 15 days ago

GHOST 2.0: generative high-fidelity one shot transfer of heads

Paper • 2502.18417 • Published 17 days ago • 63

upvoted a paper 18 days ago

LLM-Microscope: Uncovering the Hidden Role of Punctuation in Context Memory of Transformers

Paper • 2502.15007 • Published 22 days ago • 162

upvoted a paper 19 days ago

How Much Knowledge Can You Pack into a LoRA Adapter without Harming LLM?

Paper • 2502.14502 • Published 22 days ago • 85

upvoted a paper 20 days ago

MMTEB: Massive Multilingual Text Embedding Benchmark

Paper • 2502.13595 • Published 23 days ago • 32

upvoted a paper 21 days ago

Contextual Document Embeddings

Paper • 2410.02525 • Published Oct 3, 2024 • 21

upvoted an article 22 days ago

Article

Train 400x faster Static Embedding Models with Sentence Transformers

Jan 15

• 159

upvoted a paper about 1 month ago

The Differences Between Direct Alignment Algorithms are a Blur

Paper • 2502.01237 • Published Feb 3 • 112

upvoted a collection about 1 month ago

NanoBEIR 🍺

Collection

A collection of smaller versions of BEIR datasets with 50 queries and up to 10K documents each. • 13 items • Updated Sep 11, 2024 • 13

upvoted an article about 1 month ago

Article

KV Caching Explained: Optimizing Transformer Inference Efficiency

•

Jan 30

• 36

upvoted 2 papers about 1 month ago

Towards General Text Embeddings with Multi-stage Contrastive Learning

Paper • 2308.03281 • Published Aug 7, 2023 • 2

Jasper and Stella: distillation of SOTA embedding models

Paper • 2412.19048 • Published Dec 26, 2024 • 1

upvoted a paper about 2 months ago

Facilitating large language model Russian adaptation with Learned Embedding Propagation

Paper • 2412.21140 • Published Dec 30, 2024 • 18

upvoted a paper 2 months ago

Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks

Paper • 1908.10084 • Published Aug 27, 2019 • 5

upvoted a paper 3 months ago

Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference

Paper • 2412.13663 • Published Dec 18, 2024 • 135

upvoted 2 collections 4 months ago

Hymba

Collection

A series of Hybrid Small Language Models. • 2 items • Updated Jan 17 • 28

Tulu 3 Models

Collection

All models released with Tulu 3 -- state of the art open post-training recipes. • 11 items • Updated about 21 hours ago • 93