Daniil Moskovskiy's picture

Daniil Moskovskiy

etomoscow

·

https://t.me/etomoscow

etomoscow

AI & ML interests

NLP

Recent Activity

new activity 4 days ago

s-nlp/paradetox:[bot] Conversion to Parquet

upvoted a paper 9 days ago

Self-Taught Self-Correction for Small Language Models

updated a dataset 10 days ago

etomoscow/dclm-micro

View all activity

Organizations

etomoscow's activity

upvoted a paper 9 days ago

Self-Taught Self-Correction for Small Language Models

Paper • 2503.08681 • Published 26 days ago • 13

upvoted a paper 17 days ago

One-Step Residual Shifting Diffusion for Image Super-Resolution via Distillation

Paper • 2503.13358 • Published 20 days ago • 93

upvoted a collection 24 days ago

OLMo 2

Artifacts for the second set of OLMo models. • 27 items • Updated 17 days ago • 107

upvoted 2 collections about 1 month ago

SynthDetoxM

Data and models from NAACL 2025 paper "SynthDetoxM: Modern LLMs are Few-Shot Parallel Detoxification Data Annotators" by Moskovskiy et al. • 4 items • Updated Mar 6 • 2

Knowledge Packing

Models and datasets from the paper: "How Much Knowledge Can You Pack into a LoRA Adapter without Harming LLM?" https://arxiv.org/abs/2502.14502 • 9 items • Updated Feb 25 • 2

upvoted 3 papers about 1 month ago

LLM-Microscope: Uncovering the Hidden Role of Punctuation in Context Memory of Transformers

Paper • 2502.15007 • Published Feb 20 • 170

SRMT: Shared Memory for Multi-agent Lifelong Pathfinding

Paper • 2501.13200 • Published Jan 22 • 68

How Much Knowledge Can You Pack into a LoRA Adapter without Harming LLM?

Paper • 2502.14502 • Published Feb 20 • 89

upvoted 4 papers about 2 months ago

Cramming 1568 Tokens into a Single Vector and Back Again: Exploring the Limits of Embedding Space Capacity

Paper • 2502.13063 • Published Feb 18 • 69

StyleRemix: Interpretable Authorship Obfuscation via Distillation and Perturbation of Style Elements

Paper • 2408.15666 • Published Aug 28, 2024 • 11

POGEMA: A Benchmark Platform for Cooperative Multi-Agent Navigation

Paper • 2407.14931 • Published Jul 20, 2024 • 22

SynthDetoxM: Modern LLMs are Few-Shot Parallel Detoxification Data Annotators

Paper • 2502.06394 • Published Feb 10 • 90

upvoted a paper 2 months ago

Methods for Detoxification of Texts for the Russian Language

Paper • 2105.09052 • Published May 19, 2021 • 1

upvoted a collection 2 months ago

PseudoParaDetox

Models and datasets from the paper: "LLMs to Replace Crowdsourcing For Parallel Data Creation? The Case of Text Detoxification" by Moskovskiy et al. • 9 items • Updated 28 days ago • 1

upvoted a paper 2 months ago

MERA: A Comprehensive LLM Evaluation in Russian

Paper • 2401.04531 • Published Jan 9, 2024 • 2

upvoted 2 papers 4 months ago

MegaPairs: Massive Data Synthesis For Universal Multimodal Retrieval

Paper • 2412.14475 • Published Dec 19, 2024 • 55

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 364

upvoted a collection 4 months ago

Hate Speech Datasets

5 items • Updated Nov 28, 2024 • 2

upvoted a paper 4 months ago

Unraveling the Complexity of Memory in RL Agents: an Approach for Classification and Evaluation

Paper • 2412.06531 • Published Dec 9, 2024 • 72