Eugene Klimov

Slach

AI & ML interests

None yet

Recent Activity

liked a model 14 days ago

HiDream-ai/HiDream-I1-Full

liked a model 14 days ago

agentica-org/DeepCoder-14B-Preview

liked a model 24 days ago

yandex/YandexGPT-5-Lite-8B-instruct

Organizations

None yet

Slach's activity

upvoted 2 papers about 1 month ago

When Less is Enough: Adaptive Token Reduction for Efficient Image Representation

Paper • 2503.16660 • Published Mar 20 • 73

Feature-Level Insights into Artificial Text Detection with Sparse Autoencoders

Paper • 2503.03601 • Published Mar 5 • 230

upvoted 2 papers about 2 months ago

GHOST 2.0: generative high-fidelity one shot transfer of heads

Paper • 2502.18417 • Published Feb 25 • 67

LLM-Microscope: Uncovering the Hidden Role of Punctuation in Context Memory of Transformers

Paper • 2502.15007 • Published Feb 20 • 175

upvoted a collection 5 months ago

Hymba

A series of Hybrid Small Language Models. • 2 items • Updated about 17 hours ago • 29

upvoted a collection 8 months ago

Minitron

A family of compressed models obtained via pruning and knowledge distillation • 12 items • Updated about 17 hours ago • 61

upvoted a paper 9 months ago

CodexGraph: Bridging Large Language Models and Code Repositories via Code Graph Databases

Paper • 2408.03910 • Published Aug 7, 2024 • 18

upvoted 3 collections 9 months ago

AQLM

AQLM quantized LLMs • 21 items • Updated Feb 28 • 46

AQLM+PV

Official AQLM quantizations for "PV-Tuning: Beyond Straight-Through Estimation for Extreme LLM Compression": https://arxiv.org/abs/2405.14852 • 26 items • Updated Feb 28 • 21

NuminaMath

Datasets and models for training SOTA math LLMs. See our GitHub for training & inference code: https://github.com/project-numina/aimo-progress-prize • 7 items • Updated Feb 10 • 77

upvoted a paper 10 months ago

Associative Recurrent Memory Transformer

Paper • 2407.04841 • Published Jul 5, 2024 • 37

upvoted a paper 11 months ago

Your Transformer is Secretly Linear

Paper • 2405.12250 • Published May 19, 2024 • 159

upvoted a collection 12 months ago

Edit Your Image!

Find all the trending and useful Gradio demos that you can use to edit your images. • 21 items • Updated Apr 26, 2024 • 34

upvoted 2 papers about 1 year ago

OmniFusion Technical Report

Paper • 2404.06212 • Published Apr 9, 2024 • 78

MambaByte: Token-free Selective State Space Model

Paper • 2401.13660 • Published Jan 24, 2024 • 58

upvoted a collection over 1 year ago

vikhr

A family of Russian-translated LLMs • 4 items • Updated Jan 8, 2024 • 16

upvoted 2 papers over 1 year ago

DiLoCo: Distributed Low-Communication Training of Language Models

Paper • 2311.08105 • Published Nov 14, 2023 • 15

Prompt Cache: Modular Attention Reuse for Low-Latency Inference

Paper • 2311.04934 • Published Nov 7, 2023 • 33

upvoted a paper almost 2 years ago

AssistGPT: A General Multi-modal Assistant that can Plan, Execute, Inspect, and Learn

Paper • 2306.08640 • Published Jun 14, 2023 • 26