8 19 1

Andrei Semenov

Andron00e

https://andron00e.github.io/

AI & ML interests

NLP, CV, Optimization

Recent Activity

upvoted a paper about 2 months ago

LLM Pretraining with Continuous Concepts

upvoted a paper 4 months ago

Just a Simple Transformation is Enough for Data Protection in Vertical Federated Learning

commented on a paper 4 months ago

Just a Simple Transformation is Enough for Data Protection in Vertical Federated Learning

View all activity

Organizations

Andron00e's activity

upvoted a paper about 2 months ago

LLM Pretraining with Continuous Concepts

Paper • 2502.08524 • Published Feb 12 • 28

upvoted a paper 4 months ago

Just a Simple Transformation is Enough for Data Protection in Vertical Federated Learning

Paper • 2412.11689 • Published Dec 16, 2024 • 2

upvoted a paper 5 months ago

Unpacking SDXL Turbo: Interpreting Text-to-Image Models with Sparse Autoencoders

Paper • 2410.22366 • Published Oct 28, 2024 • 81

upvoted a paper 7 months ago

Accurate Compression of Text-to-Image Diffusion Models via Vector Quantization

Paper • 2409.00492 • Published Aug 31, 2024 • 11

upvoted a collection 9 months ago

MatMulfree LM

Collection

Pre-trined models for Matmulfree LM. • 4 items • Updated Jun 10, 2024 • 25

upvoted a paper 10 months ago

XLand-100B: A Large-Scale Multi-Task Dataset for In-Context Reinforcement Learning

Paper • 2406.08973 • Published Jun 13, 2024 • 89

upvoted an article 10 months ago

Article

🧨 Diffusers welcomes Stable Diffusion 3

Jun 12, 2024

• 95

upvoted an article 12 months ago

Article

Welcome Llama 3 - Meta's new open LLM

Apr 18, 2024

• 286

upvoted 2 collections 12 months ago

Meta Llama 3

Collection

This collection hosts the transformers and original repos of the Meta Llama 3 and Llama Guard 2 releases • 5 items • Updated Dec 6, 2024 • 737

Papers-to-read

Collection

9 items • Updated Apr 25, 2024 • 4

upvoted 3 papers 12 months ago

upvoted 6 papers about 1 year ago

Mamba: Linear-Time Sequence Modeling with Selective State Spaces

Paper • 2312.00752 • Published Dec 1, 2023 • 143

Synth^2: Boosting Visual-Language Models with Synthetic Captions and Image Embeddings

Paper • 2403.07750 • Published Mar 12, 2024 • 23

Stealing Part of a Production Language Model

Paper • 2403.06634 • Published Mar 11, 2024 • 91

DeepSeek-VL: Towards Real-World Vision-Language Understanding

Paper • 2403.05525 • Published Mar 8, 2024 • 45

VisionLLaMA: A Unified LLaMA Interface for Vision Tasks

Paper • 2403.00522 • Published Mar 1, 2024 • 46

FiT: Flexible Vision Transformer for Diffusion Model

Paper • 2402.12376 • Published Feb 19, 2024 • 48