4 51 24

Alexey G

grib0ed0v

AI & ML interests

LLM / RLHF / AI4Everything.

Recent Activity

upvoted an article about 24 hours ago

Welcome to Inference Providers on the Hub 🔥

upvoted an article 3 days ago

SmolVLM Grows Smaller – Introducing the 250M & 500M Models!

upvoted an article 3 days ago

Train 400x faster Static Embedding Models with Sentence Transformers

View all activity

Organizations

grib0ed0v's activity

upvoted an article about 24 hours ago

Article

Welcome to Inference Providers on the Hub 🔥

3 days ago

• 172

upvoted 3 articles 3 days ago

Article

SmolVLM Grows Smaller – Introducing the 250M & 500M Models!

8 days ago

• 96

Article

Train 400x faster Static Embedding Models with Sentence Transformers

16 days ago

• 129

Article

Introducing smolagents: simple agents that write actions in code.

Dec 31, 2024

• 536

upvoted an article 4 days ago

Article

We now support VLMs in smolagents!

7 days ago

• 65

upvoted an article 7 days ago

Article

Finally, a Replacement for BERT: Introducing ModernBERT

Dec 19, 2024

• 514

upvoted 2 articles 9 days ago

Article

Introducing multi-backends (TRT-LLM, vLLM) support for Text Generation Inference

15 days ago

• 61

Article

Timm ❤️ Transformers: Use any timm model with transformers

15 days ago

• 37

upvoted a collection about 2 months ago

SigLIP

Collection

Contrastive (sigmoid) image-text models from https://arxiv.org/abs/2303.15343 • 10 items • Updated Dec 13, 2024 • 50

upvoted a collection 2 months ago

Cultura-Ru-Edu

Collection

Our dataset for enhancing LLM training with educational content in the Russian language. • 2 items • Updated Nov 26, 2024 • 5

upvoted 2 papers 2 months ago

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

Paper • 1810.04805 • Published Oct 11, 2018 • 16

Attention Is All You Need

Paper • 1706.03762 • Published Jun 12, 2017 • 50

upvoted an article 2 months ago

Article

Let’s make a generation of amazing image generation models

•

Nov 26, 2024

• 34

upvoted a paper 2 months ago

SageAttention2 Technical Report: Accurate 4 Bit Attention for Plug-and-play Inference Acceleration

Paper • 2411.10958 • Published Nov 17, 2024 • 52

upvoted 6 papers 3 months ago

Large Language Models Can Self-Improve in Long-context Reasoning

Paper • 2411.08147 • Published Nov 12, 2024 • 63

RLEF: Grounding Code LLMs in Execution Feedback with Reinforcement Learning

Paper • 2410.02089 • Published Oct 2, 2024 • 12

OpenWebVoyager: Building Multimodal Web Agents via Iterative Real-World Exploration, Feedback and Optimization

Paper • 2410.19609 • Published Oct 25, 2024 • 17

MMAU: A Massive Multi-Task Audio Understanding and Reasoning Benchmark

Paper • 2410.19168 • Published Oct 24, 2024 • 19

LOGO -- Long cOntext aliGnment via efficient preference Optimization

Paper • 2410.18533 • Published Oct 24, 2024 • 42

MIA-DPO: Multi-Image Augmented Direct Preference Optimization For Large Vision-Language Models

Paper • 2410.17637 • Published Oct 23, 2024 • 34