Renat's picture

8 3

Renat

u-brixton

·

AI & ML interests

None yet

Recent Activity

updated a collection about 4 hours ago

updated a collection about 4 hours ago

updated a collection about 4 hours ago

View all activity

Organizations

u-brixton's activity

upvoted 3 papers about 1 month ago

Don't Make Your LLM an Evaluation Benchmark Cheater

Paper • 2311.01964 • Published Nov 3, 2023 • 1

KodCode: A Diverse, Challenging, and Verifiable Synthetic Dataset for Coding

Paper • 2503.02951 • Published Mar 4 • 29

Iterative Value Function Optimization for Guided Decoding

Paper • 2503.02368 • Published Mar 4 • 14

upvoted a collection 4 months ago

Standard-format-preference-dataset

We collect the open-source datasets and process them into the standard format. • 14 items • Updated May 8, 2024 • 24

upvoted 2 collections about 1 year ago

🔍 Interpretability & Analysis of LMs

Outstanding research in LM interpretability and evaluation, summarized • 107 items • Updated 2 days ago • 97

Model Merging

Model Merging is a very popular technique nowadays in LLM. Here is a chronological list of papers on the space that will help you get started with it! • 30 items • Updated Jun 12, 2024 • 236

upvoted 2 collections over 1 year ago

Zeroshot Classifiers

These are my current best zeroshot classifiers. Some of my older models are downloaded more often, but the models in this collection are newer/better. • 12 items • Updated Jan 6 • 132

Reward models on the hub

UNMAINTAINED: See RewardBench... A place to collect reward models, an often not released artifact of RLHF. • 18 items • Updated Apr 13, 2024 • 25