33 18 12

Leshem Choshen

borgr

https://ktilana.wixsite.com/leshem-choshen

AI & ML interests

Merging models, collaboratively improving pretraining, evaluation, understanding

Recent Activity

commented on an article 4 days ago

Cohere on Hugging Face Inference Providers 🔥

commented on an article 6 days ago

Cohere on Hugging Face Inference Providers 🔥

upvoted a paper 7 days ago

TextArena

View all activity

Organizations

borgr's activity

upvoted a paper 7 days ago

TextArena

Paper • 2504.11442 • Published 7 days ago • 27

upvoted 2 papers 12 days ago

Pretraining Language Models for Diachronic Linguistic Change Discovery

Paper • 2504.05523 • Published 15 days ago • 6

Fusing finetuned models for better pretraining

Paper • 2204.03044 • Published Apr 6, 2022 • 6

upvoted a paper 18 days ago

Scaling Analysis of Interleaved Speech-Text Language Models

Paper • 2504.02398 • Published 19 days ago • 27

upvoted a paper about 1 month ago

Survey on Evaluation of LLM-based Agents

Paper • 2503.16416 • Published Mar 20 • 88

upvoted a paper 2 months ago

Can this Model Also Recognize Dogs? Zero-Shot Model Search from Weights

Paper • 2502.09619 • Published Feb 13 • 35

upvoted a collection 3 months ago

Dicta-LM 2.0 Collection

Collection

9 items • Updated Apr 27, 2024 • 16

upvoted a paper 5 months ago

Global MMLU: Understanding and Addressing Cultural and Linguistic Biases in Multilingual Evaluation

Paper • 2412.03304 • Published Dec 4, 2024 • 19

upvoted a paper 6 months ago

LiveXiv -- A Multi-Modal Live Benchmark Based on Arxiv Papers Content

Paper • 2410.10783 • Published Oct 14, 2024 • 28

upvoted 2 papers 7 months ago

SELECT: A Large-Scale Benchmark of Data Curation Strategies for Image Classification

Paper • 2410.05057 • Published Oct 7, 2024 • 7

Acceptable Use Policies for Foundation Models

Paper • 2409.09041 • Published Aug 29, 2024 • 1

upvoted 3 papers 8 months ago

The Future of Open Human Feedback

Paper • 2408.16961 • Published Aug 15, 2024 • 21

The ShareLM Collection and Plugin: Contributing Human-Model Chats for the Benefit of the Community

Paper • 2408.08291 • Published Aug 15, 2024 • 11

Learning from Naturally Occurring Feedback

Paper • 2407.10944 • Published Jul 15, 2024 • 4

upvoted a paper 9 months ago

Benchmark Agreement Testing Done Right: A Guide for LLM Benchmark Evaluation

Paper • 2407.13696 • Published Jul 18, 2024 • 5

upvoted a paper 10 months ago

Is It Really Long Context if All You Need Is Retrieval? Towards Genuinely Difficult Long Context NLP

Paper • 2407.00402 • Published Jun 29, 2024 • 23

upvoted a paper 11 months ago

Large Language Model Confidence Estimation via Black-Box Access

Paper • 2406.04370 • Published Jun 1, 2024 • 23