Pasquale Minervini

pminervini

https://www.neuralnoise.com

AI & ML interests

None yet

Recent Activity

updated a model 29 days ago

pminervini/OLMo-2-1124-7B-GGUF

updated a model 29 days ago

pminervini/OLMo-2-1124-13B-GGUF

updated a dataset about 2 months ago

hallucinations-leaderboard/requests

Articles

The Open Medical-LLM Leaderboard: Benchmarking Large Language Models in Healthcare

Apr 19

• 126

The Hallucinations Leaderboard, an Open Effort to Measure Hallucinations in Large Language Models

Jan 29

• 17

Organizations

pminervini's activity

upvoted a collection about 2 months ago

🔍 Daily Picks in Interpretability & Analysis of LMs

Collection

Outstanding research in interpretability and evaluation of language models, summarized • 90 items • Updated 6 days ago • 93

upvoted 3 papers 2 months ago

Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering

Paper • 2410.15999 • Published Oct 21 • 19

DeCoRe: Decoding by Contrasting Retrieval Heads to Mitigate Hallucinations

Paper • 2410.18860 • Published Oct 24 • 9

FLARE: Faithful Logic-Aided Reasoning and Exploration

Paper • 2410.11900 • Published Oct 14 • 3

upvoted a collection 3 months ago

Molmo

Collection

Artifacts for open multimodal language models. • 5 items • Updated 28 days ago • 289

upvoted 2 papers 6 months ago

A Simple and Effective L_2 Norm-Based Strategy for KV Cache Compression

Paper • 2406.11430 • Published Jun 17 • 22

No Train No Gain: Revisiting Efficient Training Algorithms For Transformer-based Language Models

Paper • 2307.06440 • Published Jul 12, 2023 • 3

upvoted a paper 7 months ago

Are We Done with MMLU?

Paper • 2406.04127 • Published Jun 6 • 37

upvoted an article 7 months ago

Article

Let's talk about LLM evaluation

May 23

• 140

upvoted an article 8 months ago

Article

The Open Medical-LLM Leaderboard: Benchmarking Large Language Models in Healthcare

Apr 19

• 126

upvoted a collection 10 months ago

Open LLM Leaderboard best models ❤️‍🔥

Collection

A daily uploaded list of models with best evaluations on the LLM leaderboard: • 63 items • Updated 33 minutes ago • 483