AIMO Progress Prize Collection Models and datasets used in the winning solution to the AIMO 1st Progress Prize • 7 items • Updated 3 days ago • 6
The FineWeb Datasets: Decanting the Web for the Finest Text Data at Scale Paper • 2406.17557 • Published 27 days ago • 75
Parallel Sentences Datasets Collection These datasets all have "english" and "non_english" columns and can be used to make embedding models multilingual. • 13 items • Updated Jun 18 • 5
Embedding Model Datasets Collection A curated subset of the datasets that work out of the box with Sentence Transformers: https://huggingface.co/datasets?other=sentence-transformers • 67 items • Updated 19 days ago • 52
MS MARCO Mined Triplets Collection These datasets contain MS MARCO Triplets gathered by mining hard negatives using various models. Each dataset has various subsets. • 14 items • Updated May 21 • 7
Piccolo2: General Text Embedding with Multi-task Hybrid Loss Training Paper • 2405.06932 • Published May 11 • 15
Replacing Judges with Juries: Evaluating LLM Generations with a Panel of Diverse Models Paper • 2404.18796 • Published Apr 29 • 67
A Careful Examination of Large Language Model Performance on Grade School Arithmetic Paper • 2405.00332 • Published May 1 • 30
SpQR: A Sparse-Quantized Representation for Near-Lossless LLM Weight Compression Paper • 2306.03078 • Published Jun 5, 2023 • 3
The case for 4-bit precision: k-bit Inference Scaling Laws Paper • 2212.09720 • Published Dec 19, 2022 • 3
RecurrentGemma: Moving Past Transformers for Efficient Open Language Models Paper • 2404.07839 • Published Apr 11 • 40
Zeroshot Classifiers Collection These are my current best zeroshot classifiers. Some of my older models are downloaded more often, but the models in this collection are newer and better. • 11 items • Updated Apr 3 • 87
Preference Datasets for KTO Collection This collection contains curated preference datasets for KTO fine-tuning, aligning LLMs with intent through binary feedback signals. • 5 items • Updated Mar 19 • 11
GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection Paper • 2403.03507 • Published Mar 6 • 180
MoE-Mamba: Efficient Selective State Space Models with Mixture of Experts Paper • 2401.04081 • Published Jan 8 • 69
LLM Leaderboard best models ❤️🔥 Collection A daily updated list of the models with the best evaluations on the LLM leaderboard. • 264 items • Updated 30 days ago • 352
FlashDecoding++: Faster Large Language Model Inference on GPUs Paper • 2311.01282 • Published Nov 2, 2023 • 32
Distil-Whisper: Robust Knowledge Distillation via Large-Scale Pseudo Labelling Paper • 2311.00430 • Published Nov 1, 2023 • 54
Retentive Network: A Successor to Transformer for Large Language Models Paper • 2307.08621 • Published Jul 17, 2023 • 169
CoEdIT: Text Editing by Task-Specific Instruction Tuning Paper • 2305.09857 • Published May 17, 2023 • 6
LongNet: Scaling Transformers to 1,000,000,000 Tokens Paper • 2307.02486 • Published Jul 5, 2023 • 80
TART: A plug-and-play Transformer module for task-agnostic reasoning Paper • 2306.07536 • Published Jun 13, 2023 • 10
One-for-All: Generalized LoRA for Parameter-Efficient Fine-tuning Paper • 2306.07967 • Published Jun 13, 2023 • 24