
Stephen Oates PRO

soates

AI & ML interests

None yet

Recent Activity

upvoted a paper 24 days ago: Qwen2.5 Technical Report
liked a model about 1 month ago: Datou1111/shou_xin

Organizations

None yet

soates's activity

upvoted an article about 9 hours ago:
Mastering Tensor Dimensions in Transformers, by not-lain • 18
upvoted an article 4 months ago:
Fine-tuning LLMs to 1.58bit: extreme quantization made easy • 216
upvoted 2 articles 5 months ago:
Llama-3.1-Storm-8B: Improved SLM with Self-Curation + Model Merging, by akjindal53244 • 75
A failed experiment: Infini-Attention, and why we should keep trying? • 57
upvoted an article 7 months ago
reacted to BramVanroy's post with 👍 10 months ago:

Does anyone have experience with finetuning Gemma? Even the 2B variant feels more memory-heavy than Mistral 7B. I know that its vocabulary is much larger (250k), but I'm a bit surprised that the max batch size I can get on an A100 80GB is only 2, whereas I could fit 4 with Mistral 7B, even though Gemma is much smaller except for the embedding layer. Both runs were using FlashAttention, the same sequence length, and the same DeepSpeed ZeRO-3 settings. Oh, and yes, I'm using the most recent hotfix of transformers that solves a memory issue with Gemma and others.

Any prior experience that you can share, or suggestions to improve throughput?
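A back-of-envelope calculation suggests where much of the gap comes from. The post mentions Gemma's ~250k vocabulary; assuming the publicly documented shapes (Gemma 2B: vocab ~256,000, hidden size 2048; Mistral 7B: vocab 32,000, hidden size 4096, both approximate), the embedding matrix and, more importantly, the per-step logits tensor (batch × sequence × vocab) are far larger for Gemma. This is an illustrative sketch, not an exact memory profile:

```python
# Rough memory comparison: Gemma 2B vs Mistral 7B.
# Assumed shapes (approximate, from public model configs):
#   Gemma 2B:   vocab_size ~256,000, hidden_size 2048
#   Mistral 7B: vocab_size 32,000,   hidden_size 4096

BYTES_BF16 = 2  # bf16/fp16 weights
BYTES_FP32 = 4  # logits are often upcast to fp32 for the loss

def embedding_bytes(vocab_size: int, hidden_size: int) -> int:
    """Size of one embedding matrix at 2 bytes per parameter."""
    return vocab_size * hidden_size * BYTES_BF16

def logits_bytes(batch: int, seq_len: int, vocab_size: int) -> int:
    """Size of one fp32 logits tensor for a single forward pass."""
    return batch * seq_len * vocab_size * BYTES_FP32

# Embedding weights (sharded under ZeRO-3, so less of an issue per GPU):
print(f"Gemma 2B embedding:   {embedding_bytes(256_000, 2048) / 2**30:.2f} GiB")  # 0.98 GiB
print(f"Mistral 7B embedding: {embedding_bytes(32_000, 4096) / 2**30:.2f} GiB")   # 0.24 GiB

# Logits activation for batch 4, seq len 2048 (not sharded by ZeRO-3):
print(f"Gemma logits:   {logits_bytes(4, 2048, 256_000) / 2**30:.2f} GiB")  # 7.81 GiB
print(f"Mistral logits: {logits_bytes(4, 2048, 32_000) / 2**30:.2f} GiB")   # 0.98 GiB
```

Since ZeRO-3 shards parameters but not per-step activations, the fp32 logits tensor alone could plausibly account for a large share of the batch-size difference the post describes.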
  • 4 replies