AlphaSue

AI & ML interests

None yet

Recent Activity

liked a Space 14 days ago

LLM360/TxT360

liked a model 21 days ago

jinaai/ReaderLM-v2

liked a Space 22 days ago

nanotron/ultrascale-playbook

Organizations

None yet

AlphaSue's activity

liked a Space 14 days ago

106

TxT360: Trillion Extracted Text

📖

Create a large, deduplicated dataset for LLM pre-training

liked a model 21 days ago

jinaai/ReaderLM-v2

Text Generation • Updated 10 days ago • 29.3k • 561

liked a Space 22 days ago

2.25k

The Ultra-Scale Playbook

🌌

The ultimate guide to training LLM on large GPU Clusters

upvoted an article about 2 months ago

Article

Mixture of Experts Explained

Dec 11, 2023 • 454

upvoted a collection 2 months ago

Papers I've read

Collection

16 items • Updated Jan 12 • 6

liked a dataset 3 months ago

microsoft/RedStone

Updated Dec 5, 2024 • 226 • 33

liked a model 3 months ago

open-web-math/filtering-models

Updated Nov 2, 2023 • 9

liked a dataset 3 months ago

m-a-p/FineFineWeb

Viewer • Updated Dec 19, 2024 • 4.89B • 1.06M • 40

upvoted a paper 4 months ago

JudgeBench: A Benchmark for Evaluating LLM-based Judges

Paper • 2410.12784 • Published Oct 16, 2024 • 46

New activity in jinaai/reader-lm-1.5b 5 months ago

Temperature and repetition_penalty

#1 opened 6 months ago by ayyylol

liked 2 models 7 months ago

nvidia/quality-classifier-deberta

Updated Jan 31 • 13.1k • 56

oliverguhr/fullstop-punctuation-multilang-large

Token Classification • Updated Nov 16, 2023 • 309k • 159

liked a dataset 9 months ago

teknium/OpenHermes-2.5

Viewer • Updated Apr 15, 2024 • 1M • 1.85k • 719

liked a model 9 months ago

Snowflake/snowflake-arctic-embed-m

liked a Space 9 months ago

872

FineWeb: decanting the web for the finest text data at scale

🍷

Generate high-quality web text data for LLM training

liked 4 datasets 10 months ago

upvoted an article 11 months ago

Article

Large-scale Near-deduplication Behind BigCode

May 16, 2023 • 22