Fredi's picture

Fredi PRO

Fredithefish

·

AI & ML interests

LLMs, Audio, Diffusion Models

Recent Activity

liked a model about 1 month ago

perplexity-ai/r1-1776

liked a model about 1 month ago

agentica-org/DeepScaleR-1.5B-Preview

liked a dataset about 1 month ago

open-r1/OpenR1-Math-220k

View all activity

Organizations

Fredithefish's activity

liked 2 models about 1 month ago

perplexity-ai/r1-1776

Text Generation • Updated 25 days ago • 58.5k • • 2.18k

agentica-org/DeepScaleR-1.5B-Preview

Text Generation • Updated 29 days ago • 74.2k • • 525

liked 2 datasets about 1 month ago

open-r1/OpenR1-Math-220k

Viewer • Updated Feb 18 • 450k • 52.5k • 523

open-r1/OpenR1-Math-Raw

Viewer • Updated 27 days ago • 516k • 1.27k • 72

updated a Space about 1 month ago

First Agent Template

updated a dataset about 1 month ago

Fredithefish/math_dpo

Viewer • Updated Feb 9 • 5M • 58

published a dataset about 1 month ago

Fredithefish/math_dpo

Viewer • Updated Feb 9 • 5M • 58

commented a paper about 1 month ago

You Only Cache Once: Decoder-Decoder Architectures for Language Models

Paper • 2405.05254 • Published May 8, 2024 • 10 •

upvoted a paper about 1 month ago

Sigma: Differential Rescaling of Query, Key and Value for Efficient Language Models

Paper • 2501.13629 • Published Jan 23 • 44

liked a model 4 months ago

Qwen/QwQ-32B-Preview

Text Generation • Updated Jan 12 • 211k • • 1.72k

liked a dataset 5 months ago

neuralwork/arxiver

Viewer • Updated Nov 1, 2024 • 63.4k • 649 • 362

liked 2 models 5 months ago

HuggingFaceTB/SmolLM2-1.7B-Instruct

Text Generation • Updated 17 days ago • 396k • • 581

Etched/oasis-500m

Updated Nov 4, 2024 • 141 • 449

New activity in HuggingFaceFW/fineweb-edu-llama3-annotations 5 months ago

Should the data be filtered to score values between 1-5?

#1 opened 10 months ago by

upvoted an article 5 months ago

Article

Fixing Gradient Accumulation

Oct 16, 2024

• 52

liked 4 models 5 months ago

nvidia/Llama-3.1-Nemotron-70B-Instruct-HF

Text Generation • Updated Oct 25, 2024 • 260k • • 2.03k

Zyphra/Zamba2-7B

Updated Feb 14 • 363 • 112

arcee-ai/SuperNova-Medius

Text Generation • Updated Oct 28, 2024 • 2.51k • 205

allenai/Molmo-7B-D-0924

Image-Text-to-Text • Updated Oct 10, 2024 • 356k • 516