PommesPeter

AI & ML interests

MM-LLM

Recent Activity

published a model 6 days ago

PommesPeter/pi0-vlm-all-data-wrist-20250326

published a model 6 days ago

PommesPeter/pi0-cleaning-ac20-20250320

liked a dataset about 2 months ago

agentica-org/DeepScaleR-Preview-Dataset

PommesPeter's activity

upvoted a collection 6 months ago

Llama 3.2

This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 • 15 items • Updated Dec 6, 2024 • 585

upvoted 3 papers 6 months ago

PixWizard: Versatile Image-to-Image Visual Assistant with Open-Language Instructions

Paper • 2409.15278 • Published Sep 23, 2024 • 25

A Preliminary Study of o1 in Medicine: Are We Closer to an AI Doctor?

Paper • 2409.15277 • Published Sep 23, 2024 • 37

Training Language Models to Self-Correct via Reinforcement Learning

Paper • 2409.12917 • Published Sep 19, 2024 • 139

upvoted a paper 9 months ago

Chameleon: Mixed-Modal Early-Fusion Foundation Models

Paper • 2405.09818 • Published May 16, 2024 • 131

upvoted an article 9 months ago

We are hiring interns!

Article • Published Nov 29, 2022 • 12

upvoted a collection 10 months ago

Lumina Family

Lumina-T2X is a unified framework for Text to Any Modality Generation • 8 items • Updated Jul 30, 2024 • 6

upvoted 2 collections 11 months ago

SPHINX Family

2 items • Updated May 18, 2024 • 1

ZeroGPU Spaces

ZeroGPU Spaces made by the community • 17 items • Updated Jun 6, 2024 • 236

upvoted an article 12 months ago

Fit More and Train Faster With ZeRO via DeepSpeed and FairScale

Article • Published Jan 19, 2021 • 4

upvoted a collection 12 months ago

WizardLM

0 items • Updated Jan 8 • 108

upvoted 3 papers about 1 year ago

LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens

Paper • 2402.13753 • Published Feb 21, 2024 • 115

BurstAttention: An Efficient Distributed Attention Framework for Extremely Long Sequences

Paper • 2403.09347 • Published Mar 14, 2024 • 22

Meta-Transformer: A Unified Framework for Multimodal Learning

Paper • 2307.10802 • Published Jul 20, 2023 • 44

upvoted a paper over 1 year ago

Zephyr: Direct Distillation of LM Alignment

Paper • 2310.16944 • Published Oct 25, 2023 • 123