Joshua M. Susskind's picture

1 2 2

Joshua M. Susskind

jsusskind

·

AI & ML interests

Generative models, interactive machine learning, understanding ML

Recent Activity

upvoted a collection 4 days ago

liked a model 4 days ago

apple/aimv2-large-patch14-224

authored a paper 4 days ago

Stabilizing Transformer Training by Preventing Attention Entropy Collapse

View all activity

Organizations

jsusskind's activity

upvoted a collection 4 days ago

AIMv2

A collection of AIMv2 vision encoders that supports a number of resolutions, native resolution, and a distilled checkpoint. • 19 items • Updated 5 days ago • 54

liked a model 4 days ago

apple/aimv2-large-patch14-224

Image Feature Extraction • Updated 5 days ago • 1.39k • 25

authored 10 papers 4 days ago

Stabilizing Transformer Training by Preventing Attention Entropy Collapse

Paper • 2303.06296 • Published Mar 11, 2023

Learning Controllable 3D Diffusion Models from Single-view Images

Paper • 2304.06700 • Published Apr 13, 2023

Generative Modeling with Phase Stochastic Bridges

Paper • 2310.07805 • Published Oct 11, 2023

Matryoshka Diffusion Models

Paper • 2310.15111 • Published Oct 23, 2023 • 40

What Algorithms can Transformers Learn? A Study in Length Generalization

Paper • 2310.16028 • Published Oct 24, 2023 • 2

Value function estimation using conditional diffusion models for control

Paper • 2306.07290 • Published Jun 9, 2023

Overcoming the Pitfalls of Vision-Language Model Finetuning for OOD Generalization

Paper • 2401.15914 • Published Jan 29 • 7

How Far Are We from Intelligent Visual Deductive Reasoning?

Paper • 2403.04732 • Published Mar 7 • 18

NerfDiff: Single-image View Synthesis with NeRF-guided Distillation from 3D-aware Diffusion

Paper • 2302.10109 • Published Feb 20, 2023

DART: Denoising Autoregressive Transformer for Scalable Text-to-Image Generation

Paper • 2410.08159 • Published Oct 10 • 24

authored a paper 5 days ago

Multimodal Autoregressive Pre-training of Large Vision Encoders

Paper • 2411.14402 • Published 5 days ago • 36

authored a paper 6 months ago

Kaleido Diffusion: Improving Conditional Diffusion Models with Autoregressive Latent Modeling

Paper • 2405.21048 • Published May 31 • 12

upvoted a collection 10 months ago

AIM

AIM: Autoregressive Image Models • 5 items • Updated 29 days ago • 49

liked a model 10 months ago

apple/AIM

Image Classification • Updated Jan 22 • 87

authored 4 papers 10 months ago

When can transformers reason with abstract symbols?

Paper • 2310.09753 • Published Oct 15, 2023 • 2

Position Prediction as an Effective Pretraining Strategy

Paper • 2207.07611 • Published Jul 15, 2022 • 1

Generating Molecular Conformer Fields

Paper • 2311.17932 • Published Nov 27, 2023

Scalable Pre-training of Large Autoregressive Image Models

Paper • 2401.08541 • Published Jan 16 • 36