arxiv:2411.14402
Joshua M. Susskind
jsusskind
AI & ML interests
Generative models, interactive machine learning, understanding ML
Recent Activity
upvoted
a
collection
12 days ago
AIMv2
liked
a model
12 days ago
apple/aimv2-large-patch14-224
authored
a paper
12 days ago
Stabilizing Transformer Training by Preventing Attention Entropy
Collapse
Organizations
Papers
22
models
None public yet
datasets
None public yet