Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
timaeus
's Collections
Datasets: Pile Subsets
Projects: Finetuning
Project: Lang2
Project: Lang1
Project: ICL1
Models: dh
Models: H-dh
Models: H
Models: L
Models: dm
Datasets: Suffixes
Datasets: Prefixes
Datasets: Delimiters
Datasets: Currencies
Datasets: Other
Models: H
updated
Oct 18
Attention-only transformers, sweep over number of heads (for fixed head dimension)
Upvote
-
timaeus/H1-dh32
Updated
Oct 18
•
3
timaeus/H2-dh32
Updated
Oct 17
•
3
timaeus/H4-dh32
Updated
Oct 17
•
8
timaeus/L2
Updated
Oct 18
•
11
timaeus/H16-dh32
Updated
Oct 17
•
3
timaeus/H32-dh32
Updated
Oct 17
•
5
timaeus/H64-dh32
Updated
Oct 17
•
3
Upvote
-
Share collection
View history
Collection guide
Browse collections