timaeus's Collections
Models: dh
Updated Oct 18, 2024
Attention-only transformers, sweep over head dimension
timaeus/H8-dh4 • Updated Oct 17, 2024 • 3
timaeus/H8-dh8 • Updated Oct 17, 2024 • 2
timaeus/H8-dh16 • Updated Oct 17, 2024 • 3
timaeus/L2 • Updated Oct 18, 2024 • 6
timaeus/H8-dh64 • Updated Oct 17, 2024 • 3
timaeus/H8-dh128 • Updated Oct 17, 2024 • 3
timaeus/H8-dh256 • Updated Oct 17, 2024 • 2
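
The repository names above appear to encode the sweep parameters. The following is a minimal sketch for extracting them, assuming that "H<n>" denotes the number of attention heads and "dh<n>" the per-head dimension. That naming scheme is inferred from the collection description and is not confirmed by the page. The helper `parse_sweep` and the pattern `NAME_PATTERN` are hypothetical names introduced only for illustration.

```python
import re

# Repository IDs exactly as listed in this collection.
REPO_IDS = [
    "timaeus/H8-dh4",
    "timaeus/H8-dh8",
    "timaeus/H8-dh16",
    "timaeus/L2",
    "timaeus/H8-dh64",
    "timaeus/H8-dh128",
    "timaeus/H8-dh256",
]

# Assumed naming scheme (not confirmed by the collection page):
# "H<n>" = number of attention heads, "dh<n>" = per-head dimension.
NAME_PATTERN = re.compile(r"^timaeus/H(?P<heads>\d+)-dh(?P<head_dim>\d+)$")


def parse_sweep(repo_ids):
    """Return {repo_id: (heads, head_dim)} for IDs matching the assumed scheme."""
    parsed = {}
    for repo_id in repo_ids:
        match = NAME_PATTERN.match(repo_id)
        if match:
            parsed[repo_id] = (int(match["heads"]), int(match["head_dim"]))
    return parsed


sweep = parse_sweep(REPO_IDS)

# Head dimensions covered by the entries that follow the assumed scheme.
head_dims = sorted(dim for _, dim in sweep.values())

# Entries whose names do not follow the assumed scheme are reported, not guessed at.
unmatched = [repo_id for repo_id in REPO_IDS if repo_id not in sweep]

print(head_dims)
print(unmatched)
```

Entries that do not match the assumed pattern, such as timaeus/L2, are left in `unmatched` rather than being assigned a head dimension.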