-
gmongaras/CC12M_and_Imagenet21K_Recap_Highqual
Viewer • Updated • 19.8M • 5.05k -
gmongaras/CC12M_and_Imagenet21K_Recap
Viewer • Updated • 22.7M • 3.94k • 3 -
gmongaras/Imagenet21K_Recaption
Viewer • Updated • 13.1M • 12.7k • 2 -
gmongaras/EleutherAI_the_pile_deduplicated
Viewer • Updated • 134M • 399 • 3
Gabriel Mongaras
gmongaras
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
7 days ago
RWKV-7 "Goose" with Expressive Dynamic State Evolution
upvoted
a
paper
11 days ago
Transformers without Normalization
updated
a model
26 days ago
gmongaras/Latent_Diffusion_Model_Imagenet2012_Softmax_250000
Organizations
Collections
6
Models for the paper Cottention: Linear Transformers With Cosine Attention https://arxiv.org/abs/2409.18747
Papers
1
models
19

gmongaras/Latent_Diffusion_Model_Imagenet2012_Softmax_250000
Updated

gmongaras/Softmax_Attention_BERT
Feature Extraction
•
Updated
•
9

gmongaras/Cosine_Attention_BERT
Feature Extraction
•
Updated
•
9

gmongaras/Cosine_Attention_GPT_1.2B
Feature Extraction
•
Updated
•
12

gmongaras/Cosine_Attention_GPT_300M
Feature Extraction
•
Updated
•
15

gmongaras/Softmax_Attention_GPT_1.2B
Feature Extraction
•
Updated
•
10

gmongaras/Softmax_Attention_GPT_300M
Feature Extraction
•
Updated
•
10

gmongaras/Yann_UWU
Text Generation
•
Updated
•
7

gmongaras/Meta-Llama-3.1-8B
Text Generation
•
Updated
•
7

gmongaras/reddit_negative_v1_13B
Text Generation
•
Updated
•
11
•
1
datasets
31
gmongaras/Amazon-Reviews-2023
Viewer
•
Updated
•
572M
•
255
gmongaras/CC12M_and_Imagenet21K_Recap
Viewer
•
Updated
•
22.7M
•
3.94k
•
3
gmongaras/CC12M_and_Imagenet21K_Recap_Highqual
Viewer
•
Updated
•
19.8M
•
5.05k
gmongaras/Imagenet21K_Recaption
Viewer
•
Updated
•
13.1M
•
12.7k
•
2
gmongaras/Imagenet21K
Viewer
•
Updated
•
13.2M
•
5.3k
gmongaras/ImageNet12
Viewer
•
Updated
•
1.28M
•
295
gmongaras/Stack
Updated
•
4
gmongaras/Imagenet21
Updated
•
6
gmongaras/Stable_Diffusion_3_Recaption
Viewer
•
Updated
•
10.9M
•
375
gmongaras/Pile_TokLlama
Updated
•
6