Akhiad
Informer
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
about 1 month ago
Mixture-of-Transformers: A Sparse and Scalable Architecture for
Multi-Modal Foundation Models
Organizations
None yet
Informer's activity
TypeError: DeciLMAttention.forward() got an unexpected keyword argument 'padding_mask'
4
#6 opened about 1 year ago
by
LaferriereJC
Is a 13B and larger versions planned?
1
#4 opened over 1 year ago
by
coremic