Matthias Seeger
mseeger
·
AI & ML interests
None yet
Recent Activity
commented on
an
article
about 2 months ago
Everything About Long Context Fine-tuning
commented on
an
article
2 months ago
Open-R1: a fully open reproduction of DeepSeek-R1
commented on
an
article
2 months ago
Open-R1: a fully open reproduction of DeepSeek-R1
Organizations
None yet
mseeger's activity
Exact computations for multi-head latent attention
1
#9 opened 2 months ago
by
mseeger
hidden_size % num_attention_heads != 0
#2 opened 4 months ago
by
mseeger