Markus Krimmel
Markus28
AI & ML interests
None yet
Organizations
None yet
Markus28's activity
feat: selective activation checkpointing
#16 opened 8 months ago
by
Markus28
Porting v2 models to flash attention
#15 opened 8 months ago
by
Markus28
feat: updated activation checkpointing
#14 opened 8 months ago
by
Markus28
feat: Allow LoRA to be merged into weights
#12 opened 8 months ago
by
Markus28
fix: remove cleaving
#13 opened 8 months ago
by
Markus28
feat: cleave off layers from encoder
#11 opened 8 months ago
by
Markus28
clean up embeddings.py
#7 opened 9 months ago
by
bwang0911
Positional Interpolation
#14 opened 9 months ago
by
Markus28
support-multiple-task-ids
#5 opened 9 months ago
by
michael-guenther
Global CLS attention
#13 opened 9 months ago
by
Markus28
feat: implement task type embeddings
#1 opened 9 months ago
by
Markus28
Use attention dropout during training
1
#1 opened 9 months ago
by
Markus28
Use attention dropout during training
2
#10 opened 9 months ago
by
Markus28
Fix sorting heuristic
1
#3 opened about 1 year ago
by
Markus28
Fix sorting heuristic
1
#3 opened about 1 year ago
by
Markus28