Commit History
fix: use proper initialization for embedding layer
850b9a2
Markus28
committed on
fix: fixed typo
5c4e4bf
Markus28
committed on
feat: added LoRA
8561a1f
Markus28
committed on
feat: assert return_dict
326b1c4
Markus28
committed on
fix: same assertions in other models
c1d92c9
Markus28
committed on
fix: assert is None for other kwargs too
3f5615c
Markus28
committed on
feat: added head_mask
599c64e
Markus28
committed on
added classifier dropout
767b681
Markus28
committed on
fix: formatting
ae4c28c
Markus28
committed on
fix: formatting
f115a1d
Markus28
committed on
feat: added further GLUE models
ec37ae5
Markus28
committed on
feat: added BertForSequenceClassification
ba24fb1
Markus28
committed on
fix: cast mask to bool
ca5f516
Markus28
committed on
reference the flash attention GitHub
eec6c0e
Markus28
committed on
fix: move flash components into top-level
5944ec8
Markus28
committed on
feat: try to fix import error
4c4562b
Markus28
committed on
feat: moved flash attention code into this repository
46df05d
Markus28
committed on
add tokenizer
6343db7
michael-guenther
committed on
feat: added encode method
32458be
Markus28
committed on
fix: try to skip initialization of task type embeddings
3b35eab
Markus28
committed on
fix: try to skip initialization of task type embeddings
95ca1a8
Markus28
committed on
feat: added option for QK normalization
463061d
Markus28
committed on
fix: removed obscure config options
2e69073
Markus28
committed on
feat: added small config
149d26f
Markus28
committed on
feat: added back option not to use flash attention
d4d5621
Markus28
committed on
feat: support gradient checkpointing
75d7a16
Markus28
committed on
Added additional config options
5b58f09
Markus28
committed on
removed unused imports
5e7b835
Markus28
committed on
removed __init__ from BertPretrainedModel
44fd417
Markus28
committed on
added config_class and base_model_prefix
45b2292
Markus28
committed on
Fixed typo
80472cb
Markus28
committed on
Fixed typo
6fb6577
Markus28
committed on
Try to subclass PretrainedModel
e209593
Markus28
committed on
Try to subclass PretrainedModel
2b23340
Markus28
committed on
strict=True for debugging
a0c289c
Markus28
committed on
try to simplify checkpointing
4c68a4c
Markus28
committed on
changed model_type
c35343d
Markus28
committed on
feat: added dense_seq_output to config
75a4e4d
Markus28
committed on
removed debugging
c2d8dc3
Markus28
committed on
debugging
c4185ce
Markus28
committed on
debugging
a1e1eff
Markus28
committed on
debugging assertion
4d2995d
Markus28
committed on
fix: fixed get_input_embeddings method
7e06371
Markus28
committed on
feat: added get_input_embeddings method to BertForPreTraining
bb281f0
Markus28
committed on
feat: fixed _from_config
18eed80
Markus28
committed on
feat: changed model_type
eeb05a3
Markus28
committed on
removed from_config
0ce78aa
Markus28
committed on
fix: try to get from_config to work
871fd36
Markus28
committed on