Commit History

feat: assert return_dict
326b1c4

Markus28 commited on

fix: same assertions in other models
c1d92c9

Markus28 commited on

fix: assert is None for other kwargs too
3f5615c

Markus28 commited on

feat: added head_mask
599c64e

Markus28 commited on

added classifier dropout
767b681

Markus28 commited on

fix: formatting
ae4c28c

Markus28 commited on

fix: formatting
f115a1d

Markus28 commited on

feat: added further GLUE models
ec37ae5

Markus28 commited on

feat: added BertForSequenceClassification
ba24fb1

Markus28 commited on

fix: cast mask to bool
ca5f516

Markus28 commited on

reference the flash attention GitHub
eec6c0e

Markus28 commited on

fix: move flash components into top-level
5944ec8

Markus28 commited on

feat: try to fix import error
4c4562b

Markus28 commited on

feat: moved flash attention code into this repository
46df05d

Markus28 commited on

feat: added encode method
32458be

Markus28 commited on

fix: try to skip initialization of task type embeddings
3b35eab

Markus28 commited on

fix: try to skip initialization of task type embeddings
95ca1a8

Markus28 commited on

feat: added option for QK normalization
463061d

Markus28 commited on

fix: removed obscure config options
2e69073

Markus28 commited on

feat: added small config
149d26f

Markus28 commited on

feat: implement task type embeddings (#1)
8adf551
verified

Markus28 commited on

feat: added back option not to use flash attention
d4d5621

Markus28 commited on

feat: support gradient checkpointing
75d7a16

Markus28 commited on

Added additional config options
5b58f09

Markus28 commited on

removed unused imports
5e7b835

Markus28 commited on

removed __init__ from BertPretrainedModel
44fd417

Markus28 commited on

added config_class and base_model_prefix
45b2292

Markus28 commited on

Fixed typo
80472cb

Markus28 commited on

Fixed typo
6fb6577

Markus28 commited on

Try to subclass PretrainedModel
e209593

Markus28 commited on

Try to subclass PretrainedModel
2b23340

Markus28 commited on

strict=True for debugging
a0c289c

Markus28 commited on

try to simplify checkpointing
4c68a4c

Markus28 commited on

changed model_type
c35343d

Markus28 commited on

feat: added dense_seq_output to config
75a4e4d

Markus28 commited on

removed debugging
c2d8dc3

Markus28 commited on

debugging
c4185ce

Markus28 commited on

debugging
a1e1eff

Markus28 commited on

debugging assertion
4d2995d

Markus28 commited on

fix: fixed get_input_embeddings method
7e06371

Markus28 commited on

feat: added get_input_embeddings method to BertForPreTraining
bb281f0

Markus28 commited on

feat: fixed _from_config
18eed80

Markus28 commited on

feat: changed model_type
eeb05a3

Markus28 commited on

removed from_config
0ce78aa

Markus28 commited on

fix: try to get from_config to work
871fd36

Markus28 commited on

feat: added from_config, also pass additional kwargs from config to model
4164fd6

Markus28 commited on

feat: updated modeling_bert.py to allow MLM-only training
0f43653

Markus28 commited on