jina-bert-flash-implementation / convert_v2_weights.py

Commit History

fixed GLU implementation, added conversion of layer norms
9587227

Markus28 commited on

Added GLUMLP, changed config accordingly, added code to convert state_dict
0211324

Markus28 commited on