Add TF weights

#1
by amyeroberts HF staff - opened

Model converted by the transformers' pt_to_tf CLI.

All converted model outputs and hidden layers were validated against its Pytorch counterpart. Maximum crossload output difference=1.123e-03; Maximum converted output difference=1.123e-03.

All crossload differences

logits: 2.480e-05
hidden_states[0]: 2.003e-05
hidden_states[1]: 1.278e-04
hidden_states[2]: 1.110e-03
hidden_states[3]: 1.123e-03
hidden_states[4]: 1.023e-04

amyeroberts changed pull request status to merged

Sign up or log in to comment