Add TF weights

#1
by amyeroberts HF staff - opened

Model converted by the transformers' pt_to_tf CLI.

All converted model outputs and hidden layers were validated against its Pytorch counterpart. Maximum crossload output difference=2.861e-05; Maximum converted output difference=2.861e-05.

All cross-load differences:

logits: 1.252e-05
hidden_states[0]: 1.431e-06
hidden_states[1]: 1.384e-05
hidden_states[2]: 8.702e-06
hidden_states[3]: 7.004e-06
hidden_states[4]: 2.861e-05

amyeroberts changed pull request status to merged

Sign up or log in to comment