Add TF weights

#1
by amyeroberts HF staff - opened

Model converted by the transformers' pt_to_tf CLI. All converted model outputs and hidden layers were validated against its Pytorch counterpart.

Maximum crossload output difference=1.502e-05; Maximum crossload hidden layer difference=6.371e-04;
Maximum conversion output difference=1.502e-05; Maximum conversion hidden layer difference=6.371e-04;

List of maximum output differences above the threshold (1e-19):
logits: 1.502e-05

List of maximum hidden layer differences above the threshold (1e-19):
hidden_states[0]: 6.199e-06
hidden_states[1]: 8.106e-06
hidden_states[2]: 1.155e-05
hidden_states[3]: 1.472e-05
hidden_states[4]: 2.205e-05
hidden_states[5]: 2.849e-05
hidden_states[6]: 5.364e-05
hidden_states[7]: 1.538e-04
hidden_states[8]: 2.689e-04
hidden_states[9]: 3.753e-04
hidden_states[10]: 5.169e-04
hidden_states[11]: 6.371e-04
hidden_states[12]: 6.227e-04

amyeroberts changed pull request status to merged

Sign up or log in to comment