Add TF weights

#1
by amyeroberts HF staff - opened

Model converted by the transformers' pt_to_tf CLI. All converted model outputs and hidden layers were validated against its Pytorch counterpart.

Maximum crossload output difference=1.907e-05; Maximum crossload hidden layer difference=1.520e-03;
Maximum conversion output difference=1.907e-05; Maximum conversion hidden layer difference=1.520e-03;

List of maximum output differences above the threshold (1e-19):
logits: 1.490e-05
cls_logits: 1.276e-05
distillation_logits: 1.907e-05

List of maximum hidden layer differences above the threshold (1e-19):
hidden_states[0]: 1.144e-05
hidden_states[1]: 1.979e-05
hidden_states[2]: 3.432e-05
hidden_states[3]: 4.429e-05
hidden_states[4]: 5.081e-05
hidden_states[5]: 8.702e-05
hidden_states[6]: 2.060e-04
hidden_states[7]: 2.732e-04
hidden_states[8]: 6.547e-04
hidden_states[9]: 1.257e-03
hidden_states[10]: 1.461e-03
hidden_states[11]: 1.497e-03
hidden_states[12]: 1.520e-03

amyeroberts changed pull request status to merged

Sign up or log in to comment