Model converted by the transformers' pt_to_tf CLI. All converted model outputs and hidden layers were validated against its PyTorch counterpart.

Maximum crossload output difference=1.490e-07; Maximum crossload hidden layer difference=1.431e-05;
Maximum conversion output difference=1.490e-07; Maximum conversion hidden layer difference=1.431e-05;

thenlper changed pull request status to merged

Sign up or log in to comment