Upload tf_model.h5

#1
by hoanhtu - opened
No description provided.

@LysandreJik - good to merge for you?

Facebook AI community org

I think it's important to verify equivalence between the PyTorch and TensorFlow models. @Rocketknight1 , @joaogante , would you have the bandwidth to write a small script that would allow us to check the equivalence of models in PRs?

I'll add it today as a CLI (I have a personal script), and will run the checks against these weights 👍

The weights loaded successfully, but the model output was very different -- 30.353527069091797, which is much much higher than our 1e-5 TF-PT tolerance. The TF weights file is also much smaller than the PT/Flax weights (~500 MB vs ~2 GB).

Perhaps the model weights were stored in half-precision ( @hoanhtu )? In any case, I do NOT advise merging this PR.

Thanks for double-checking here @joaogante !

(closing the PR in favor of https://huggingface.co/xlm-roberta-large/discussions/3, which I will be merging )

joaogante changed pull request status to closed
Facebook AI community org

@hoanhtu out of curiosity, do you remember how you created this checkpoint?

Sign up or log in to comment