Would there be confusion between CER and WER on test metrics?

#1
by nikokks - opened

Hello!

The results are very impressive, and I am a little surprised: the best wav2vec2 reference model, trained six months ago on CV 7.0, reports a CER comparable to your WER. Is the score on your test set the WER or the CER? I don't have time to check it myself.

Have a nice day!

See for yourself with these two links:
https://paperswithcode.com/sota/speech-recognition-on-common-voice-7-0-german-1
https://paperswithcode.com/sota/speech-recognition-on-common-voice-7-0-german?metric=Test%20WER

The links you provided are for German, but this is the French model card. I assume you are asking about French: https://paperswithcode.com/sota/automatic-speech-recognition-on-mcv-7-0

Yes, the results calculated here are WER, not CER. We normally do not publish CER scores for languages where WER can be computed.
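To make the distinction concrete, here is a minimal sketch using the jiwer library (the example strings are made up) showing how far apart WER and CER land on the same hypothesis: a single wrong word costs a full word in WER but only a character or two in CER, which is why a strong WER can look like another model's CER.

```python
# Minimal illustration of why CER is numerically much lower than WER
# for the same transcript. Example strings are invented for illustration.
import jiwer

reference = "le chat dort sur le canapé"
hypothesis = "le chats dort sur le canapé"  # one extra character, one wrong word

print("WER:", jiwer.wer(reference, hypothesis))  # 1 substitution / 6 words  ≈ 0.167
print("CER:", jiwer.cer(reference, hypothesis))  # 1 insertion / 26 characters ≈ 0.038
```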

There are a few reasons why the WER here is so competitive:

  1. This is a Conformer Transducer. Transducer models are generally much more accurate than CTC models, and Conformer CTC is also more accurate than wav2vec2 CTC in nearly all cases.
  2. These models are jointly trained: they are trained on both MCV and MLS French, so it is expected that their score on MCV alone is better than that of a model trained only on MCV.
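If anyone wants to double-check the metric themselves, a rough sketch along these lines should work. The checkpoint name, audio paths, and reference transcripts below are placeholders, and the exact `transcribe()` signature and return type vary across NeMo versions, so treat this as a sketch rather than the official evaluation script.

```python
# Rough verification sketch (not the official evaluation recipe).
# Assumes NeMo and jiwer are installed; the checkpoint name and file paths are placeholders.
import jiwer
import nemo.collections.asr as nemo_asr

# Load the Conformer Transducer checkpoint (name assumed, adjust to this card's model id).
asr_model = nemo_asr.models.ASRModel.from_pretrained("stt_fr_conformer_transducer_large")

# Hypothetical files and reference transcripts from the MCV 7.0 French test split.
audio_files = ["mcv_fr_test_0001.wav", "mcv_fr_test_0002.wav"]
references = ["bonjour tout le monde", "merci beaucoup"]

hyps = asr_model.transcribe(audio_files)
if isinstance(hyps, tuple):  # some NeMo versions return (best_hyps, all_hyps) for transducers
    hyps = hyps[0]
hyps = [h.text if hasattr(h, "text") else h for h in hyps]  # normalize to plain strings

print("WER:", jiwer.wer(references, hyps))
print("CER:", jiwer.cer(references, hyps))
```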
