Questions Regarding DPR Fine-Tuning and Dataset Preparation

by Imamwahid - opened May 20

Hello Mas Firqa,

I hope you are doing well. I have been exploring the fine-tuned DPR models and would like to ask a few questions regarding the training process and dataset preparation:

How was the dataset transformed from the SQuAD 2.0 format into the DPR training format?
How many training instances were used after the dataset preparation process for fine-tuning the DPR model?
Was the model fine-tuned using IndoBERT? If so, which IndoBERT paper or pretrained model did you refer to?
Do the context encoder and question encoder use the same encoder architecture or weights?

Thank you for your time and for sharing your work. I really appreciate it and look forward to your response.
