# Future scope of work We hope to improve this in the future by using: - Better translating options. Better translators (for e.g. Google Translate API, Large pre-trained seq2seq models for translation) to get more multilingual data, especially in low-resource languages. - More training time: We found that training image captioning model for a single model takes a lot of compute time and if we want