Leveraging Model Soups to Classify Intangible Cultural Heritage Images from the Mekong Delta Paper • 2603.02181 • Published Mar 2
ViCLIP-OT: The First Foundation Vision-Language Model for Vietnamese Image-Text Retrieval with Optimal Transport Paper • 2602.22678 • Published Feb 26
ViCLIP-OT: The First Foundation Vision-Language Model for Vietnamese Image-Text Retrieval with Optimal Transport Paper • 2602.22678 • Published Feb 26