This folder contains common functions for data cleaning, clustering, train-test splitting, visualization, embedding, and logging.
The functions in these scripts are used throughout the repository for training the main model, FusOn-pLM, and for running the benchmarks.