CommonCanvas Collection Collection of models trained on the CommonCatalogue datasets โข 8 items โข Updated 29 days ago โข 6
ElanMT Collection Japanese English Machine Translation trained on openly licensed corpus โข 5 items โข Updated 26 days ago โข 1
Common Corpus Collection The largest public domain dataset for training LLMs. โข 26 items โข Updated Mar 20 โข 104
CommonCanvas: An Open Diffusion Model Trained with Creative-Commons Images Paper โข 2310.16825 โข Published Oct 25, 2023 โข 29
Tiny Series Collection Tiny datasets that empower the foundation of Small Language Model! โข 11 items โข Updated Jan 26 โข 32