Naijaweb datasets 🇳🇬 Collection A recreation of the fineweb collection for Nigerians • 3 items • Updated Oct 24 • 5
Chameleon Collection Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR. • 2 items • Updated Jul 9 • 27
Awesome Document AI Collection A collection of open-source document AI • 27 items • Updated Mar 11 • 74
SpreadsheetLLM: Encoding Spreadsheets for Large Language Models Paper • 2407.09025 • Published Jul 12 • 129
Phi-3 Collection Phi-3 family of small language and multi-modal models. Language models are available in short- and long-context lengths. • 26 items • Updated Nov 14 • 533