GammaCorpus Collection The GammaCorpus Dataset - by Ruben Roy • 11 items • Updated 5 days ago • 3
RiXIS Architecture Collection Collection of RiXIS Models - by Lakaz Manuella and Ruben Roy • 2 items • Updated 4 days ago • 2
MoCapAct Collection Locomotion policies for hundreds of simulated humanoid locomotion clips and demonstration data for training them. • 3 items • Updated 22 days ago • 2
GIT Collection GIT (Generative Image-to-text Transformer) is a model useful for vision-language tasks such as image/video captioning and question answering. • 18 items • Updated 22 days ago • 11
UDOP Collection UDOP is a general multimodal model for document AI • 4 items • Updated 22 days ago • 24
Orca Collection The Orca family of LMs developed by Microsoft. • 2 items • Updated 22 days ago • 8
Biomedical Collection Models for biomedical research applications, such as radiology report generation and biomedical language understanding. • 9 items • Updated 22 days ago • 9
LayoutLM Collection The LayoutLM series are Transformer encoders useful for document AI tasks such as invoice parsing, document image classification and DocVQA. • 5 items • Updated 22 days ago • 15
Table Transformer Collection The Table Transformer (TATR) is a series of object detection models useful for table extraction from PDF images. • 5 items • Updated 22 days ago • 21
TAPEX Collection TAPEX is a series of state-of-the-art table pre-training models that can be used for table-based question answering and table-based fact verification. • 10 items • Updated 22 days ago • 9
SpeechT5 Collection The SpeechT5 framework consists of a shared seq2seq and six modal-specific (speech/text) pre/post-nets that can address a few audio-related tasks. • 8 items • Updated 22 days ago • 24
LLM2CLIP Collection LLM2CLIP makes SOTA pretrained CLIP models more SOTA than ever. • 10 items • Updated 22 days ago • 54
Phi-3 Collection Phi-3 family of small language and multi-modal models. Language models are available in short- and long-context lengths. • 26 items • Updated 22 days ago • 549
RefDPO Collection Model and data collection for our work "Understanding Reference Policies in Direct Preference Optimization" (https://arxiv.org/abs/2407.13709) • 32 items • Updated Jul 19, 2024 • 1
MDCure Collection Models and datasets for our work "MDCure: A Scalable Pipeline for Multi-Document Instruction-Following" (https://arxiv.org/abs/2410.23463) • 11 items • Updated Dec 23, 2024 • 5
GammaCorpus Collection The Full GammaCorpus Dataset Collection • 12 items • Updated 3 days ago • 5