- google/flan-t5-large
  Text2Text Generation • Updated • 1.39M • 468
- deepseek-ai/deepseek-coder-6.7b-instruct
  Text Generation • Updated • 52k • 315
- Object Recognition as Next Token Prediction
  Paper • 2312.02142 • Published • 11
- colbert-ir/dspy-Oct11-T5-Large-MH-3k-v1
  Text2Text Generation • Updated • 16 • 1
Collections including paper arxiv:2402.00838
- Holistic Evaluation of Text-To-Image Models
  Paper • 2311.04287 • Published • 10
- MEGAVERSE: Benchmarking Large Language Models Across Languages, Modalities, Models and Tasks
  Paper • 2311.07463 • Published • 13
- Trusted Source Alignment in Large Language Models
  Paper • 2311.06697 • Published • 9
- DiLoCo: Distributed Low-Communication Training of Language Models
  Paper • 2311.08105 • Published • 13
- LongLoRA: Efficient Fine-tuning of Long-Context Large Language Models
  Paper • 2309.12307 • Published • 83
- LMDX: Language Model-based Document Information Extraction and Localization
  Paper • 2309.10952 • Published • 61
- Table-GPT: Table-tuned GPT for Diverse Table Tasks
  Paper • 2310.09263 • Published • 37
- BitNet: Scaling 1-bit Transformers for Large Language Models
  Paper • 2310.11453 • Published • 94