minlik
's Collections
LongLoRA: Efficient Fine-tuning of Long-Context Large Language Models
Paper
•
2309.12307
•
Published
•
82
LMDX: Language Model-based Document Information Extraction and
Localization
Paper
•
2309.10952
•
Published
•
60
Table-GPT: Table-tuned GPT for Diverse Table Tasks
Paper
•
2310.09263
•
Published
•
36
BitNet: Scaling 1-bit Transformers for Large Language Models
Paper
•
2310.11453
•
Published
•
93
TEQ: Trainable Equivalent Transformation for Quantization of LLMs
Paper
•
2310.10944
•
Published
•
9
TableGPT: Towards Unifying Tables, Nature Language and Commands into One
GPT
Paper
•
2307.08674
•
Published
•
46
UniversalNER: Targeted Distillation from Large Language Models for Open
Named Entity Recognition
Paper
•
2308.03279
•
Published
•
19
MultiLoRA: Democratizing LoRA for Better Multi-Task Learning
Paper
•
2311.11501
•
Published
•
32
YaRN: Efficient Context Window Extension of Large Language Models
Paper
•
2309.00071
•
Published
•
57
DocLLM: A layout-aware generative language model for multimodal document
understanding
Paper
•
2401.00908
•
Published
•
173
LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning
Paper
•
2401.01325
•
Published
•
24
Improving Text Embeddings with Large Language Models
Paper
•
2401.00368
•
Published
•
72
OLMo: Accelerating the Science of Language Models
Paper
•
2402.00838
•
Published
•
74
LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens
Paper
•
2402.13753
•
Published
•
104
DoRA: Weight-Decomposed Low-Rank Adaptation
Paper
•
2402.09353
•
Published
•
18
LoRA+: Efficient Low Rank Adaptation of Large Models
Paper
•
2402.12354
•
Published
•
5
Chain-of-Table: Evolving Tables in the Reasoning Chain for Table
Understanding
Paper
•
2401.04398
•
Published
•
18
A Systematic Survey of Prompt Engineering in Large Language Models:
Techniques and Applications
Paper
•
2402.07927
•
Published
•
1
Simple and Scalable Strategies to Continually Pre-train Large Language
Models
Paper
•
2403.08763
•
Published
•
48
LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders
Paper
•
2404.05961
•
Published
•
61