u3854
's Collections
daily-papers
updated
Fast Matrix Multiplications for Lookup Table-Quantized LLMs
Paper
•
2407.10960
•
Published
•
12
ChatQA 2: Bridging the Gap to Proprietary LLMs in Long Context and RAG
Capabilities
Paper
•
2407.14482
•
Published
•
26
EVLM: An Efficient Vision-Language Model for Visual Understanding
Paper
•
2407.14177
•
Published
•
43
Knowledge Mechanisms in Large Language Models: A Survey and Perspective
Paper
•
2407.15017
•
Published
•
34
Compact Language Models via Pruning and Knowledge Distillation
Paper
•
2407.14679
•
Published
•
39
DDK: Distilling Domain Knowledge for Efficient Large Language Models
Paper
•
2407.16154
•
Published
•
21
PERSONA: A Reproducible Testbed for Pluralistic Alignment
Paper
•
2407.17387
•
Published
•
18
LAMBDA: A Large Model Based Data Agent
Paper
•
2407.17535
•
Published
•
35
Wolf: Captioning Everything with a World Summarization Framework
Paper
•
2407.18908
•
Published
•
32
MMAU: A Holistic Benchmark of Agent Capabilities Across Diverse Domains
Paper
•
2407.18961
•
Published
•
40
Tails Tell Tales: Chapter-Wide Manga Transcriptions with Character Names
Paper
•
2408.00298
•
Published
•
9
Finch: Prompt-guided Key-Value Cache Compression
Paper
•
2408.00167
•
Published
•
13
Improving Text Embeddings for Smaller Language Models Using Contrastive
Fine-tuning
Paper
•
2408.00690
•
Published
•
23
Gemma 2: Improving Open Language Models at a Practical Size
Paper
•
2408.00118
•
Published
•
75
SAM 2: Segment Anything in Images and Videos
Paper
•
2408.00714
•
Published
•
109
RAG Foundry: A Framework for Enhancing LLMs for Retrieval Augmented
Generation
Paper
•
2408.02545
•
Published
•
35
MiniCPM-V: A GPT-4V Level MLLM on Your Phone
Paper
•
2408.01800
•
Published
•
79
Synthesizing Text-to-SQL Data from Weak and Strong LLMs
Paper
•
2408.03256
•
Published
•
10