-
SnapKV: LLM Knows What You are Looking for Before Generation
Paper • 2404.14469 • Published • 27 -
Finch: Prompt-guided Key-Value Cache Compression
Paper • 2408.00167 • Published • 18 -
Beyond RAG: Task-Aware KV Cache Compression for Comprehensive Knowledge Reasoning
Paper • 2503.04973 • Published • 24 -
A Simple and Effective L_2 Norm-Based Strategy for KV Cache Compression
Paper • 2406.11430 • Published • 24
Giulio Corallo PRO
giulio98
AI & ML interests
Generative Modeling
Recent Activity
updated
a dataset
1 minute ago
giulio98/LongBench-v2
published
a dataset
1 minute ago
giulio98/LongBench-v2
updated
a dataset
about 23 hours ago
giulio98/LongBench-2048
Organizations
Collections
2
models
2
datasets
11
giulio98/LongBench-v2
Viewer
•
Updated
•
503
giulio98/LongBench-2048
Updated
•
52
giulio98/LongBench-1024
Updated
•
52
giulio98/LongBench-512
Viewer
•
Updated
•
7.42k
•
81
giulio98/LongBench
Viewer
•
Updated
•
7.42k
•
79
giulio98/person_project_2
Updated
•
35
giulio98/person_project_name
Updated
•
36
giulio98/person_project_new
Updated
•
20
giulio98/xlcost_steps
Viewer
•
Updated
•
9.55k
•
16
giulio98/xlcost-single-prompt
Updated
•
29
•
3