view article Article Unlocking Longer Generation with Key-Value Cache Quantization 17 days ago β’ 12
Vision Language Models Papers πΌοΈπ¬π Collection Papers about vision-language models, most important ones are on top of the list. β’ 27 items β’ Updated Apr 30 β’ 26
Training-Free Long-Context Scaling of Large Language Models Paper β’ 2402.17463 β’ Published Feb 27 β’ 18
π Llama-3 Collection My experiments with Llama-3 models β’ 54 items β’ Updated 2 days ago β’ 19
view article Article The Open Medical-LLM Leaderboard: Benchmarking Large Language Models in Healthcare Apr 19 β’ 70
Megalodon: Efficient LLM Pretraining and Inference with Unlimited Context Length Paper β’ 2404.08801 β’ Published Apr 12 β’ 62
view article Article Introducing Idefics2: A Powerful 8B Vision-Language Model for the community Apr 15 β’ 134
Quyen Collection State-of-the-arts General LLMs - based on Qwen1.5 β’ 26 items β’ Updated Feb 13 β’ 12
QMoE: Practical Sub-1-Bit Compression of Trillion-Parameter Models Paper β’ 2310.16795 β’ Published Oct 25, 2023 β’ 26