liminalism (Lim)

upvoted 3 papers 12 months ago

Better & Faster Large Language Models via Multi-token Prediction

Paper • 2404.19737 • Published Apr 30, 2024 • 78

Make Your LLM Fully Utilize the Context

Paper • 2404.16811 • Published Apr 25, 2024 • 55

OpenELM: An Efficient Language Model Family with Open-source Training and Inference Framework

Paper • 2404.14619 • Published Apr 22, 2024 • 128

upvoted a collection 12 months ago

OpenELM Instruct Models

Collection

4 items • Updated Oct 4, 2024 • 118

upvoted a paper 12 months ago

FlowMind: Automatic Workflow Generation with LLMs

Paper • 2404.13050 • Published Mar 17, 2024 • 35

upvoted 2 papers about 1 year ago

LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders

Paper • 2404.05961 • Published Apr 9, 2024 • 66

RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Horizon Generation

Paper • 2403.05313 • Published Mar 8, 2024 • 9

upvoted an article about 1 year ago

Article

CodeGemma - an official Google release for code LLMs

Apr 9, 2024

• 100

upvoted a paper about 1 year ago

ReFT: Representation Finetuning for Language Models

Paper • 2404.03592 • Published Apr 4, 2024 • 98

upvoted 2 collections about 1 year ago

🤖 Agents

Collection

21 items • Updated Dec 31, 2024 • 151

MoEs papers reading list

Collection

60 items • Updated Nov 4, 2024 • 140

upvoted 3 papers about 1 year ago

upvoted 2 papers over 1 year ago

Mixtral of Experts

Paper • 2401.04088 • Published Jan 8, 2024 • 160

LLM in a flash: Efficient Large Language Model Inference with Limited Memory

Paper • 2312.11514 • Published Dec 12, 2023 • 258

Lim

AI & ML interests

Organizations

liminalism's activity

Better & Faster Large Language Models via Multi-token Prediction

Make Your LLM Fully Utilize the Context

OpenELM: An Efficient Language Model Family with Open-source Training and Inference Framework

OpenELM Instruct Models

FlowMind: Automatic Workflow Generation with LLMs

LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders

RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Horizon Generation

CodeGemma - an official Google release for code LLMs

ReFT: Representation Finetuning for Language Models

🤖 Agents

MoEs papers reading list

Jamba: A Hybrid Transformer-Mamba Language Model

LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement

MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training

Mixtral of Experts

LLM in a flash: Efficient Large Language Model Inference with Limited Memory