speechlessai's Collections
Reading Papers
Self-Rewarding Language Models
Paper
•
2401.10020
•
Published
•
135
ReFT: Reasoning with Reinforced Fine-Tuning
Paper
•
2401.08967
•
Published
•
26
Tuning Language Models by Proxy
Paper
•
2401.08565
•
Published
•
19
TrustLLM: Trustworthiness in Large Language Models
Paper
•
2401.05561
•
Published
•
62
Paper
•
2401.04088
•
Published
•
152
MoE-Mamba: Efficient Selective State Space Models with Mixture of Experts
Paper
•
2401.04081
•
Published
•
68
DeepSeek LLM: Scaling Open-Source Language Models with Longtermism
Paper
•
2401.02954
•
Published
•
38
Understanding LLMs: A Comprehensive Overview from Training to Inference
Paper
•
2401.02038
•
Published
•
59
LLM Augmented LLMs: Expanding Capabilities through Composition
Paper
•
2401.02412
•
Published
•
35
LLaVA-φ: Efficient Multi-Modal Assistant with Small Language Model
Paper
•
2401.02330
•
Published
•
11
DocLLM: A layout-aware generative language model for multimodal document understanding
Paper
•
2401.00908
•
Published
•
172
Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models
Paper
•
2401.01335
•
Published
•
61
LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning
Paper
•
2401.01325
•
Published
•
24
A Comprehensive Study of Knowledge Editing for Large Language Models
Paper
•
2401.01286
•
Published
•
15
Improving Text Embeddings with Large Language Models
Paper
•
2401.00368
•
Published
•
72
LARP: Language-Agent Role Play for Open-World Games
Paper
•
2312.17653
•
Published
•
29
Extending LLMs' Context Window with 100 Samples
Paper
•
2401.07004
•
Published
•
14
DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models
Paper
•
2401.06066
•
Published
•
35
TinyGPT-V: Efficient Multimodal Large Language Model via Small Backbones
Paper
•
2312.16862
•
Published
•
28
Generative AI for Math: Part I -- MathPile: A Billion-Token-Scale Pretraining Corpus for Math
Paper
•
2312.17120
•
Published
•
24
MobileVLM : A Fast, Reproducible and Strong Vision Language Assistant for Mobile Devices
Paper
•
2312.16886
•
Published
•
18
Human101: Training 100+FPS Human Gaussians in 100s from 1 View
Paper
•
2312.15258
•
Published
•
6
WaveCoder: Widespread And Versatile Enhanced Instruction Tuning with Refined Data Generation
Paper
•
2312.14187
•
Published
•
49
Reasons to Reject? Aligning Language Models with Judgments
Paper
•
2312.14591
•
Published
•
16
AppAgent: Multimodal Agents as Smartphone Users
Paper
•
2312.13771
•
Published
•
49
Time is Encoded in the Weights of Finetuned Language Models
Paper
•
2312.13401
•
Published
•
18
TinySAM: Pushing the Envelope for Efficient Segment Anything Model
Paper
•
2312.13789
•
Published
•
13
TinyGSM: achieving >80% on GSM8k with small language models
Paper
•
2312.09241
•
Published
•
33
SwitchHead: Accelerating Transformers with Mixture-of-Experts Attention
Paper
•
2312.07987
•
Published
•
39
PromptBench: A Unified Library for Evaluation of Large Language Models
Paper
•
2312.07910
•
Published
•
14
Llama Guard: LLM-based Input-Output Safeguard for Human-AI Conversations
Paper
•
2312.06674
•
Published
•
5
LLM360: Towards Fully Transparent Open-Source LLMs
Paper
•
2312.06550
•
Published
•
52
Beyond Human Data: Scaling Self-Training for Problem-Solving with Language Models
Paper
•
2312.06585
•
Published
•
26
Context Tuning for Retrieval Augmented Generation
Paper
•
2312.05708
•
Published
•
16
Evaluation of Large Language Models for Decision Making in Autonomous Driving
Paper
•
2312.06351
•
Published
•
5
Alpha-CLIP: A CLIP Model Focusing on Wherever You Want
Paper
•
2312.03818
•
Published
•
31
PhotoMaker: Customizing Realistic Human Photos via Stacked ID Embedding
Paper
•
2312.04461
•
Published
•
48
Chain of Code: Reasoning with a Language Model-Augmented Code Emulator
Paper
•
2312.04474
•
Published
•
27
Pearl: A Production-ready Reinforcement Learning Agent
Paper
•
2312.03814
•
Published
•
14
OneLLM: One Framework to Align All Modalities with Language
Paper
•
2312.03700
•
Published
•
20
LivePhoto: Real Image Animation with Text-guided Motion Control
Paper
•
2312.02928
•
Published
•
15
Rank-without-GPT: Building GPT-Independent Listwise Rerankers on Open-Source Large Language Models
Paper
•
2312.02969
•
Published
•
11
Training Chain-of-Thought via Latent-Variable Inference
Paper
•
2312.02179
•
Published
•
8
Magicoder: Source Code Is All You Need
Paper
•
2312.02120
•
Published
•
78
Segment and Caption Anything
Paper
•
2312.00869
•
Published
•
17
Paper
•
2312.00860
•
Published
•
8
Mamba: Linear-Time Sequence Modeling with Selective State Spaces
Paper
•
2312.00752
•
Published
•
130
Dolphins: Multimodal Language Model for Driving
Paper
•
2312.00438
•
Published
•
12
ZipLoRA: Any Subject in Any Style by Effectively Merging LoRAs
Paper
•
2311.13600
•
Published
•
41
Using Human Feedback to Fine-tune Diffusion Models without Any Reward Model
Paper
•
2311.13231
•
Published
•
25
Exponentially Faster Language Modelling
Paper
•
2311.10770
•
Published
•
117
Orca 2: Teaching Small Language Models How to Reason
Paper
•
2311.11045
•
Published
•
68
TPTU-v2: Boosting Task Planning and Tool Usage of Large Language Model-based Agents in Real-world Systems
Paper
•
2311.11315
•
Published
•
6
ToolTalk: Evaluating Tool-Usage in a Conversational Setting
Paper
•
2311.10775
•
Published
•
7
ProAgent: From Robotic Process Automation to Agentic Process Automation
Paper
•
2311.10751
•
Published
•
7
Camels in a Changing Climate: Enhancing LM Adaptation with Tulu 2
Paper
•
2311.10702
•
Published
•
17
SelfEval: Leveraging the discriminative nature of generative models for evaluation
Paper
•
2311.10708
•
Published
•
14
The Chosen One: Consistent Characters in Text-to-Image Diffusion Models
Paper
•
2311.10093
•
Published
•
54
ML-Bench: Large Language Models Leverage Open-source Libraries for Machine Learning Tasks
Paper
•
2311.09835
•
Published
•
7
Routing to the Expert: Efficient Reward-guided Ensemble of Large Language Models
Paper
•
2311.08692
•
Published
•
11
A Survey on Language Models for Code
Paper
•
2311.07989
•
Published
•
20
Instruction-Following Evaluation for Large Language Models
Paper
•
2311.07911
•
Published
•
17
Fast Chain-of-Thought: A Glance of Future from Parallel Decoding Leads to Answers Faster
Paper
•
2311.08263
•
Published
•
14
The ART of LLM Refinement: Ask, Refine, and Trust
Paper
•
2311.07961
•
Published
•
9
ChatAnything: Facetime Chat with LLM-Enhanced Personas
Paper
•
2311.06772
•
Published
•
33
LayoutPrompter: Awaken the Design Ability of Large Language Models
Paper
•
2311.06495
•
Published
•
7
Lumos: Learning Agents with Unified Data, Modular Design, and Open-Source LLMs
Paper
•
2311.05657
•
Published
•
26
LCM-LoRA: A Universal Stable-Diffusion Acceleration Module
Paper
•
2311.05556
•
Published
•
73
LLaVA-Plus: Learning to Use Tools for Creating Multimodal Agents
Paper
•
2311.05437
•
Published
•
40
Prompt Cache: Modular Attention Reuse for Low-Latency Inference
Paper
•
2311.04934
•
Published
•
23
Can LLMs Follow Simple Rules?
Paper
•
2311.04235
•
Published
•
9
Levels of AGI: Operationalizing Progress on the Path to AGI
Paper
•
2311.02462
•
Published
•
30
S-LoRA: Serving Thousands of Concurrent LoRA Adapters
Paper
•
2311.03285
•
Published
•
27
CogVLM: Visual Expert for Pretrained Language Models
Paper
•
2311.03079
•
Published
•
18
Relax: Composable Abstractions for End-to-End Dynamic Machine Learning
Paper
•
2311.02103
•
Published
•
15
MFTCoder: Boosting Code LLMs with Multitask Fine-Tuning
Paper
•
2311.02303
•
Published
•
4
CoVLM: Composing Visual Entities and Relationships in Large Language Models Via Communicative Decoding
Paper
•
2311.03354
•
Published
•
4
ChatCoder: Chat-based Refine Requirement Improves LLMs' Code Generation
Paper
•
2311.00272
•
Published
•
8
ChipNeMo: Domain-Adapted LLMs for Chip Design
Paper
•
2311.00176
•
Published
•
7
Learning From Mistakes Makes LLM Better Reasoner
Paper
•
2310.20689
•
Published
•
24
Does GPT-4 Pass the Turing Test?
Paper
•
2310.20216
•
Published
•
17
LoRA Fine-tuning Efficiently Undoes Safety Training in Llama 2-Chat 70B
Paper
•
2310.20624
•
Published
•
12
Paper
•
2310.20707
•
Published
•
9
LoRAShear: Efficient Large Language Model Structured Pruning and Knowledge Recovery
Paper
•
2310.18356
•
Published
•
22
TeacherLM: Teaching to Fish Rather Than Giving the Fish, Language Modeling Likewise
Paper
•
2310.19019
•
Published
•
9
JudgeLM: Fine-tuned Large Language Models are Scalable Judges
Paper
•
2310.17631
•
Published
•
31
QMoE: Practical Sub-1-Bit Compression of Trillion-Parameter Models
Paper
•
2310.16795
•
Published
•
26
InstructExcel: A Benchmark for Natural Language Instruction in Excel
Paper
•
2310.14495
•
Published
•
1
Auto-Instruct: Automatic Instruction Generation and Ranking for Black-Box Language Models
Paper
•
2310.13127
•
Published
•
10
ToolChain*: Efficient Action Space Navigation in Large Language Models with A* Search
Paper
•
2310.13227
•
Published
•
11
Tuna: Instruction Tuning using Feedback from Large Language Models
Paper
•
2310.13385
•
Published
•
8
AgentTuning: Enabling Generalized Agent Abilities for LLMs
Paper
•
2310.12823
•
Published
•
33
Self-RAG: Learning to Retrieve, Generate, and Critique through Self-Reflection
Paper
•
2310.11511
•
Published
•
61
Table-GPT: Table-tuned GPT for Diverse Table Tasks
Paper
•
2310.09263
•
Published
•
36
LoftQ: LoRA-Fine-Tuning-Aware Quantization for Large Language Models
Paper
•
2310.08659
•
Published
•
20
Prometheus: Inducing Fine-grained Evaluation Capability in Language Models
Paper
•
2310.08491
•
Published
•
48
Lemur: Harmonizing Natural Language and Code for Language Agents
Paper
•
2310.06830
•
Published
•
29
MathCoder: Seamless Code Integration in LLMs for Enhanced Mathematical Reasoning
Paper
•
2310.03731
•
Published
•
25
DSPy: Compiling Declarative Language Model Calls into Self-Improving Pipelines
Paper
•
2310.03714
•
Published
•
27
SmartPlay : A Benchmark for LLMs as Intelligent Agents
Paper
•
2310.01557
•
Published
•
12
VMamba: Visual State Space Model
Paper
•
2401.10166
•
Published
•
36
Medusa: Simple LLM Inference Acceleration Framework with Multiple Decoding Heads
Paper
•
2401.10774
•
Published
•
50
ActAnywhere: Subject-Aware Video Background Generation
Paper
•
2401.10822
•
Published
•
11
Rambler: Supporting Writing With Speech via LLM-Assisted Gist Manipulation
Paper
•
2401.10838
•
Published
•
8
Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs
Paper
•
2401.11708
•
Published
•
27
Large Language Models are Superpositions of All Characters: Attaining Arbitrary Role-play via Self-Alignment
Paper
•
2401.12474
•
Published
•
33
Orion-14B: Open-source Multilingual Large Language Models
Paper
•
2401.12246
•
Published
•
10
Small Language Model Meets with Reinforced Vision Vocabulary
Paper
•
2401.12503
•
Published
•
30
BiTA: Bi-Directional Tuning for Lossless Acceleration in Large Language Models
Paper
•
2401.12522
•
Published
•
11
MM-LLMs: Recent Advances in MultiModal Large Language Models
Paper
•
2401.13601
•
Published
•
41
Diffuse to Choose: Enriching Image Conditioned Inpainting in Latent Diffusion Models for Virtual Try-All
Paper
•
2401.13795
•
Published
•
64
DeepSeek-Coder: When the Large Language Model Meets Programming -- The Rise of Code Intelligence
Paper
•
2401.14196
•
Published
•
44
Genie: Achieving Human Parity in Content-Grounded Datasets Generation
Paper
•
2401.14367
•
Published
•
6
Rephrasing the Web: A Recipe for Compute and Data-Efficient Language Modeling
Paper
•
2401.16380
•
Published
•
45
MoE-LLaVA: Mixture of Experts for Large Vision-Language Models
Paper
•
2401.15947
•
Published
•
46
Mobile-Agent: Autonomous Multi-Modal Mobile Device Agent with Visual Perception
Paper
•
2401.16158
•
Published
•
15
SERL: A Software Suite for Sample-Efficient Robotic Reinforcement Learning
Paper
•
2401.16013
•
Published
•
17
SymbolicAI: A framework for logic-based approaches combining generative models and solvers
Paper
•
2402.00854
•
Published
•
18
OS-Copilot: Towards Generalist Computer Agents with Self-Improvement
Paper
•
2402.07456
•
Published
•
38
Fiddler: CPU-GPU Orchestration for Fast Inference of Mixture-of-Experts Models
Paper
•
2402.07033
•
Published
•
16
ChemLLM: A Chemical Large Language Model
Paper
•
2402.06852
•
Published
•
17
LiRank: Industrial Large Scale Ranking Models at LinkedIn
Paper
•
2402.06859
•
Published
•
8
AutoMathText: Autonomous Data Selection with Language Models for Mathematical Texts
Paper
•
2402.07625
•
Published
•
10
Chain-of-Thought Reasoning Without Prompting
Paper
•
2402.10200
•
Published
•
90
Generative Representational Instruction Tuning
Paper
•
2402.09906
•
Published
•
50
A Human-Inspired Reading Agent with Gist Memory of Very Long Contexts
Paper
•
2402.09727
•
Published
•
35
How to Train Data-Efficient LLMs
Paper
•
2402.09668
•
Published
•
33
BitDelta: Your Fine-Tune May Only Be Worth One Bit
Paper
•
2402.10193
•
Published
•
17
DreamMatcher: Appearance Matching Self-Attention for Semantically-Consistent Text-to-Image Personalization
Paper
•
2402.09812
•
Published
•
11
OpenMathInstruct-1: A 1.8 Million Math Instruction Tuning Dataset
Paper
•
2402.10176
•
Published
•
32
DataDreamer: A Tool for Synthetic Data Generation and Reproducible LLM Workflows
Paper
•
2402.10379
•
Published
•
27
LLM Comparator: Visual Analytics for Side-by-Side Evaluation of Large Language Models
Paper
•
2402.10524
•
Published
•
19
Large Language Models as Zero-shot Dialogue State Tracker through Function Calling
Paper
•
2402.10466
•
Published
•
16
LAVE: LLM-Powered Agent Assistance and Language Augmentation for Video Editing
Paper
•
2402.10294
•
Published
•
19
OpenCodeInterpreter: Integrating Code Generation with Execution and Refinement
Paper
•
2402.14658
•
Published
•
77
Beyond A*: Better Planning with Transformers via Search Dynamics Bootstrapping
Paper
•
2402.14083
•
Published
•
43
TinyLLaVA: A Framework of Small-scale Large Multimodal Models
Paper
•
2402.14289
•
Published
•
16
Copilot Evaluation Harness: Evaluating LLM-Guided Software Programming
Paper
•
2402.14261
•
Published
•
10
LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens
Paper
•
2402.13753
•
Published
•
104
Aria Everyday Activities Dataset
Paper
•
2402.13349
•
Published
•
28
Coercing LLMs to do and reveal (almost) anything
Paper
•
2402.14020
•
Published
•
12
Ouroboros: Speculative Decoding with Large Model Enhanced Drafting
Paper
•
2402.13720
•
Published
•
4
Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research
Paper
•
2402.00159
•
Published
•
55
Specialized Language Models with Cheap Inference from Limited Domain Data
Paper
•
2402.01093
•
Published
•
45
TravelPlanner: A Benchmark for Real-World Planning with Language Agents
Paper
•
2402.01622
•
Published
•
30
Nomic Embed: Training a Reproducible Long Context Text Embedder
Paper
•
2402.01613
•
Published
•
13
Training-Free Consistent Text-to-Image Generation
Paper
•
2402.03286
•
Published
•
61
DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models
Paper
•
2402.03300
•
Published
•
61
OpenMoE: An Early Effort on Open Mixture-of-Experts Language Models
Paper
•
2402.01739
•
Published
•
26
DiffEditor: Boosting Accuracy and Flexibility on Diffusion-based Image Editing
Paper
•
2402.02583
•
Published
•
7
Self-Discover: Large Language Models Self-Compose Reasoning Structures
Paper
•
2402.03620
•
Published
•
102
Can Mamba Learn How to Learn? A Comparative Study on In-Context Learning Tasks
Paper
•
2402.04248
•
Published
•
24
MobileVLM V2: Faster and Stronger Baseline for Vision Language Model
Paper
•
2402.03766
•
Published
•
9
Vision Superalignment: Weak-to-Strong Generalization for Vision Foundation Models
Paper
•
2402.03749
•
Published
•
9
Multi-line AI-assisted Code Authoring
Paper
•
2402.04141
•
Published
•
8
BiLLM: Pushing the Limit of Post-Training Quantization for LLMs
Paper
•
2402.04291
•
Published
•
48
ScreenAI: A Vision-Language Model for UI and Infographics Understanding
Paper
•
2402.04615
•
Published
•
31
Fine-Tuned Language Models Generate Stable Inorganic Materials as Text
Paper
•
2402.04379
•
Published
•
7
CodeIt: Self-Improving Language Models with Prioritized Hindsight Replay
Paper
•
2402.04858
•
Published
•
13
Grandmaster-Level Chess Without Search
Paper
•
2402.04494
•
Published
•
62
More Agents Is All You Need
Paper
•
2402.05120
•
Published
•
46
Tag-LLM: Repurposing General-Purpose LLMs for Specialized Domains
Paper
•
2402.05140
•
Published
•
18
An Interactive Agent Foundation Model
Paper
•
2402.05929
•
Published
•
24
WebLINX: Real-World Website Navigation with Multi-Turn Dialogue
Paper
•
2402.05930
•
Published
•
35
Training Generative Question-Answering on Synthetic Data Obtained from an Instruct-tuned Model
Paper
•
2310.08072
•
Published
•
1
Synthetic Data (Almost) from Scratch: Generalized Instruction Tuning for Language Models
Paper
•
2402.13064
•
Published
•
45
FinTral: A Family of GPT-4 Level Multimodal Financial Large Language Models
Paper
•
2402.10986
•
Published
•
73
Speculative Streaming: Fast LLM Inference without Auxiliary Models
Paper
•
2402.11131
•
Published
•
41
AnyGPT: Unified Multimodal LLM with Discrete Sequence Modeling
Paper
•
2402.12226
•
Published
•
37
Paper
•
2402.12219
•
Published
•
15
Rethinking Data Selection for Supervised Fine-Tuning
Paper
•
2402.06094
•
Published
•
1
What Makes Good Data for Alignment? A Comprehensive Study of Automatic Data Selection in Instruction Tuning
Paper
•
2312.15685
•
Published
•
16
SelectLLM: Can LLMs Select Important Instructions to Annotate?
Paper
•
2401.16553
•
Published
•
3
Long Is More for Alignment: A Simple but Tough-to-Beat Baseline for Instruction Fine-Tuning
Paper
•
2402.04833
•
Published
•
6
A Systematic Survey of Prompt Engineering in Large Language Models: Techniques and Applications
Paper
•
2402.07927
•
Published
•
1
Simple linear attention language models balance the recall-throughput tradeoff
Paper
•
2402.18668
•
Published
•
16
DiffuseKronA: A Parameter Efficient Fine-tuning Method for Personalized Diffusion Model
Paper
•
2402.17412
•
Published
•
21
OmniACT: A Dataset and Benchmark for Enabling Multimodal Generalist Autonomous Agents for Desktop and Web
Paper
•
2402.17553
•
Published
•
21
Training-Free Long-Context Scaling of Large Language Models
Paper
•
2402.17463
•
Published
•
16
FuseChat: Knowledge Fusion of Chat Models
Paper
•
2402.16107
•
Published
•
35
StructLM: Towards Building Generalist Models for Structured Knowledge Grounding
Paper
•
2402.16671
•
Published
•
26
Seamless Human Motion Composition with Blended Positional Encodings
Paper
•
2402.15509
•
Published
•
12
Design2Code: How Far Are We From Automating Front-End Engineering?
Paper
•
2403.03163
•
Published
•
92
Finetuned Multimodal Language Models Are High-Quality Image-Text Data Filters
Paper
•
2403.02677
•
Published
•
16
MAGID: An Automated Pipeline for Generating Synthetic Multi-modal Datasets
Paper
•
2403.03194
•
Published
•
11
MoAI: Mixture of All Intelligence for Large Language and Vision Models
Paper
•
2403.07508
•
Published
•
69
Branch-Train-MiX: Mixing Expert LLMs into a Mixture-of-Experts LLM
Paper
•
2403.07816
•
Published
•
37
Simple and Scalable Strategies to Continually Pre-train Large Language Models
Paper
•
2403.08763
•
Published
•
48
Adapting Large Language Models via Reading Comprehension
Paper
•
2309.09530
•
Published
•
69
Recurrent Drafter for Fast Speculative Decoding in Large Language Models
Paper
•
2403.09919
•
Published
•
19
RAFT: Adapting Language Model to Domain Specific RAG
Paper
•
2403.10131
•
Published
•
58
mPLUG-DocOwl 1.5: Unified Structure Learning for OCR-free Document Understanding
Paper
•
2403.12895
•
Published
•
27
TnT-LLM: Text Mining at Scale with Large Language Models
Paper
•
2403.12173
•
Published
•
17
LLMLingua-2: Data Distillation for Efficient and Faithful Task-Agnostic Prompt Compression
Paper
•
2403.12968
•
Published
•
20
BioMedLM: A 2.7B Parameter Language Model Trained On Biomedical Text
Paper
•
2403.18421
•
Published
•
20