gary109's Collections
MADLAD-400: A Multilingual And Document-Level Large Audited Dataset
Paper
•
2309.04662
•
Published
•
21
Neurons in Large Language Models: Dead, N-gram, Positional
Paper
•
2309.04827
•
Published
•
16
Optimize Weight Rounding via Signed Gradient Descent for the
Quantization of LLMs
Paper
•
2309.05516
•
Published
•
8
DrugChat: Towards Enabling ChatGPT-Like Capabilities on Drug Molecule
Graphs
Paper
•
2309.03907
•
Published
•
6
FLM-101B: An Open LLM and How to Train It with $100K Budget
Paper
•
2309.03852
•
Published
•
42
Large Language Models as Optimizers
Paper
•
2309.03409
•
Published
•
72
GPT Can Solve Mathematical Problems Without a Calculator
Paper
•
2309.03241
•
Published
•
17
DoLa: Decoding by Contrasting Layers Improves Factuality in Large
Language Models
Paper
•
2309.03883
•
Published
•
14
Paper
•
2309.03450
•
Published
•
7
Language Modeling Is Compression
Paper
•
2309.10668
•
Published
•
80
Multimodal Foundation Models: From Specialists to General-Purpose
Assistants
Paper
•
2309.10020
•
Published
•
39
Baichuan 2: Open Large-scale Language Models
Paper
•
2309.10305
•
Published
•
16
SlimPajama-DC: Understanding Data Combinations for LLM Training
Paper
•
2309.10818
•
Published
•
10
Stabilizing RLHF through Advantage Model and Selective Rehearsal
Paper
•
2309.10202
•
Published
•
9
Q-Transformer: Scalable Offline Reinforcement Learning via
Autoregressive Q-Functions
Paper
•
2309.10150
•
Published
•
23
FoleyGen: Visually-Guided Audio Generation
Paper
•
2309.10537
•
Published
•
6
The Languini Kitchen: Enabling Language Modelling Research at Different
Scales of Compute
Paper
•
2309.11197
•
Published
•
4
DreamLLM: Synergistic Multimodal Comprehension and Creation
Paper
•
2309.11499
•
Published
•
57
AnyMAL: An Efficient and Scalable Any-Modality Augmented Language Model
Paper
•
2309.16058
•
Published
•
53
Effective Long-Context Scaling of Foundation Models
Paper
•
2309.16039
•
Published
•
28
Paper
•
2309.16609
•
Published
•
30
AutoCLIP: Auto-tuning Zero-Shot Classifiers for Vision-Language Models
Paper
•
2309.16414
•
Published
•
19
MotionLM: Multi-Agent Motion Forecasting as Language Modeling
Paper
•
2309.16534
•
Published
•
15
ConceptGraphs: Open-Vocabulary 3D Scene Graphs for Perception and
Planning
Paper
•
2309.16650
•
Published
•
7
GPT-Fathom: Benchmarking Large Language Models to Decipher the
Evolutionary Path towards GPT-4 and Beyond
Paper
•
2309.16583
•
Published
•
12
Language models in molecular discovery
Paper
•
2309.16235
•
Published
•
10
Toward Joint Language Modeling for Speech Units and Text
Paper
•
2310.08715
•
Published
•
6
Self-RAG: Learning to Retrieve, Generate, and Critique through
Self-Reflection
Paper
•
2310.11511
•
Published
•
63
MusicAgent: An AI Agent for Music Understanding and Generation with
Large Language Models
Paper
•
2310.11954
•
Published
•
24
BitNet: Scaling 1-bit Transformers for Large Language Models
Paper
•
2310.11453
•
Published
•
94
VeRA: Vector-based Random Matrix Adaptation
Paper
•
2310.11454
•
Published
•
26
Set-of-Mark Prompting Unleashes Extraordinary Visual Grounding in GPT-4V
Paper
•
2310.11441
•
Published
•
24
Context-Aware Meta-Learning
Paper
•
2310.10971
•
Published
•
14
EvalCrafter: Benchmarking and Evaluating Large Video Generation Models
Paper
•
2310.11440
•
Published
•
13
TEQ: Trainable Equivalent Transformation for Quantization of LLMs
Paper
•
2310.10944
•
Published
•
9
Approximating Two-Layer Feedforward Networks for Efficient Transformers
Paper
•
2310.10837
•
Published
•
10
LAMP: Learn A Motion Pattern for Few-Shot-Based Video Generation
Paper
•
2310.10769
•
Published
•
8
Paper
•
2310.10625
•
Published
•
7
MiniGPT-v2: large language model as a unified interface for
vision-language multi-task learning
Paper
•
2310.09478
•
Published
•
15
GaussianDreamer: Fast Generation from Text to 3D Gaussian Splatting with
Point Cloud Priors
Paper
•
2310.08529
•
Published
•
16
UniAudio: An Audio Foundation Model Toward Universal Audio Generation
Paper
•
2310.00704
•
Published
•
16
DreamSpace: Dreaming Your Room Space with Text-Driven Panoramic Texture
Propagation
Paper
•
2310.13119
•
Published
•
10
H2O Open Ecosystem for State-of-the-art Large Language Models
Paper
•
2310.13012
•
Published
•
7
Enhancing High-Resolution 3D Generation through Pixel-wise Gradient
Clipping
Paper
•
2310.12474
•
Published
•
4
DEsignBench: Exploring and Benchmarking DALL-E 3 for Imagining Visual
Design
Paper
•
2310.15144
•
Published
•
12
FreeNoise: Tuning-Free Longer Video Diffusion Via Noise Rescheduling
Paper
•
2310.15169
•
Published
•
8
Let's Synthesize Step by Step: Iterative Dataset Synthesis with Large
Language Models by Extrapolating Errors from Small Models
Paper
•
2310.13671
•
Published
•
17
Auto-Instruct: Automatic Instruction Generation and Ranking for
Black-Box Language Models
Paper
•
2310.13127
•
Published
•
10
Teaching Language Models to Self-Improve through Interactive
Demonstrations
Paper
•
2310.13522
•
Published
•
10
SILC: Improving Vision Language Pretraining with Self-Distillation
Paper
•
2310.13355
•
Published
•
5
ToolChain*: Efficient Action Space Navigation in Large Language Models
with A* Search
Paper
•
2310.13227
•
Published
•
11
A Survey of Large Language Models
Paper
•
2303.18223
•
Published
•
13
A Survey of Large Language Models for Healthcare: from Data, Technology,
and Applications to Accountability and Ethics
Paper
•
2310.05694
•
Published
•
3
Woodpecker: Hallucination Correction for Multimodal Large Language
Models
Paper
•
2310.16045
•
Published
•
13
InstructionGPT-4: A 200-Instruction Paradigm for Fine-Tuning MiniGPT-4
Paper
•
2308.12067
•
Published
•
4
Ghost in the Minecraft: Generally Capable Agents for Open-World
Environments via Large Language Models with Text-based Knowledge and Memory
Paper
•
2305.17144
•
Published
•
2
Tree of Thoughts: Deliberate Problem Solving with Large Language Models
Paper
•
2305.10601
•
Published
•
7
UI Layout Generation with LLMs Guided by UI Grammar
Paper
•
2310.15455
•
Published
•
2
An Early Evaluation of GPT-4V(ision)
Paper
•
2310.16534
•
Published
•
21
Wonder3D: Single Image to 3D using Cross-Domain Diffusion
Paper
•
2310.15008
•
Published
•
19
JudgeLM: Fine-tuned Large Language Models are Scalable Judges
Paper
•
2310.17631
•
Published
•
31
Controlled Decoding from Language Models
Paper
•
2310.17022
•
Published
•
12
HyperFields: Towards Zero-Shot Generation of NeRFs from Text
Paper
•
2310.17075
•
Published
•
13
Deja Vu: Contextual Sparsity for Efficient LLMs at Inference Time
Paper
•
2310.17157
•
Published
•
8
Can Language Models Understand Physical Concepts?
Paper
•
2305.14057
•
Published
•
1
BLIP: Bootstrapping Language-Image Pre-training for Unified
Vision-Language Understanding and Generation
Paper
•
2201.12086
•
Published
•
2
ImageNetVC: Zero-Shot Visual Commonsense Evaluation on 1000 ImageNet
Categories
Paper
•
2305.15028
•
Published
•
1
Label Words are Anchors: An Information Flow Perspective for
Understanding In-Context Learning
Paper
•
2305.14160
•
Published
•
1
GPT-4 Doesn't Know It's Wrong: An Analysis of Iterative Prompting for
Reasoning Problems
Paper
•
2310.12397
•
Published
•
1
Can Large Language Models Really Improve by Self-critiquing Their Own
Plans?
Paper
•
2310.08118
•
Published
•
1
Voyager: An Open-Ended Embodied Agent with Large Language Models
Paper
•
2305.16291
•
Published
•
8
TiC-CLIP: Continual Training of CLIP Models
Paper
•
2310.16226
•
Published
•
7
CLEX: Continuous Length Extrapolation for Large Language Models
Paper
•
2310.16450
•
Published
•
9
ControlLLM: Augment Language Models with Tools by Searching on Graphs
Paper
•
2310.17796
•
Published
•
15
Data-Centric Financial Large Language Models
Paper
•
2310.17784
•
Published
•
14
FP8-LM: Training FP8 Large Language Models
Paper
•
2310.18313
•
Published
•
30
Multimodal ChatGPT for Medical Applications: an Experimental Study of
GPT-4V
Paper
•
2310.19061
•
Published
•
8
LoRAShear: Efficient Large Language Model Structured Pruning and
Knowledge Recovery
Paper
•
2310.18356
•
Published
•
22
MM-VID: Advancing Video Understanding with GPT-4V(ision)
Paper
•
2310.19773
•
Published
•
18
TeacherLM: Teaching to Fish Rather Than Giving the Fish, Language
Modeling Likewise
Paper
•
2310.19019
•
Published
•
9
Atom: Low-bit Quantization for Efficient and Accurate LLM Serving
Paper
•
2310.19102
•
Published
•
7
Personalised Distillation: Empowering Open-Sourced LLMs with Adaptive
Learning for Code Generation
Paper
•
2310.18628
•
Published
•
6
Skywork: A More Open Bilingual Foundation Model
Paper
•
2310.19341
•
Published
•
4
The Impact of Depth and Width on Transformer Language Model
Generalization
Paper
•
2310.19956
•
Published
•
9
Battle of the Backbones: A Large-Scale Comparison of Pretrained Models
across Computer Vision Tasks
Paper
•
2310.19909
•
Published
•
19
Learning From Mistakes Makes LLM Better Reasoner
Paper
•
2310.20689
•
Published
•
24
Does GPT-4 Pass the Turing Test?
Paper
•
2310.20216
•
Published
•
17
LoRA Fine-tuning Efficiently Undoes Safety Training in Llama 2-Chat 70B
Paper
•
2310.20624
•
Published
•
12
Unleashing the Power of Pre-trained Language Models for Offline
Reinforcement Learning
Paper
•
2310.20587
•
Published
•
15
Leveraging Word Guessing Games to Assess the Intelligence of Large
Language Models
Paper
•
2310.20499
•
Published
•
7
ChatCoder: Chat-based Refine Requirement Improves LLMs' Code Generation
Paper
•
2311.00272
•
Published
•
8
ChipNeMo: Domain-Adapted LLMs for Chip Design
Paper
•
2311.00176
•
Published
•
7
LLaVA-Interactive: An All-in-One Demo for Image Chat, Segmentation,
Generation and Editing
Paper
•
2311.00571
•
Published
•
39
Distil-Whisper: Robust Knowledge Distillation via Large-Scale Pseudo
Labelling
Paper
•
2311.00430
•
Published
•
53
Grounding Visual Illusions in Language: Do Vision-Language Models
Perceive Illusions Like Humans?
Paper
•
2311.00047
•
Published
•
7
The Generative AI Paradox: "What It Can Create, It May Not Understand"
Paper
•
2311.00059
•
Published
•
17
AMSP: Super-Scaling LLM Training via Advanced Model States Partitioning
Paper
•
2311.00257
•
Published
•
8
Text Rendering Strategies for Pixel Language Models
Paper
•
2311.00522
•
Published
•
10
FlashDecoding++: Faster Large Language Model Inference on GPUs
Paper
•
2311.01282
•
Published
•
30
FLAP: Fast Language-Audio Pre-training
Paper
•
2311.01615
•
Published
•
16
PPTC Benchmark: Evaluating Large Language Models for PowerPoint Task
Completion
Paper
•
2311.01767
•
Published
•
16
Contrastive Chain-of-Thought Prompting
Paper
•
2311.09277
•
Published
•
31
Tied-Lora: Enhancing parameter efficiency of LoRA with weight tying
Paper
•
2311.09578
•
Published
•
12
Open-Sourcing Highly Capable Foundation Models: An evaluation of risks,
benefits, and alternative methods for pursuing open-source objectives
Paper
•
2311.09227
•
Published
•
5
The Unlocking Spell on Base LLMs: Rethinking Alignment via In-Context
Learning
Paper
•
2312.01552
•
Published
•
26
Generating Illustrated Instructions
Paper
•
2312.04552
•
Published
•
6
Chain of Code: Reasoning with a Language Model-Augmented Code Emulator
Paper
•
2312.04474
•
Published
•
28
Beyond Surface: Probing LLaMA Across Scales and Layers
Paper
•
2312.04333
•
Published
•
18
Large Language Models for Mathematicians
Paper
•
2312.04556
•
Published
•
11
AnimateZero: Video Diffusion Models are Zero-Shot Image Animators
Paper
•
2312.03793
•
Published
•
17
OneLLM: One Framework to Align All Modalities with Language
Paper
•
2312.03700
•
Published
•
20
Generative Multimodal Models are In-Context Learners
Paper
•
2312.13286
•
Published
•
31
Specialized Language Models with Cheap Inference from Limited Domain
Data
Paper
•
2402.01093
•
Published
•
45
StepCoder: Improve Code Generation with Reinforcement Learning from
Compiler Feedback
Paper
•
2402.01391
•
Published
•
41
PokéLLMon: A Human-Parity Agent for Pokémon Battles with Large
Language Models
Paper
•
2402.01118
•
Published
•
28
K-Level Reasoning with Large Language Models
Paper
•
2402.01521
•
Published
•
16
TravelPlanner: A Benchmark for Real-World Planning with Language Agents
Paper
•
2402.01622
•
Published
•
30
MusicRL: Aligning Music Generation to Human Preferences
Paper
•
2402.04229
•
Published
•
16
TinyLlama: An Open-Source Small Language Model
Paper
•
2401.02385
•
Published
•
81
Computing Power and the Governance of Artificial Intelligence
Paper
•
2402.08797
•
Published
•
11
Premise Order Matters in Reasoning with Large Language Models
Paper
•
2402.08939
•
Published
•
23
L3GO: Language Agents with Chain-of-3D-Thoughts for Generating
Unconventional Objects
Paper
•
2402.09052
•
Published
•
16
Large Language Models as Zero-shot Dialogue State Tracker through
Function Calling
Paper
•
2402.10466
•
Published
•
16
RLVF: Learning from Verbal Feedback without Overgeneralization
Paper
•
2402.10893
•
Published
•
10
Learning to Learn Faster from Human Feedback with Language Model
Predictive Control
Paper
•
2402.11450
•
Published
•
20
AnyGPT: Unified Multimodal LLM with Discrete Sequence Modeling
Paper
•
2402.12226
•
Published
•
37
FinTral: A Family of GPT-4 Level Multimodal Financial Large Language
Models
Paper
•
2402.10986
•
Published
•
73
LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens
Paper
•
2402.13753
•
Published
•
104
OmniPred: Language Models as Universal Regressors
Paper
•
2402.14547
•
Published
•
11
Snap Video: Scaled Spatiotemporal Transformers for Text-to-Video
Synthesis
Paper
•
2402.14797
•
Published
•
18
Watermarking Makes Language Models Radioactive
Paper
•
2402.14904
•
Published
•
21
ChatMusician: Understanding and Generating Music Intrinsically with LLM
Paper
•
2402.16153
•
Published
•
55
MobileLLM: Optimizing Sub-billion Parameter Language Models for
On-Device Use Cases
Paper
•
2402.14905
•
Published
•
81
Towards Optimal Learning of Language Models
Paper
•
2402.17759
•
Published
•
16
The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits
Paper
•
2402.17764
•
Published
•
567
Beyond Language Models: Byte Models are Digital World Simulators
Paper
•
2402.19155
•
Published
•
45
Examining Forgetting in Continual Pre-training of Aligned Large Language
Models
Paper
•
2401.03129
•
Published
Can Large Language Models Be an Alternative to Human Evaluations?
Paper
•
2305.01937
•
Published
•
1
A Closer Look into Automatic Evaluation Using Large Language Models
Paper
•
2310.05657
•
Published
Large Language Models Understand and Can be Enhanced by Emotional
Stimuli
Paper
•
2307.11760
•
Published
•
1
Principled Instructions Are All You Need for Questioning LLaMA-1/2,
GPT-3.5/4
Paper
•
2312.16171
•
Published
•
30
Re3: Generating Longer Stories With Recursive Reprompting and Revision
Paper
•
2210.06774
•
Published
•
2
Constitutional AI: Harmlessness from AI Feedback
Paper
•
2212.08073
•
Published
•
1
AnyTool: Self-Reflective, Hierarchical Agents for Large-Scale API Calls
Paper
•
2402.04253
•
Published
ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment
Paper
•
2403.05135
•
Published
•
39
Encouraging Divergent Thinking in Large Language Models through
Multi-Agent Debate
Paper
•
2305.19118
•
Published
CAMEL: Communicative Agents for "Mind" Exploration of Large Scale
Language Model Society
Paper
•
2303.17760
•
Published
•
1
Dynamic LLM-Agent Network: An LLM-agent Collaboration Framework with
Agent Team Optimization
Paper
•
2310.02170
•
Published
MetaGPT: Meta Programming for Multi-Agent Collaborative Framework
Paper
•
2308.00352
•
Published
•
2
Rho-1: Not All Tokens Are What You Need
Paper
•
2404.07965
•
Published
•
80
Large Language Models for Autonomous Driving: Real-World Experiments
Paper
•
2312.09397
•
Published
The Rise and Potential of Large Language Model Based Agents: A Survey
Paper
•
2309.07864
•
Published
•
5