-
Creative Robot Tool Use with Large Language Models
Paper • 2310.13065 • Published • 8 -
CodeCoT and Beyond: Learning to Program and Test like a Developer
Paper • 2308.08784 • Published • 5 -
Lemur: Harmonizing Natural Language and Code for Language Agents
Paper • 2310.06830 • Published • 31 -
CodePlan: Repository-level Coding using LLMs and Planning
Paper • 2309.12499 • Published • 74
Collections
Discover the best community collections!
Collections including paper arxiv:2309.07062
-
Large Language Models for Compiler Optimization
Paper • 2309.07062 • Published • 23 -
Deja Vu: Contextual Sparsity for Efficient LLMs at Inference Time
Paper • 2310.17157 • Published • 12 -
FP8-LM: Training FP8 Large Language Models
Paper • 2310.18313 • Published • 31 -
Atom: Low-bit Quantization for Efficient and Accurate LLM Serving
Paper • 2310.19102 • Published • 10
-
NExT-GPT: Any-to-Any Multimodal LLM
Paper • 2309.05519 • Published • 78 -
Large Language Model for Science: A Study on P vs. NP
Paper • 2309.05689 • Published • 20 -
AstroLLaMA: Towards Specialized Foundation Models in Astronomy
Paper • 2309.06126 • Published • 16 -
Large Language Models for Compiler Optimization
Paper • 2309.07062 • Published • 23
-
Language Modeling Is Compression
Paper • 2309.10668 • Published • 82 -
Baichuan 2: Open Large-scale Language Models
Paper • 2309.10305 • Published • 19 -
Chain-of-Verification Reduces Hallucination in Large Language Models
Paper • 2309.11495 • Published • 38 -
LMDX: Language Model-based Document Information Extraction and Localization
Paper • 2309.10952 • Published • 65
-
One Wide Feedforward is All You Need
Paper • 2309.01826 • Published • 31 -
Gated recurrent neural networks discover attention
Paper • 2309.01775 • Published • 7 -
FLM-101B: An Open LLM and How to Train It with $100K Budget
Paper • 2309.03852 • Published • 44 -
Large Language Models as Optimizers
Paper • 2309.03409 • Published • 75
-
Large Language Models for Compiler Optimization
Paper • 2309.07062 • Published • 23 -
Generative AI for Programming Education: Benchmarking ChatGPT, GPT-4, and Human Tutors
Paper • 2306.17156 • Published • 21 -
Generative AI for learning: Investigating the potential of synthetic learning videos
Paper • 2304.03784 • Published -
Octo-planner: On-device Language Model for Planner-Action Agents
Paper • 2406.18082 • Published • 47