Denoising LM: Pushing the Limits of Error Correction Models for Speech Recognition Paper • 2405.15216 • Published 17 days ago • 11
FIFO-Diffusion: Generating Infinite Videos from Text without Training Paper • 2405.11473 • Published 22 days ago • 53
Dreamer XL: Towards High-Resolution Text-to-3D Generation via Trajectory Score Matching Paper • 2405.11252 • Published 22 days ago • 11
Towards Modular LLMs by Building and Reusing a Library of LoRAs Paper • 2405.11157 • Published 23 days ago • 23
SLAB: Efficient Transformers with Simplified Linear Attention and Progressive Re-parameterized Batch Normalization Paper • 2405.11582 • Published 21 days ago • 11
Imp: Highly Capable Large Multimodal Models for Mobile Devices Paper • 2405.12107 • Published 20 days ago • 23
MoRA: High-Rank Updating for Parameter-Efficient Fine-Tuning Paper • 2405.12130 • Published 20 days ago • 42
OpenRLHF: An Easy-to-use, Scalable and High-performance RLHF Framework Paper • 2405.11143 • Published 21 days ago • 33
Piccolo2: General Text Embedding with Multi-task Hybrid Loss Training Paper • 2405.06932 • Published 30 days ago • 15
SambaNova SN40L: Scaling the AI Memory Wall with Dataflow and Composition of Experts Paper • 2405.07518 • Published 28 days ago • 21
view article Article Advancing Open-source Large Language Models in the Medical & Healthcare Domain By aaditya • about 1 month ago • 4
Exploiting Reasoning Chains for Multi-hop Science Question Answering Paper • 2109.02905 • Published Sep 7, 2021 • 1
Answering Questions by Meta-Reasoning over Multiple Chains of Thought Paper • 2304.13007 • Published Apr 25, 2023 • 1
FActScore: Fine-grained Atomic Evaluation of Factual Precision in Long Form Text Generation Paper • 2305.14251 • Published May 23, 2023 • 1
L-Eval: Instituting Standardized Evaluation for Long Context Language Models Paper • 2307.11088 • Published Jul 20, 2023 • 4
ARES: An Automated Evaluation Framework for Retrieval-Augmented Generation Systems Paper • 2311.09476 • Published Nov 16, 2023 • 3
Can Large Language Models Be an Alternative to Human Evaluations? Paper • 2305.01937 • Published May 3, 2023 • 1
Better & Faster Large Language Models via Multi-token Prediction Paper • 2404.19737 • Published Apr 30 • 64
A Latent Space Theory for Emergent Abilities in Large Language Models Paper • 2304.09960 • Published Apr 19, 2023 • 3
The Truth is in There: Improving Reasoning in Language Models with Layer-Selective Rank Reduction Paper • 2312.13558 • Published Dec 21, 2023 • 5
LLM-AD: Large Language Model based Audio Description System Paper • 2405.00983 • Published May 2 • 13
Retentive Network: A Successor to Transformer for Large Language Models Paper • 2307.08621 • Published Jul 17, 2023 • 167
Parameter-Efficient Mixture-of-Experts Architecture for Pre-trained Language Models Paper • 2203.01104 • Published Mar 2, 2022 • 2
The Impact of Depth and Width on Transformer Language Model Generalization Paper • 2310.19956 • Published Oct 30, 2023 • 9
ReAct: Synergizing Reasoning and Acting in Language Models Paper • 2210.03629 • Published Oct 6, 2022 • 12
Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone Paper • 2404.14219 • Published Apr 22 • 239
OpenELM: An Efficient Language Model Family with Open-source Training and Inference Framework Paper • 2404.14619 • Published Apr 22 • 122
Neural Circuit Diagrams: Robust Diagrams for the Communication, Implementation, and Analysis of Deep Learning Architectures Paper • 2402.05424 • Published Feb 8 • 17
Hypothesis Search: Inductive Reasoning with Language Models Paper • 2309.05660 • Published Sep 11, 2023 • 1
Knowledge Sheaves: A Sheaf-Theoretic Framework for Knowledge Graph Embedding Paper • 2110.03789 • Published Oct 7, 2021 • 2
RARR: Researching and Revising What Language Models Say, Using Language Models Paper • 2210.08726 • Published Oct 17, 2022 • 1
BlenderAlchemy: Editing 3D Graphics with Vision-Language Models Paper • 2404.17672 • Published Apr 26 • 18
view article Article The Open Medical-LLM Leaderboard: Benchmarking Large Language Models in Healthcare Apr 19 • 71