FLAME: Factuality-Aware Alignment for Large Language Models Paper • 2405.01525 • Published 2 days ago • 8
NeMo-Aligner: Scalable Toolkit for Efficient Model Alignment Paper • 2405.01481 • Published 2 days ago • 13
LoRA Land: 310 Fine-tuned LLMs that Rival GPT-4, A Technical Report Paper • 2405.00732 • Published 5 days ago • 37
Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models Paper • 2405.01535 • Published 2 days ago • 40
PoSE: Efficient Context Window Extension of LLMs via Positional Skip-wise Training Paper • 2309.10400 • Published Sep 19, 2023 • 21
Spectrally Pruned Gaussian Fields with Neural Compensation Paper • 2405.00676 • Published 3 days ago • 7
A Careful Examination of Large Language Model Performance on Grade School Arithmetic Paper • 2405.00332 • Published 3 days ago • 17
Is Bigger Edit Batch Size Always Better? -- An Empirical Study on Model Editing with Llama-3 Paper • 2405.00664 • Published 3 days ago • 11
Better & Faster Large Language Models via Multi-token Prediction Paper • 2404.19737 • Published 4 days ago • 46
BlenderAlchemy: Editing 3D Graphics with Vision-Language Models Paper • 2404.17672 • Published 8 days ago • 15
Ag2Manip: Learning Novel Manipulation Skills with Agent-Agnostic Visual and Action Representations Paper • 2404.17521 • Published 8 days ago • 11
Replacing Judges with Juries: Evaluating LLM Generations with a Panel of Diverse Models Paper • 2404.18796 • Published 5 days ago • 53
AdvPrompter: Fast Adaptive Adversarial Prompting for LLMs Paper • 2404.16873 • Published 13 days ago • 22
PLLaVA : Parameter-free LLaVA Extension from Images to Videos for Video Dense Captioning Paper • 2404.16994 • Published 9 days ago • 29
view article Article 🦙⚗️ Using Llama3 and distilabel to build fine-tuning datasets By dvilasuero • 8 days ago • 47
ChatAnything: Facetime Chat with LLM-Enhanced Personas Paper • 2311.06772 • Published Nov 12, 2023 • 33
List Items One by One: A New Data Source and Learning Paradigm for Multimodal LLMs Paper • 2404.16375 • Published 9 days ago • 14
Interactive3D: Create What You Want by Interactive 3D Generation Paper • 2404.16510 • Published 9 days ago • 17
Layer Skip: Enabling Early Exit Inference and Self-Speculative Decoding Paper • 2404.16710 • Published 9 days ago • 50
SEED-Bench-2-Plus: Benchmarking Multimodal Large Language Models with Text-Rich Visual Comprehension Paper • 2404.16790 • Published 9 days ago • 7
How Far Are We to GPT-4V? Closing the Gap to Commercial Multimodal Models with Open-Source Suites Paper • 2404.16821 • Published 9 days ago • 47
OpenELM: An Efficient Language Model Family with Open-source Training and Inference Framework Paper • 2404.14619 • Published 12 days ago • 114
SEED-X: Multimodal Models with Unified Multi-granularity Comprehension and Generation Paper • 2404.14396 • Published 12 days ago • 16
How Good Are Low-bit Quantized LLaMA3 Models? An Empirical Study Paper • 2404.14047 • Published 12 days ago • 37
The Instruction Hierarchy: Training LLMs to Prioritize Privileged Instructions Paper • 2404.13208 • Published 15 days ago • 36
Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone Paper • 2404.14219 • Published 12 days ago • 226
view article Article The Open Medical-LLM Leaderboard: Benchmarking Large Language Models in Healthcare 16 days ago • 57
view article Article Jack of All Trades, Master of Some, a Multi-Purpose Transformer Agent 13 days ago • 61
PhysDreamer: Physics-Based Interaction with 3D Objects via Video Generation Paper • 2404.13026 • Published 15 days ago • 21
AutoCrawler: A Progressive Understanding Web Agent for Web Crawler Generation Paper • 2404.12753 • Published 15 days ago • 36
Groma: Localized Visual Tokenization for Grounding Multimodal Large Language Models Paper • 2404.13013 • Published 15 days ago • 26
TextSquare: Scaling up Text-Centric Visual Instruction Tuning Paper • 2404.12803 • Published 15 days ago • 27
MeshLRM: Large Reconstruction Model for High-Quality Mesh Paper • 2404.12385 • Published 16 days ago • 23
Introducing v0.5 of the AI Safety Benchmark from MLCommons Paper • 2404.12241 • Published 16 days ago • 10
Reuse Your Rewards: Reward Model Transfer for Zero-Shot Cross-Lingual Alignment Paper • 2404.12318 • Published 16 days ago • 14
Toward Self-Improvement of LLMs via Imagination, Searching, and Criticizing Paper • 2404.12253 • Published 16 days ago • 46
BLINK: Multimodal Large Language Models Can See but Not Perceive Paper • 2404.12390 • Published 16 days ago • 22
Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models Paper • 2404.12387 • Published 16 days ago • 34
Meta Llama 3 Collection This collection hosts the transformers and original repos of the Meta Llama 3 and Llama Guard 2 releases • 5 items • Updated 16 days ago • 459
Video2Game: Real-time, Interactive, Realistic and Browser-Compatible Environment from a Single Video Paper • 2404.09833 • Published 19 days ago • 27
Ctrl-Adapter: An Efficient and Versatile Framework for Adapting Diverse Controls to Any Diffusion Model Paper • 2404.09967 • Published 19 days ago • 20