Interesting SSL papers EmerNeRF: Emergent Spatial-Temporal Scene Decomposition via Self-Supervision Paper • 2311.02077 • Published Nov 3, 2023 • 16 System 2 Attention (is something you might need too) Paper • 2311.11829 • Published Nov 20, 2023 • 42 Large Language Models for Mathematicians Paper • 2312.04556 • Published Dec 7, 2023 • 13 VisionLLaMA: A Unified LLaMA Interface for Vision Tasks Paper • 2403.00522 • Published Mar 1, 2024 • 47
EmerNeRF: Emergent Spatial-Temporal Scene Decomposition via Self-Supervision Paper • 2311.02077 • Published Nov 3, 2023 • 16
System 2 Attention (is something you might need too) Paper • 2311.11829 • Published Nov 20, 2023 • 42
VisionLLaMA: A Unified LLaMA Interface for Vision Tasks Paper • 2403.00522 • Published Mar 1, 2024 • 47
LLM databricks/dbrx-instruct Text Generation • Updated Apr 19, 2024 • 5.83k • 1.11k Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling Paper • 2412.05271 • Published Dec 6, 2024 • 152 Running 2.45k 2.45k The Ultra-Scale Playbook 🌌 The ultimate guide to training LLM on large GPU Clusters
Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling Paper • 2412.05271 • Published Dec 6, 2024 • 152
Running 2.45k 2.45k The Ultra-Scale Playbook 🌌 The ultimate guide to training LLM on large GPU Clusters