Yedson54's Collections

Multimodal LMs

updated Sep 30, 2024

  • Cambrian-1: A Fully Open, Vision-Centric Exploration of Multimodal LLMs

    Paper • 2406.16860 • Published Jun 24, 2024 • 61

  • PaliGemma: A versatile 3B VLM for transfer

    Paper • 2407.07726 • Published Jul 10, 2024 • 71

  • E5-V: Universal Embeddings with Multimodal Large Language Models

    Paper • 2407.12580 • Published Jul 17, 2024 • 41

  • Emu3: Next-Token Prediction is All You Need

    Paper • 2409.18869 • Published Sep 27, 2024 • 95