Models
Datasets
Spaces
Posts
Docs
Pricing
Log In
Sign Up

Collections

Discover the best community collections!

Collections including paper arxiv:2212.09748

Sora参考论文

OpenAI "Video generation models as world simulators"技术报告后面的参考论文，总共32篇。OpenAI的ImageGPT和Dalle3这两篇缺失，链接已补充到note中。

Unsupervised Learning of Video Representations using LSTMs

Paper • 1502.04681 • Published Feb 16, 2015 • 1
Recurrent Environment Simulators

Paper • 1704.02254 • Published Apr 7, 2017 • 2
World Models

Paper • 1803.10122 • Published Mar 27, 2018 • 2
Generating Videos with Scene Dynamics

Paper • 1609.02612 • Published Sep 8, 2016 • 1

Image / Video Gen

Image Generation Using Diffusion-Based Methods: Tips and Techniques for Stable Diffusion

Understanding Diffusion Models: A Unified Perspective

Paper • 2208.11970 • Published Aug 25, 2022
Tutorial on Diffusion Models for Imaging and Vision

Paper • 2403.18103 • Published Mar 26 • 2
Denoising Diffusion Probabilistic Models

Paper • 2006.11239 • Published Jun 19, 2020 • 3
Denoising Diffusion Implicit Models

Paper • 2010.02502 • Published Oct 6, 2020 • 3

image-generation

aMUSEd: An Open MUSE Reproduction

Paper • 2401.01808 • Published Jan 3 • 28
black-forest-labs/FLUX.1-dev

Text-to-Image • Updated Aug 16 • 1.23M • • 6.04k
Qwen/Qwen2-VL-7B-Instruct

Image-Text-to-Text • Updated Sep 21 • 1.02M • 771
zer0int/CLIP-GmP-ViT-L-14

Zero-Shot Image Classification • Updated Sep 23 • 5.34k • 283

2023 (and before) Papers of the Year

Hiera: A Hierarchical Vision Transformer without the Bells-and-Whistles

Paper • 2306.00989 • Published Jun 1, 2023 • 1
Direct Preference Optimization: Your Language Model is Secretly a Reward Model

Paper • 2305.18290 • Published May 29, 2023 • 47
Scalable Diffusion Models with Transformers

Paper • 2212.09748 • Published Dec 19, 2022 • 16
Matryoshka Representation Learning

Paper • 2205.13147 • Published May 26, 2022 • 9

Scalable Diffusion Models with Transformers

Paper • 2212.09748 • Published Dec 19, 2022 • 16

Applied Machine Learning Papers

Reading List (Mainly Focused of VLM's and Diffusion Models)

Scalable Diffusion Models with Transformers

Paper • 2212.09748 • Published Dec 19, 2022 • 16
Stable Video Diffusion: Scaling Latent Video Diffusion Models to Large Datasets

Paper • 2311.15127 • Published Nov 25, 2023 • 11
Learning Transferable Visual Models From Natural Language Supervision

Paper • 2103.00020 • Published Feb 26, 2021 • 11
U-Net: Convolutional Networks for Biomedical Image Segmentation

Paper • 1505.04597 • Published May 18, 2015 • 7

Img Gen Foundational

Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction

Paper • 2404.02905 • Published Apr 3 • 64
Scaling Rectified Flow Transformers for High-Resolution Image Synthesis

Paper • 2403.03206 • Published Mar 5 • 56
Scalable Diffusion Models with Transformers

Paper • 2212.09748 • Published Dec 19, 2022 • 16
Scalable Pre-training of Large Autoregressive Image Models

Paper • 2401.08541 • Published Jan 16 • 35

Diffusion Models

ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment

Paper • 2403.05135 • Published Mar 8 • 42
Understanding Diffusion Objectives as the ELBO with Simple Data Augmentation

Paper • 2303.00848 • Published Mar 1, 2023
Scalable Diffusion Models with Transformers

Paper • 2212.09748 • Published Dec 19, 2022 • 16
High-Resolution Image Synthesis with Latent Diffusion Models

Paper • 2112.10752 • Published Dec 20, 2021 • 11

Foundation AI Papers

Curated List of Must-Reads on LLM reasoning at Temus AI team

Language Agent Tree Search Unifies Reasoning Acting and Planning in Language Models

Paper • 2310.04406 • Published Oct 6, 2023 • 8
Chain-of-Thought Reasoning Without Prompting

Paper • 2402.10200 • Published Feb 15 • 99
ICDPO: Effectively Borrowing Alignment Capability of Others via In-context Direct Preference Optimization

Paper • 2402.09320 • Published Feb 14 • 6
Self-Discover: Large Language Models Self-Compose Reasoning Structures

Paper • 2402.03620 • Published Feb 6 • 109

An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale

Paper • 2010.11929 • Published Oct 22, 2020 • 6
Scalable Diffusion Models with Transformers

Paper • 2212.09748 • Published Dec 19, 2022 • 16

Previous
1
2
Next

Company

© Hugging Face

TOS Privacy About Jobs

Website

Models Datasets Spaces Pricing Docs