Models
Datasets
Spaces
Posts
Docs
Pricing
Log In
Sign Up

Collections

Discover the best community collections!

Collections including paper arxiv:1706.03762

This collection refers to the foundational papers in the area of NLP.

Attention Is All You Need

Paper • 1706.03762 • Published Jun 12, 2017 • 35
MetaGPT: Meta Programming for Multi-Agent Collaborative Framework

Paper • 2308.00352 • Published Aug 1, 2023 • 2
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

Paper • 1810.04805 • Published Oct 11, 2018 • 11
XLNet: Generalized Autoregressive Pretraining for Language Understanding

Paper • 1906.08237 • Published Jun 19, 2019

Sora参考论文

OpenAI "Video generation models as world simulators"技术报告后面的参考论文，总共32篇。OpenAI的ImageGPT和Dalle3这两篇缺失，链接已补充到note中。

Unsupervised Learning of Video Representations using LSTMs

Paper • 1502.04681 • Published Feb 16, 2015 • 1
Recurrent Environment Simulators

Paper • 1704.02254 • Published Apr 7, 2017 • 1
World Models

Paper • 1803.10122 • Published Mar 27, 2018 • 1
Generating Videos with Scene Dynamics

Paper • 1609.02612 • Published Sep 8, 2016 • 1

Sora Reference Papers

A collection of all papers referenced in OpenAI's "Video generation models as world simulators" technical report • openai.com/sora

Unsupervised Learning of Video Representations using LSTMs

Paper • 1502.04681 • Published Feb 16, 2015 • 1
Recurrent Environment Simulators

Paper • 1704.02254 • Published Apr 7, 2017 • 1
World Models

Paper • 1803.10122 • Published Mar 27, 2018 • 1
Generating Videos with Scene Dynamics

Paper • 1609.02612 • Published Sep 8, 2016 • 1

Attention Is All You Need

Paper • 1706.03762 • Published Jun 12, 2017 • 35
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

Paper • 1810.04805 • Published Oct 11, 2018 • 11
Improving Text Embeddings with Large Language Models

Paper • 2401.00368 • Published Dec 31, 2023 • 73
Gemini: A Family of Highly Capable Multimodal Models

Paper • 2312.11805 • Published Dec 19, 2023 • 44

a growing collection of arXiv papers read while learning ML 📜

SMOTE: Synthetic Minority Over-sampling Technique

Paper • 1106.1813 • Published Jun 9, 2011 • 1
Scikit-learn: Machine Learning in Python

Paper • 1201.0490 • Published Jan 2, 2012 • 1
Identity Mappings in Deep Residual Networks

Paper • 1603.05027 • Published Mar 16, 2016 • 2
Deep Residual Learning for Image Recognition

Paper • 1512.03385 • Published Dec 10, 2015 • 5

Transformer Arch

Checkout: https://bbycroft.net/llm and http://nlp.seas.harvard.edu/2018/04/03/attention.html

Attention Is All You Need

Paper • 1706.03762 • Published Jun 12, 2017 • 35
ImageNet Large Scale Visual Recognition Challenge

Paper • 1409.0575 • Published Sep 1, 2014 • 6
Sequence to Sequence Learning with Neural Networks

Paper • 1409.3215 • Published Sep 10, 2014 • 3
Language Models are Few-Shot Learners

Paper • 2005.14165 • Published May 28, 2020 • 9

Large Language Models 2024

Understanding LLMs: A Comprehensive Overview from Training to Inference

Paper • 2401.02038 • Published Jan 4 • 59
The Impact of Reasoning Step Length on Large Language Models

Paper • 2401.04925 • Published Jan 10 • 15
Lost in the Middle: How Language Models Use Long Contexts

Paper • 2307.03172 • Published Jul 6, 2023 • 31
Attention Is All You Need

Paper • 1706.03762 • Published Jun 12, 2017 • 35

Seminal AI Papers

A collection of top AI papers.

Attention Is All You Need

Paper • 1706.03762 • Published Jun 12, 2017 • 35
You Only Look Once: Unified, Real-Time Object Detection

Paper • 1506.02640 • Published Jun 8, 2015
HEp-2 Cell Image Classification with Deep Convolutional Neural Networks

Paper • 1504.02531 • Published Apr 10, 2015
Sleeper Agents: Training Deceptive LLMs that Persist Through Safety Training

Paper • 2401.05566 • Published Jan 10 • 23

Spellman Investment Group AI

Attention Is All You Need

Paper • 1706.03762 • Published Jun 12, 2017 • 35

LoRA: Low-Rank Adaptation of Large Language Models

Paper • 2106.09685 • Published Jun 17, 2021 • 24
Attention Is All You Need

Paper • 1706.03762 • Published Jun 12, 2017 • 35
Direct Preference Optimization: Your Language Model is Secretly a Reward Model

Paper • 2305.18290 • Published May 29, 2023 • 37
Lost in the Middle: How Language Models Use Long Contexts

Paper • 2307.03172 • Published Jul 6, 2023 • 31

Previous
1
2
3
4
5
Next

Company

© Hugging Face

TOS Privacy About Jobs

Website

Models Datasets Spaces Pricing Docs