-
Rich feature hierarchies for accurate object detection and semantic segmentation
Paper • 1311.2524 • Published • 1 -
DeepPose: Human Pose Estimation via Deep Neural Networks
Paper • 1312.4659 • Published • 1 -
Generative Adversarial Networks
Paper • 1406.2661 • Published • 2 -
scikit-image: Image processing in Python
Paper • 1407.6245 • Published • 1
Collections
Discover the best community collections!
Collections including paper arxiv:1905.11946
-
AttentiveNAS: Improving Neural Architecture Search via Attentive Sampling
Paper • 2011.09011 • Published • 2 -
HAT: Hardware-Aware Transformers for Efficient Natural Language Processing
Paper • 2005.14187 • Published • 2 -
BigNAS: Scaling Up Neural Architecture Search with Big Single-Stage Models
Paper • 2003.11142 • Published • 2 -
Efficient Architecture Search by Network Transformation
Paper • 1707.04873 • Published • 2
-
Wide Residual Networks
Paper • 1605.07146 • Published • 2 -
Characterizing signal propagation to close the performance gap in unnormalized ResNets
Paper • 2101.08692 • Published • 2 -
Pareto-Optimal Quantized ResNet Is Mostly 4-bit
Paper • 2105.03536 • Published • 2 -
When Vision Transformers Outperform ResNets without Pre-training or Strong Data Augmentations
Paper • 2106.01548 • Published • 2
-
Measuring the Effects of Data Parallelism on Neural Network Training
Paper • 1811.03600 • Published • 2 -
Adafactor: Adaptive Learning Rates with Sublinear Memory Cost
Paper • 1804.04235 • Published • 2 -
EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks
Paper • 1905.11946 • Published • 3 -
Yi: Open Foundation Models by 01.AI
Paper • 2403.04652 • Published • 62
-
Scaling Laws for Neural Language Models
Paper • 2001.08361 • Published • 6 -
An Empirical Model of Large-Batch Training
Paper • 1812.06162 • Published • 2 -
Measuring the Effects of Data Parallelism on Neural Network Training
Paper • 1811.03600 • Published • 2 -
Adafactor: Adaptive Learning Rates with Sublinear Memory Cost
Paper • 1804.04235 • Published • 2