liuyao/QLNet

liuyao committed on Nov 6, 2023

Commit

d46ec21

1 Parent(s): 0edeeb0

Update README.md

Files changed (1)

README.md

@@ -10,19 +10,21 @@ library_name: timm
 # Model Card for Model ID
-<!-- Provide a quick summary of what the model is/does. -->
 This modelcard aims to be a base template for new models. It has been generated using [this raw template](https://github.com/huggingface/huggingface_hub/blob/main/src/huggingface_hub/templates/modelcard_template.md?plain=1).
 ## Model Details
 ### Model Description
-<!-- Provide a longer summary of what this model is. -->
-- **Developed by:** [More Information Needed]
 - **Funded by [optional]:** [More Information Needed]
 - **Shared by [optional]:** [More Information Needed]
 - **Model type:** [More Information Needed]
@@ -35,7 +37,7 @@ This modelcard aims to be a base template for new models. It has been generated
 <!-- Provide the basic links for the model. -->
 - **Repository:** [More Information Needed]
-- **Paper [optional]:** [More Information Needed]
 - **Demo [optional]:** [More Information Needed]
 ## Uses

 # Model Card for Model ID
+Based on quasilinear hyperbolic systems of PDEs, QLNet explores a new model space for ConvNets that uses multiplication (of same-sized tensors) instead of ReLU as the nonlinearity. It achieves accuracy comparable to ResNet50 on ImageNet-1k, demonstrating that it has the same level of capacity/expressivity, and it deserves further study (such as hyperparameter tuning) that I alone am not able to do.
 This modelcard aims to be a base template for new models. It has been generated using [this raw template](https://github.com/huggingface/huggingface_hub/blob/main/src/huggingface_hub/templates/modelcard_template.md?plain=1).
 ## Model Details
 ### Model Description
+Instead of the ResNet50 bottleneck, which consists of 1x1, 3x3, and 1x1 convolutions in succession, we apply a 1x1 convolution, split its output into two equal halves and multiply them, then apply a 3x3 (depthwise) convolution and a 1x1 convolution. No activation functions are used except at the end of the block, where we apply a *radial activation function* that I call `hardball`.
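The block described above can be sketched in PyTorch roughly as follows. This is a minimal illustration, not the released QLNet code: the class name `MultiplicativeBlockSketch`, the argument names, the channel-wise split, the absence of biases and normalization, and the absence of a skip connection are all assumptions. The card does not define `hardball`, so the final activation is left as a pluggable argument that defaults to the identity.

```python
import torch
import torch.nn as nn


class MultiplicativeBlockSketch(nn.Module):
    """Hypothetical sketch: 1x1 conv, split and multiply, 3x3 depthwise, 1x1 conv."""

    def __init__(self, channels, hidden, final_activation=None):
        super().__init__()
        # 1x1 conv producing 2 * hidden channels so the output can be split in two halves.
        self.expand = nn.Conv2d(channels, 2 * hidden, kernel_size=1, bias=False)
        # 3x3 depthwise conv: groups == channels gives one filter per channel.
        self.depthwise = nn.Conv2d(
            hidden, hidden, kernel_size=3, padding=1, groups=hidden, bias=False
        )
        # Final 1x1 conv back to the input width.
        self.project = nn.Conv2d(hidden, channels, kernel_size=1, bias=False)
        # Placeholder for the card's `hardball` activation, which is not defined here.
        self.final_activation = (
            final_activation if final_activation is not None else nn.Identity()
        )

    def forward(self, x):
        # Assumption: the two equal halves are taken along the channel dimension.
        a, b = self.expand(x).chunk(2, dim=1)
        y = a * b  # elementwise product of same-sized tensors replaces ReLU
        y = self.depthwise(y)
        y = self.project(y)
        return self.final_activation(y)


if __name__ == "__main__":
    block = MultiplicativeBlockSketch(channels=8, hidden=16)
    x = torch.randn(2, 8, 32, 32)
    print(block(x).shape)
```

With no biases and the identity as the final activation, this sketch is quadratic in its input (doubling `x` quadruples the output), which is one way to see that the product, rather than a pointwise function such as ReLU, supplies the nonlinearity.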
+- **Developed by:** Yao Liu 刘杳
 - **Funded by [optional]:** [More Information Needed]
 - **Shared by [optional]:** [More Information Needed]
 - **Model type:** [More Information Needed]
 <!-- Provide the basic links for the model. -->
 - **Repository:** [More Information Needed]
+- **Paper [optional]:** [A Novel ConvNet Architecture with a Continuous Symmetry](https://arxiv.org/abs/2308.01621)
 - **Demo [optional]:** [More Information Needed]
 ## Uses