Feature Extraction
Transformers
Safetensors
English
bamboo
custom_code
yixinsong commited on
Commit
d3735d4
1 Parent(s): e809a07

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -8,7 +8,7 @@ However, the widespread adoption of ReLU-based models in the LLM field remains l
8
 
9
  ## Model Architecture
10
 
11
- To push the model's sparsity, we add a ReLU component after GLU component, called dReLU(double ReLU) So our FFN network works as follows:
12
 
13
  ```Python
14
  class BambooMLP(nn.Module):
 
8
 
9
  ## Model Architecture
10
 
11
+ To push the model's sparsity, we add a ReLU component after GLU component, called dReLU(double ReLU). So our FFN network works as follows:
12
 
13
  ```Python
14
  class BambooMLP(nn.Module):