JulesLT heloise-chomet commited on
Commit
825bdcd
·
verified ·
1 Parent(s): 53564f5

Update README.md (#3)

Browse files

- Update README.md (bbb4e2cb34572a3a182efc2468bf518e6b070c25)


Co-authored-by: Heloise Chomet <heloise-chomet@users.noreply.huggingface.co>

Files changed (1) hide show
  1. README.md +64 -1
README.md CHANGED
@@ -1,3 +1,66 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ## License summary
2
 
3
  1. The Licensed Models are **only** available under this License for Non-Commercial Purposes.
@@ -6,4 +69,4 @@
6
  1. any Commercial Purposes, unless agreed by Us under a separate licence;
7
  2. to train, improve or otherwise influence the functionality or performance of any other third-party derivative model that is commercial or intended for a Commercial Purpose and is similar to the Licensed Models;
8
  3. to create models distilled or derived from the Outputs of the Licensed Models, unless such models are for Non-Commercial Purposes and open-sourced under the same license as the Licensed Models; or
9
- 4. in violation of any applicable laws and regulations.
 
1
+ # NequIP
2
+
3
+ ## Reference
4
+
5
+ Simon Batzner, Albert Musaelian, Lixin Sun, Mario Geiger, Jonathan P. Mailoa,
6
+ Mordechai Kornbluth, Nicola Molinari, Tess E. Smidt, and Boris Kozinsky.
7
+ E(3)-equivariant graph neural networks for data-efficient and accurate interatomic potentials.
8
+ Nature Communications, 13(1), May 2022. ISSN: 2041-1723. URL: https://dx.doi.org/10.1038/s41467-022-29939-5.
9
+
10
+ ## How to Use
11
+
12
+ For complete usage instructions, please refer to our [documentation](https://instadeep.github.io/mlip)
13
+
14
+ ## Model architecture
15
+ | Parameter | Value | Description |
16
+ |---------------------------|-----------------------------------------------|---------------------------------------------|
17
+ | `num_layers` | `5` | Number of NequIP layers. |
18
+ | `node_irreps` | `64x0e + 64x0o + 32x1e + 32x1o + 4x2e + 4x2o` | O3 representation space of node features. |
19
+ | `l_max` | `2` | Maximal degree of spherical harmonics. |
20
+ | `num_bessel` | `8` | Number of Bessel basis functions. |
21
+ | `radial_net_nonlinearity` | `swish` | Activation function for radial MLP. |
22
+ | `radial_net_n_hidden` | `64` | Number of hidden features in radial MLP. |
23
+ | `radial_net_n_layers` | `2` | Number of layers in radial MLP. |
24
+ | `radial_envelope` | `polynomial_envelope` | Radial envelope function. |
25
+ | `scalar_mlp_std` | `4` | Standard deviation of weight initialisation.|
26
+ | `atomic_energies` | `None` | Treatment of the atomic energies. |
27
+ | `avg_um_neighbors` | `None` | Mean number of neighbors. |
28
+
29
+ For more information about NequIP hyperparameters,
30
+ please refer to our [documentation](https://instadeep.github.io/mlip/api_reference/models/nequip.html#mlip.models.nequip.config.NequipConfig)
31
+
32
+ ## Training
33
+
34
+ Training is performed over 220 epochs, with an exponential moving average (EMA) decay rate of 0.99.
35
+ The model employs a Huber loss function with scheduled weights for the energy and force components.
36
+ Initially, the energy term is weighted at 40 and the force term at 1000.
37
+ At epoch 115, these weights are flipped.
38
+
39
+ We use our default MLIP optimizer in v1.0.0 with the following settings:
40
+ | Parameter | Value | Description |
41
+ |----------------------------------|----------------|-----------------------------------------------------------------|
42
+ | `init_learning_rate` | `0.002` | Initial learning rate. |
43
+ | `peak_learning_rate` | `0.002` | Peak learning rate. |
44
+ | `final_learning_rate` | `0.002` | Final learning rate. |
45
+ | `weight_decay` | `0` | Weight decay. |
46
+ | `warmup_steps` | `4000` | Number of optimizer warm-up steps. |
47
+ | `transition_steps` | `360000` | Number of optimizer transition steps. |
48
+ | `grad_norm` | `500` | Gradient norm used for gradient clipping. |
49
+ | `num_gradient_accumulation_steps`| `1` | Steps to accumulate before taking an optimizer step. |
50
+
51
+ For more information about the optimizer,
52
+ please refer to our [documentation](https://instadeep.github.io/mlip/api_reference/training/optimizer.html#mlip.training.optimizer_config.OptimizerConfig)
53
+ ## Dataset
54
+ | Parameter | Value | Description |
55
+ |-----------------------------|-------|--------------------------------------------|
56
+ | `graph_cutoff_angstrom` | `5` | Graph cutoff distance (in Å). |
57
+ | `max_n_node` | `32` | Maximum number of nodes allowed in a batch.|
58
+ | `max_n_edge` | `288` | Maximum number of edges allowed in a batch.|
59
+ | `batch_size` | `16` | Number of graphs in a batch. |
60
+
61
+ This model was trained on the [SPICE2_curated dataset](https://huggingface.co/datasets/InstaDeepAI/SPICE2-curated).
62
+ For more information about dataset configuration
63
+ please refer to our [documentation](https://instadeep.github.io/mlip/api_reference/data/dataset_configs.html#mlip.data.configs.GraphDatasetBuilderConfig)
64
  ## License summary
65
 
66
  1. The Licensed Models are **only** available under this License for Non-Commercial Purposes.
 
69
  1. any Commercial Purposes, unless agreed by Us under a separate licence;
70
  2. to train, improve or otherwise influence the functionality or performance of any other third-party derivative model that is commercial or intended for a Commercial Purpose and is similar to the Licensed Models;
71
  3. to create models distilled or derived from the Outputs of the Licensed Models, unless such models are for Non-Commercial Purposes and open-sourced under the same license as the Licensed Models; or
72
+ 4. in violation of any applicable laws and regulations.