Upload models

Files changed (7)

LICENSE +21 -0
README.md +59 -0
models.py +196 -0
nn_state.pt +3 -0
qobsTTFFFFFTF30FFTFTF30TTFTFTFFF80FFTFTTF2699FFFF_X01_no_qp_no_adv_surf_F_Tin_qin_disteq_O_Trad_rest_Tadv_qadv_qout_qsed_RESCALED_7epochs_no_drop_REAL_NN_layers5in61out148_BN_F_te70.nc +0 -0
requirements.txt +7 -0
test_python_net.py +40 -0

LICENSE ADDED

	@@ -0,0 +1,21 @@

+MIT License
+Copyright (c) 2020, 2023 Janni Yuval and Institute of Computing for Climate Science
+Permission is hereby granted, free of charge, to any person obtaining a copy
+of this software and associated documentation files (the "Software"), to deal
+in the Software without restriction, including without limitation the rights
+to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
+copies of the Software, and to permit persons to whom the Software is
+furnished to do so, subject to the following conditions:
+The above copyright notice and this permission notice shall be included in all
+copies or substantial portions of the Software.
+THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
+IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
+FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
+AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
+LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
+OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
+SOFTWARE.

README.md ADDED

	@@ -0,0 +1,59 @@

+# Convection Parameterization in CAM
+Note that this repository and its code are still a work in progress and undergoing significant development.
+Once a usable release is produced, it will be tagged.
+## Description
+This repository contains code as part of an effort to deploy machine learning (ML) models of geophysical parameterizations into the [Community Earth System Model (CESM)](https://www.cesm.ucar.edu/).
+This work is part of the [M<sup>2</sup>LInES](https://m2lines.github.io/) project aiming to improve performance of climate models using ML models for subgrid parameterizations.
+A Neural Net providing a subgrid parameterization of atmospheric convection in a [single column model](https://www.arm.gov/publications/proceedings/conf04/extended_abs/randall_da.pdf) has been developed and successfully deployed as part of an atmospheric simulation.
+The work is described in a [GRL paper](https://agupubs.onlinelibrary.wiley.com/doi/10.1029/2020GL091363) with [accompanying code available](https://github.com/yaniyuval/Neural_nework_parameterization/tree/v.1.0.3). That repository contains the neural net and its implementation in a simple system for atmospheric modelling, [SAM](http://rossby.msrc.sunysb.edu/~marat/SAM.html).
+The aims of this repository are to:
+1. develop a standalone Fortran module based on this neural net that can be used elsewhere,
+2. deploy the module in another atmospheric model, and
+3. evaluate its performance.
+We may also investigate interfacing with the PyTorch implementation of the neural net using the [pytorch-fortran bridging code](https://github.com/Cambridge-ICCS/fortran-pytorch-lib) developed at the [Institute of Computing for Climate Science](https://cambridge-iccs.github.io/).
+The model will first be deployed into the [Single Column Atmospheric Model (SCAM)](https://www.cesm.ucar.edu/models/simple/scam) - a single column version of the CESM.
+We plan to evaluate performance using SCAM in the gateIII configuration for tropical convection, in a manner similar to that described in the [SCAM6 publication in JAMES](https://agupubs.onlinelibrary.wiley.com/doi/10.1029/2018MS001578).
+This will compare model performance to data from an intense observation period (IOP) described in an [AMS publication](https://journals.ametsoc.org/view/journals/atsc/36/1/1520-0469_1979_036_0053_saposs_2_0_co_2.xml).
+Longer-term developments of this project will seek to re-deploy more complex ML parameterizations into more complex atmospheric models such as the [Community Atmospheric Model (CAM)](https://www.cesm.ucar.edu/models/cam), part of the CESM.
+## Repository structure
+```
+├── NN_module
+│   └── ...
+└── torch_nets
+    └── ...
+```
+### Contents
+### `NN_module/`
+This folder contains the Fortran neural net extracted from the [code referenced above](https://github.com/yaniyuval/Neural_nework_parameterization/tree/v.1.0.3), along with any dependencies, which may be compiled as a standalone Fortran module.
+Currently there is code that can be built on CSD3 using the included shell script.
+This now needs cleaning up and testing, and a proper makefile needs to be created (see open issues #9 and #10).
+### `torch_nets/`
+This directory contains the PyTorch versions of the neural networks we are interested in.
+## Contributing
+This repository is currently private as it is new and a work in progress.
+Open tickets can be viewed at ['Issues'](https://github.com/m2lines/convection-parameterization-in-CAM/issues).
+To contribute, find a relevant issue or open a new one, and assign yourself to work on it.
+Then create a branch in which to add your contribution and open a pull request.
+Once ready, assign a reviewer and request a code review.
+Merging should _only_ be performed once a code review has taken place.

models.py ADDED

	@@ -0,0 +1,196 @@

+"""Neural network architectures."""
+from typing import Optional
+import netCDF4 as nc  # type: ignore
+import torch
+from torch import nn, Tensor
+class ANN(nn.Sequential):
+    """Model used in the paper.
+    Paper: https://doi.org/10.1029/2020GL091363
+    Parameters
+    ----------
+    n_in : int
+        Number of input features.
+    n_out : int
+        Number of output features.
+    n_layers : int
+        Number of layers.
+    neurons : int
+        The number of neurons in the hidden layers.
+    dropout : float
+        The dropout probability to apply in the hidden layers.
+    device : str
+        The device to put the model on.
+    features_mean : ndarray
+        The mean of the input features.
+    features_std : ndarray
+        The standard deviation of the input features.
+    outputs_mean : ndarray
+        The mean of the output features.
+    outputs_std : ndarray
+        The standard deviation of the output features.
+    output_groups : ndarray
+        The number of output features in each group of the output.
+    Notes
+    -----
+    If you are doing inference, always remember to put the model in eval mode
+    by using ``model.eval()``, so the dropout layers are turned off.
+    """
+    def __init__(  # pylint: disable=too-many-arguments,too-many-locals
+        self,
+        n_in: int = 61,
+        n_out: int = 148,
+        n_layers: int = 5,
+        neurons: int = 128,
+        dropout: float = 0.0,
+        device: str = "cpu",
+        features_mean: Optional[Tensor] = None,
+        features_std: Optional[Tensor] = None,
+        outputs_mean: Optional[Tensor] = None,
+        outputs_std: Optional[Tensor] = None,
+        output_groups: Optional[list] = None,
+    ):
+        """Initialize the ANN model."""
+        dims = [n_in] + [neurons] * (n_layers - 1) + [n_out]
+        layers = []
+        for i in range(n_layers):
+            layers.append(nn.Linear(dims[i], dims[i + 1]))
+            if i < n_layers - 1:
+                layers.append(nn.ReLU())  # type: ignore
+                layers.append(nn.Dropout(dropout))  # type: ignore
+        super().__init__(*layers)
+        fmean = fstd = omean = ostd = None
+        if features_mean is not None:
+            assert features_std is not None
+            assert len(features_mean) == len(features_std)
+            fmean = torch.tensor(features_mean)
+            fstd = torch.tensor(features_std)
+        if outputs_mean is not None:
+            assert outputs_std is not None
+            assert len(outputs_mean) == len(outputs_std)
+            if output_groups is None:
+                omean = torch.tensor(outputs_mean)
+                ostd = torch.tensor(outputs_std)
+            else:
+                assert len(output_groups) == len(outputs_mean)
+                omean = torch.tensor(
+                    [x for x, g in zip(outputs_mean, output_groups) for _ in range(g)]
+                )
+                ostd = torch.tensor(
+                    [x for x, g in zip(outputs_std, output_groups) for _ in range(g)]
+                )
+        self.register_buffer("features_mean", fmean)
+        self.register_buffer("features_std", fstd)
+        self.register_buffer("outputs_mean", omean)
+        self.register_buffer("outputs_std", ostd)
+        self.to(torch.device(device))
+    def forward(self, input: Tensor):  # pylint: disable=redefined-builtin
+        """Pass the input through the model.
+        Override the forward method of nn.Sequential to add normalization
+        to the input and denormalization to the output.
+        Parameters
+        ----------
+        input : Tensor
+            A mini-batch of inputs.
+        Returns
+        -------
+        Tensor
+            The model output.
+        """
+        if self.features_mean is not None:
+            input = (input - self.features_mean) / self.features_std
+        # pass the input through the layers using nn.Sequential.forward
+        output = super().forward(input)
+        if self.outputs_mean is not None:
+            output = output * self.outputs_std + self.outputs_mean
+        return output
+    def load(self, path: str) -> "ANN":
+        """Load the model from a checkpoint.
+        Parameters
+        ----------
+        path : str
+            The path to the checkpoint.
+        """
+        state = torch.load(path)
+        for key in ["features_mean", "features_std", "outputs_mean", "outputs_std"]:
+            if key in state and getattr(self, key) is None:
+                setattr(self, key, state[key])
+        self.load_state_dict(state)
+        return self
+    def save(self, path: str):
+        """Save the model to a checkpoint.
+        Parameters
+        ----------
+        path : str
+            The path to save the checkpoint to.
+        """
+        torch.save(self.state_dict(), path)
+def load_from_netcdf_params(nc_file: str, dtype: str = "float32") -> ANN:
+    """Load the model with weights and biases from the netcdf file.
+    Parameters
+    ----------
+    nc_file : str
+        The netcdf file containing the parameters.
+    dtype : str
+        The data type to cast the parameters to.
+    """
+    data_set = nc.Dataset(nc_file)  # pylint: disable=no-member
+    model = ANN(
+        features_mean=data_set["fscale_mean"][:].astype(dtype),
+        features_std=data_set["fscale_stnd"][:].astype(dtype),
+        outputs_mean=data_set["oscale_mean"][:].astype(dtype),
+        outputs_std=data_set["oscale_stnd"][:].astype(dtype),
+        output_groups=[30, 29, 29, 30, 30],
+    )
+    for i, layer in enumerate(l for l in model.modules() if isinstance(l, nn.Linear)):
+        layer.weight.data = torch.tensor(data_set[f"w{i+1}"][:].astype(dtype))
+        layer.bias.data = torch.tensor(data_set[f"b{i+1}"][:].astype(dtype))
+    return model
+if __name__ == "__main__":
+    # Load the model from the netcdf file and save it to a checkpoint.
+    net = load_from_netcdf_params(
+        "qobsTTFFFFFTF30FFTFTF30TTFTFTFFF80FFTFTTF2699FFFF_X01_no_qp_no_adv_"
+        "surf_F_Tin_qin_disteq_O_Trad_rest_Tadv_qadv_qout_qsed_RESCALED_7epochs"
+        "_no_drop_REAL_NN_layers5in61out148_BN_F_te70.nc"
+    )
+    net.save("nn_state.pt")
+    print("Model saved to nn_state.pt")
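A note on the sizes implied by `models.py`: the sketch below re-derives, in plain Python, the layer shapes produced by the `dims` expression in `ANN.__init__` with its default arguments, and the way `output_groups` stretches one statistic per group into one value per output feature. The helper names `layer_dims` and `expand_groups` are hypothetical and not part of the uploaded code, and the per-group values are made up for illustration rather than read from the NetCDF file.

```python
def layer_dims(n_in=61, n_out=148, n_layers=5, neurons=128):
    """Return the (in, out) size of each Linear layer, mirroring ANN.__init__."""
    dims = [n_in] + [neurons] * (n_layers - 1) + [n_out]
    return list(zip(dims[:-1], dims[1:]))


def expand_groups(values, groups):
    """Repeat each per-group value once per output feature in that group."""
    return [v for v, g in zip(values, groups) for _ in range(g)]


shapes = layer_dims()
# Each Linear layer holds in*out weights plus out biases.
n_params = sum(i * o + o for i, o in shapes)

groups = [30, 29, 29, 30, 30]  # as passed in load_from_netcdf_params
per_group = [0.1, 0.2, 0.3, 0.4, 0.5]  # illustrative values only
expanded = expand_groups(per_group, groups)

print(shapes)
print(n_params)
print(len(expanded))
```

The group sizes sum to 148, which is why the expanded statistics line up with the default `n_out` and can be broadcast against the network output in `forward`.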

nn_state.pt ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:552b224668a40820e9afd0e4f83053dbc9f4ee7c75814ad32ed2805041afb1e1
+size 312574
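The three added lines are a Git LFS pointer, not the checkpoint itself: they record the hash algorithm, the digest, and the size in bytes of the real file. As a rough sketch, a downloaded `nn_state.pt` could be checked against the pointer as follows. The helper names `parse_lfs_pointer` and `matches_pointer` are hypothetical, and the code assumes the simple `key value` layout shown above.

```python
import hashlib


def parse_lfs_pointer(text: str) -> dict:
    """Parse the 'key value' lines of a Git LFS pointer file."""
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    algo, digest = fields["oid"].split(":", 1)
    return {"algo": algo, "digest": digest, "size": int(fields["size"])}


def matches_pointer(data: bytes, pointer: dict) -> bool:
    """Check that raw file bytes have the size and hash the pointer records."""
    actual = hashlib.new(pointer["algo"], data).hexdigest()
    return len(data) == pointer["size"] and actual == pointer["digest"]


POINTER_TEXT = """\
version https://git-lfs.github.com/spec/v1
oid sha256:552b224668a40820e9afd0e4f83053dbc9f4ee7c75814ad32ed2805041afb1e1
size 312574
"""
pointer = parse_lfs_pointer(POINTER_TEXT)
print(pointer["algo"], pointer["size"])
```

With the real file on disk, one would pass its bytes (for example `Path("nn_state.pt").read_bytes()`) to `matches_pointer` together with `pointer`.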

qobsTTFFFFFTF30FFTFTF30TTFTFTFFF80FFTFTTF2699FFFF_X01_no_qp_no_adv_surf_F_Tin_qin_disteq_O_Trad_rest_Tadv_qadv_qout_qsed_RESCALED_7epochs_no_drop_REAL_NN_layers5in61out148_BN_F_te70.nc ADDED

Binary file (308 kB).

requirements.txt ADDED

	@@ -0,0 +1,7 @@

+torch
+black
+pytest
+pydocstyle
+pylint
+mypy
+netcdf4

test_python_net.py ADDED

	@@ -0,0 +1,40 @@

+"""A smoke test for the ANN model.
+This test checks that the model can be loaded from a weights file in both pt format and
+netcdf format and that they produce the expected output when given an input of all ones.
+This ensures that it is equivalent to the Fortran NN model.
+"""
+import os
+from pathlib import Path
+import torch
+import numpy as np
+from models import ANN, load_from_netcdf_params
+os.chdir(Path(__file__).parent)
+expected = np.loadtxt("nn_ones.txt").astype(np.float32)
+# nn_ones.txt is the output of the Fortran NN model given an input of all ones.
+model1 = ANN().load("nn_state.pt")  # load from the pytorch weights
+model2 = load_from_netcdf_params(
+    "qobsTTFFFFFTF30FFTFTF30TTFTFTFFF80FFTFTTF2699FFFF_X01_no_qp_no_adv_"
+    "surf_F_Tin_qin_disteq_O_Trad_rest_Tadv_qadv_qout_qsed_RESCALED_7epochs"
+    "_no_drop_REAL_NN_layers5in61out148_BN_F_te70.nc"
+)  # load from the NetCDF weights of the pretrained Fortran NN model
+# file created at https://github.com/yaniyuval/Neural_nework_parameterization/blob/f81f5f695297888f0bd1e0e61524590b4566bf03/NN_training/src/ml_train_nn.py#L417 # pylint: disable=line-too-long
+# (with the naming scheme integrating information about the training setup; see, e.g., https://github.com/yaniyuval/Neural_nework_parameterization/blob/f81f5f695297888f0bd1e0e61524590b4566bf03/NN_training/src/ml_train_nn.py#L263-L265) # pylint: disable=line-too-long
+# This Neural Net can be found at https://github.com/yaniyuval/Neural_nework_parameterization/tree/f81f5f695297888f0bd1e0e61524590b4566bf03/NNs # pylint: disable=line-too-long
+x = torch.ones(61)
+actual1 = model1(x).detach().numpy()
+actual2 = model2(x).detach().numpy()
+assert np.all(actual1 == actual2)
+assert np.allclose(expected, actual1, atol=3e-8, rtol=2e-6)
+# Values of atol and rtol are chosen to be the lowest that still pass the test.
+print("Smoke tests passed")
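For readers unfamiliar with how `atol` and `rtol` combine in the comparison against the Fortran output: NumPy documents `allclose(a, b)` as checking `|a - b| <= atol + rtol * |b|` element by element. The sketch below re-implements that rule in plain Python with the tolerances used in the test. The function name `within_tolerance` is hypothetical and the sample values are made up, not taken from `nn_ones.txt`.

```python
def within_tolerance(a, b, atol=3e-8, rtol=2e-6):
    """Element-wise |a - b| <= atol + rtol * |b|, true only if all pass."""
    return all(abs(x - y) <= atol + rtol * abs(y) for x, y in zip(a, b))


reference = [1.0, -250.0, 0.0]  # made-up stand-in for the Fortran output
close = [1.000001, -250.0004, 2e-8]  # small relative or absolute deviations
far = [1.00001, -250.0, 0.0]  # first entry is off by about 1e-5

print(within_tolerance(reference, close))
print(within_tolerance(reference, far))
```

The relative term dominates for large-magnitude outputs, while `atol` is what allows entries near zero to pass, so both tolerances matter when the 148 outputs span several orders of magnitude.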