MetOffice/FastNet-global

@@ -13,7 +13,7 @@ pipeline_tag: graph-ml
 **FastNet** is a data-driven medium-range numerical weather prediction model developed jointly by the UK Met Office and the Alan Turing Institute. This release of FastNet v.1.1 marks the first publicly shared experimental release of the FastNet project.
-FastNet produces highly skilled forecasts that overcome commonly known limitations of AI models, resulting in more physically realistic forecasts as demonstrated in the corresponding publication xxx.
 ⚠️ **Note** A future v2 of this model is in development, based on the [Anemoi framework](https://github.com/ecmwf/anemoi-core). No further updates to the v1 codebase are planned.
@@ -21,9 +21,10 @@ FastNet produces highly skilled forecasts that overcome commonly known limitatio
 ![model_architecture_hf](https://cdn-uploads.huggingface.co/production/uploads/695be1a05bdcada0996ebc50/f5widCbwb-qJQ7GH5Dk5F.png)
 ### Model Description
-FastNet has an encode-process-decode structure with a series of graph neural networks and auto-regressive rollout. The encoder is a directional bipartite graph linking the current atmospheric state defined on input grid cells to a lower resolution latent space defined on mesh nodes. The processor then advances the mesh state in time by six hour increments. The processor operates on a multi-scale icosahedral mesh, starting from the 12-node icosahedron and subdividing six times, enabling the model to capture both localised and long-range interactions. Finally, the decoder maps the latent mesh representation back to the output domain, and the prediction is fed back as input for subsequent steps during rollout. FastNet uses a residual formulation, where the decoder output represents the increment to be added to the input state via skip-level connections, rather than predicting the full field from scratch. Notably, FastNet was trained with loss-function adaptations designed to improve physical realism compared to similar models.
 - **Developed by:** Met Office & The Alan Turing Institute
 - **Model type:** Encoder-processor-decoder model
@@ -32,12 +33,12 @@ FastNet has an encode-process-decode structure with a series of graph neural net
 ### Model Sources
 - **Inference-only repository:** https://github.com/MetOffice/fastnet-inference
-- **Paper:** _add-doi-here_
 ## Uses
 ### Direct Use
-This model is intended for research and exploratory inference on historical or real-time atmospheric reanalysis inputs to produce global weather pattern predictions over a time horizon of up to ~2 days.
 It is released for inference only: weights are provided for forward prediction, but training and fine-tuning are not supported in this release.
 Typical direct uses include: benchmarking against baselines, sensitivity experiments (e.g., perturbing input fields), case-study analysis of notable events
@@ -45,8 +46,6 @@ Typical direct uses include: benchmarking against baselines, sensitivity experim
 ### Out of scope
 This release is not intended for operational forecasting, safety-critical decision making or issuing public warnings
-## Known Limitations
 ## Training Details
 ### Training Data
@@ -84,8 +83,8 @@ Below we summarize the three-stage training procedure, including the number of u
 | Stage         | # rollout   | Loss / Objective |
 | ------------- | ----------- | ---------------- |
 | Pre-training  | n = 1 (6h)  | weighted MSE     |
-| Fine-tuning 1 | n = 7 (42h) | MSE              |
-| Fine-tuning 2 | n = 7 (42h) | spectral MSE     |
 Weighting was applied per-variable and was proportional to pressure level.
 #### Preprocessing

 **FastNet** is a data-driven medium-range numerical weather prediction model developed jointly by the UK Met Office and the Alan Turing Institute. This release of FastNet v.1.1 marks the first publicly shared experimental release of the FastNet project.
+FastNet produces highly skilled forecasts that overcome commonly known limitations of AI models, resulting in more physically realistic forecasts as demonstrated in the corresponding publication [FastNet: Improving the physical consistency of machine-learning weather prediction models through loss function design](https://arxiv.org/abs/2509.17601).
 ⚠️ **Note** A future v2 of this model is in development, based on the [Anemoi framework](https://github.com/ecmwf/anemoi-core). No further updates to the v1 codebase are planned.
 ![model_architecture_hf](https://cdn-uploads.huggingface.co/production/uploads/695be1a05bdcada0996ebc50/f5widCbwb-qJQ7GH5Dk5F.png)
 ### Model Description
+FastNet has an encode-process-decode structure with a series of graph neural networks and auto-regressive rollout. The encoder is a directional bipartite graph linking the current atmospheric state defined on input grid cells to a lower resolution latent space defined on mesh nodes. The processor then advances the mesh state in time by six hour increments. The processor operates on a multi-scale icosahedral mesh, starting from the 12-node icosahedron and subdividing five times, enabling the model to capture both localised and long-range interactions. Finally, the decoder maps the latent mesh representation back to the output domain, and the prediction is fed back as input for subsequent steps during rollout. FastNet uses a residual formulation, where the decoder output represents the increment to be added to the input state via skip-level connections, rather than predicting the full field from scratch. Notably, FastNet was trained with loss-function adaptations designed to improve physical realism compared to similar models.
 - **Developed by:** Met Office & The Alan Turing Institute
 - **Model type:** Encoder-processor-decoder model
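To make the mesh sizes in the description concrete, the sketch below counts nodes, edges and faces at each refinement level. It assumes the standard subdivision in which every triangular face is split into four. The model card does not spell out the exact scheme, so the function name and the resulting figures are illustrative rather than taken from the FastNet code.

```python
from typing import List, Tuple


def icosphere_counts(refinements: int) -> List[Tuple[int, int, int]]:
    """Return (nodes, edges, faces) for each level of a refined icosahedron.

    Assumption (not stated in the model card): each refinement splits every
    triangle into four by adding one new node at the midpoint of every edge.
    """
    nodes, edges, faces = 12, 30, 20  # the base icosahedron
    counts = [(nodes, edges, faces)]
    for _ in range(refinements):
        # One new node per old edge; each old edge is halved, and each old
        # face gains three interior edges; each old face becomes four faces.
        nodes, edges, faces = nodes + edges, 2 * edges + 3 * faces, 4 * faces
        counts.append((nodes, edges, faces))
    return counts


if __name__ == "__main__":
    for level, (n, e, f) in enumerate(icosphere_counts(5)):
        print(f"level {level}: {n} nodes, {e} edges, {f} faces")
```

Under that assumption, five subdivisions give a finest level of 10,242 nodes, and a multi-scale mesh would combine the edges of all six levels so that coarse edges carry long-range interactions and fine edges carry local ones.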
 ### Model Sources
 - **Inference-only repository:** https://github.com/MetOffice/fastnet-inference
+- **Paper:** [main publication](https://arxiv.org/abs/2509.17601) and [technical paper](https://arxiv.org/abs/2509.17658)
 ## Uses
 ### Direct Use
+This model is intended for research and exploratory inference using historical or real-time atmospheric reanalysis inputs to generate global weather-pattern predictions. Model performance has typically been evaluated for lead times of up to 10 days, although fine-tuning was focused on shorter horizons.
 It is released for inference only: weights are provided for forward prediction, but training and fine-tuning are not supported in this release.
 Typical direct uses include: benchmarking against baselines, sensitivity experiments (e.g., perturbing input fields), case-study analysis of notable events
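The residual, auto-regressive rollout described in the model card can be sketched as follows. This is a minimal illustration and does not use the `fastnet-inference` API: `predict_increment` is a hypothetical stand-in for the trained network, and the state is a flat list of floats rather than gridded atmospheric fields.

```python
from typing import Callable, List

State = List[float]
STEP_HOURS = 6  # the model card states six-hour increments


def steps_for_lead_time(days: float) -> int:
    """Number of six-hour steps needed to reach a lead time given in days."""
    return int(days * 24 // STEP_HOURS)


def rollout(
    state: State,
    predict_increment: Callable[[State], State],
    n_steps: int,
) -> List[State]:
    """Auto-regressive rollout with a residual update.

    At each step the (hypothetical) model predicts an increment, which is
    added to the current state; the result is fed back in as the next input.
    """
    trajectory: List[State] = []
    current = list(state)
    for _ in range(n_steps):
        increment = predict_increment(current)
        current = [x + dx for x, dx in zip(current, increment)]
        trajectory.append(current)
    return trajectory


def toy_increment(state: State) -> State:
    """Toy stand-in for the network: relaxes every value towards zero."""
    return [-0.1 * x for x in state]


if __name__ == "__main__":
    forecast = rollout([1.0, 2.0], toy_increment, steps_for_lead_time(10))
    print(len(forecast), "steps; final state:", forecast[-1])
```

A 10-day evaluation horizon corresponds to 40 such steps. For a sensitivity experiment, the same loop would be run twice, once from the original state and once from a perturbed copy, and the two trajectories compared.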
 ### Out of scope
 This release is not intended for operational forecasting, safety-critical decision making or issuing public warnings
 ## Training Details
 ### Training Data
 | Stage         | # rollout   | Loss / Objective |
 | ------------- | ----------- | ---------------- |
 | Pre-training  | n = 1 (6h)  | weighted MSE     |
+| Fine-tuning 1 | n = 7 (42h) | weighted MSE              |
+| Fine-tuning 2 | up to n = 12 (72h) in a [1, 2, 4, 8, 12] pattern | spectral AMSE     |
 Weighting was applied per-variable and was proportional to pressure level.
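As a rough illustration of the table above, the sketch below converts rollout lengths to lead times and computes an MSE weighted in proportion to pressure level. Normalising the weights to sum to one, the example pressure levels, and all function names are assumptions made for illustration. The per-variable weighting and the spectral AMSE objective are not reproduced here.

```python
from typing import List, Sequence

STEP_HOURS = 6  # one rollout step is six hours
FINE_TUNING_2_SCHEDULE = [1, 2, 4, 8, 12]  # rollout pattern from the table


def lead_time_hours(n_rollout: int) -> int:
    """Lead time in hours reached after n_rollout six-hour steps."""
    return n_rollout * STEP_HOURS


def pressure_weights(levels_hpa: Sequence[float]) -> List[float]:
    """Weights proportional to pressure level.

    Assumption: weights are normalised to sum to one.
    """
    total = sum(levels_hpa)
    return [p / total for p in levels_hpa]


def weighted_mse(
    pred: Sequence[Sequence[float]],
    target: Sequence[Sequence[float]],
    weights: Sequence[float],
) -> float:
    """MSE per pressure level (rows), combined using the level weights."""
    loss = 0.0
    for w, p_row, t_row in zip(weights, pred, target):
        level_mse = sum((p - t) ** 2 for p, t in zip(p_row, t_row)) / len(p_row)
        loss += w * level_mse
    return loss


if __name__ == "__main__":
    print([lead_time_hours(n) for n in FINE_TUNING_2_SCHEDULE])  # [6, 12, 24, 48, 72]
    weights = pressure_weights([500.0, 1000.0])  # hypothetical levels in hPa
    print(weighted_mse([[1.0, 1.0], [2.0, 2.0]], [[0.0, 0.0], [0.0, 0.0]], weights))
```

With weights proportional to pressure, errors at levels nearer the surface (higher pressure) contribute more to the loss than errors at upper levels.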
 #### Preprocessing