docs/papers/paper.bib
+31 lines changed: 31 additions & 0 deletions
@@ -11,3 +11,34 @@ @article{
   title = {Cylc: A Workflow Engine for Cycling Systems},
   journal = {Journal of Open Source Software}
 }
+
+@software{metplus,
+  author = {Prestopnik, J. and Opatz, J. and Gotway, J. Halley and Jensen, T. and Vigh, J. and Row, M. and Kalb, C. and Fisher, H. and Goodrich, L. and Adriaansen, D. and Win-Gildenmeister, M. and McCabe, G. and Frimel, J. and Blank, L. and Arbetter, T.},
…
+  author = {S.V. Adams and R.W. Ford and M. Hambley and J.M. Hobson and I. Kavčič and C.M. Maynard and T. Melvin and E.H. Müller and S. Mullerworth and A.R. Porter and M. Rezny and B.J. Shipway and R. Wong},
+  keywords = {Separation of concerns, Domain specific language, Exascale, Numerical weather prediction},
+  abstract = {This paper describes LFRic: the new weather and climate modelling system being developed by the UK Met Office to replace the existing Unified Model in preparation for exascale computing in the 2020s. LFRic uses the GungHo dynamical core and runs on a semi-structured cubed-sphere mesh. The design of the supporting infrastructure follows object-oriented principles to facilitate modularity and the use of external libraries where possible. In particular, a ‘separation of concerns’ between the science code and parallel code is imposed to promote performance portability. An application called PSyclone, developed at the STFC Hartree centre, can generate the parallel code enabling deployment of a single source science code onto different machine architectures. This paper provides an overview of the scientific requirement, the design of the software infrastructure, and examples of PSyclone usage. Preliminary performance results show strong scaling and an indication that hybrid MPI/OpenMP performs better than pure MPI.}
docs/papers/paper.md

title: "CSET: Toolkit for evaluation of weather and climate models"
tags:
  - Python
  - Cylc
  - Weather
  - Climate
  - Atmospheric Science
authors:
  - name: James Frost
    orcid: 0009-0009-8043-3802
    affiliation: 1
  - name: James Warner
    orcid:
    affiliation: 1
  - name: Sylvia Bohnenstengel
    orcid:
    affiliation: 1
  - name: David Flack
    orcid:
    affiliation: 1
  - name: Huw Lewis
    orcid:
    affiliation: 1
  - name: Dasha Shchepanovska
    orcid:
    affiliation: 1
  - name: Jon Shonk
    orcid:
    affiliation: 1
  - name: Bernard Claxton
    orcid:
    affiliation: 1
  - name: Jorge Bornemann
    orcid:
    affiliation: 2
  - name: Carol Halliwell
    orcid:
    affiliation: 1
  - name: Magdalena Gruziel
    orcid:
    affiliation: 3
  - name: Pluto ???
    orcid:
    affiliation: 4
  - name: John M Edwards
    orcid:
    affiliation: 1
affiliations:
  - name: Met Office, United Kingdom
    index: 1
    ror: 01ch2yn61
  - name: NIWA, New Zealand
    index: 2
    ror: 01ch2yn61
  - name: Interdisciplinary Centre for Mathematical and Computational Modelling, Poland
    index: 3
  - name: Centre for Climate Research Singapore, Meteorological Service Singapore, Singapore
    index: 4
    ror: 025sv2d63
date: 17 September 2025
bibliography: paper.bib
---

# CSET: Toolkit for evaluation of weather and climate models

-<!-- TODO: Recopy paragraphs from Word doc, as it is still being updated. -->

## Summary

<!-- A summary describing the high-level functionality and purpose of the software for a diverse, non-specialist audience. -->

-The Convective- [and turbulence-] Scale Evaluation Toolkit (**CSET**) is an open source library, command line tool, and workflow for evaluation of weather and climate models. It can analyse model and observational data and visualises the output in a website to allow the development of a coherent evaluation story for numerical weather prediction, climate, and machine learning models across time and spatial scales.
+The _Convective- [and turbulence-] Scale Evaluation Toolkit_ (**CSET**) is a community-driven open source library, command line tool, and workflow designed to support the evaluation of weather and climate models at convective and turbulent scales.
+Developed by the Met Office in collaboration with the [Momentum® Partnership][momentum_partnership] and the broader research community, CSET provides a reproducible, modular, and extensible framework for model diagnostics and verification.
+It analyses numerical weather prediction (NWP) and climate model output, including from the next-generation LFRic model [@lfric], machine learning models, and observational data, and visualises the results in an easily sharable static website, allowing a coherent evaluation story for weather and climate models to be developed across time and spatial scales.

## Statement of need

<!-- A Statement of need section that clearly illustrates the research purpose of the software and places it in the context of related work. -->

-Evaluating weather and climate models is essential for the model development process and has applications in various research domains. Typically, an evaluation includes both context and justification to demonstrate the benefit of model changes compared to other models or previous model versions. The verification provides the context or baseline for understanding the model’s performance through comparison against observation. The evaluation then demonstrates the benefit through comparison against theoretical expectations or previous or different version of the model and other models for similar application areas using diagnostics derived from model output to explain the context.
-
-Historically, evaluation has typically been done with bespoke scripts. These scripts are rarely portable, and the results of evaluation at different institutions are therefore difficult to compare. The writing of these scripts for each evaluation takes significant effort, and they are often poorly maintained, with little in the way of testing or documentation.
+Evaluation is essential to the model development process in the atmospheric sciences.
+Typically, an evaluation includes both context and justification to demonstrate the benefit of model changes against other models or previous model versions.
+Verification provides the context, or baseline, for understanding a model’s performance through comparison against observations.
+Evaluation then demonstrates the benefit through comparison against theoretical expectations, previous or different versions of the model, and other models for similar application areas, using diagnostics derived from model output to explain the context.

## Contribution to the field

-The toolkit aims to cater for the full evaluation process, providing a range of verification diagnostics and diagnostics derived from model output that allow for both process-based and impact-based understanding. The verification side of CSET utilises the Model Evaluation Tools (METplus) verification system [@metplus] to provide a range of verification metrics that are aligned with operational verification best practices. The justification side of CSET consists of a range of diagnostics derived from model output. The diagnostics include process-based diagnostics for specific phenomena. Impact-based diagnostics that can be used to provide meaning to changes for customers are also included.
+CSET addresses the need for an evaluation system that supports consistent and comparable evaluation.
+It gives users easy access to a wide selection of peer-reviewed diagnostics, including spatial plots, time series, vertical profiles, probability density functions, and aggregated analysis over multiple model simulations, replacing bespoke evaluation scripts.
+To cater for the full evaluation process, CSET provides a range of verification diagnostics for comparison against observations, and derived diagnostics based on model output, allowing for both physical process-based and impact-based understanding.
+
+<!-- TODO: Should we include a figure of the CSET web UI? -->
+
+<!-- TODO: Should METplus be mentioned given it isn't integrated yet? -->
+The verification side of CSET utilises the Model Evaluation Tools (METplus) verification system [@metplus] to provide a range of verification metrics aligned with operational verification best practices.
+The justification side of CSET consists of a range of diagnostics derived from model output.
+These derived diagnostics include process-based diagnostics for specific atmospheric phenomena and impact-based diagnostics that can be used to understand how model changes will affect customers.
+
+## Design
+
+CSET is built using operators, recipes, and a workflow:
-The diagnostics within CSET are well-documented, tested, and peer reviewed, allowing confidence for users and increased discoverability. Furthermore, CSET provides a legacy for diagnostics via a clear maintenance infrastructure. The documentation covers diagnostic applicability allowing for confidence in their use. By building around composable operators CSET’s evaluation code can be adapted to user needs while maintaining traceability, putting customers at the heart of evaluation.
+* **Operators** are small Python functions performing a single task, such as reading, writing, filtering, executing a calculation, stratifying, or plotting.
+* **Recipes** are YAML files that compose operators together to produce diagnostics, such as a wind speed difference plot between two model configurations.
+* The **workflow** runs the recipes across a larger number of models, variables, model domains, and dates, collating the results into a website.
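The operator/recipe pattern above can be sketched in a few lines of Python. This is an illustrative mock, not CSET's actual API: the operator names (`read_data`, `filter_above`, `mean`) and the recipe layout are hypothetical, and in CSET the recipe would be a YAML file rather than a Python dict.

```python
# Illustrative sketch of composing small operators via a recipe.
# Operator names and recipe structure are hypothetical, not CSET's real API.

def read_data(data, filename=None):
    """Stand-in for a reading operator; returns fixed sample values."""
    return [2.0, 4.0, 6.0, 8.0]

def filter_above(data, threshold=0.0):
    """Filtering operator: keep only values above a threshold."""
    return [x for x in data if x > threshold]

def mean(data):
    """Calculation operator: arithmetic mean of the values."""
    return sum(data) / len(data)

# Registry mapping the names used in recipes to operator functions.
OPERATORS = {"read_data": read_data, "filter_above": filter_above, "mean": mean}

# In CSET this recipe would live in a YAML file; a dict stands in here.
recipe = {
    "title": "Mean of values above a threshold",
    "steps": [
        {"operator": "read_data", "filename": "example.nc"},
        {"operator": "filter_above", "threshold": 3.0},
        {"operator": "mean"},
    ],
}

def run_recipe(recipe):
    """Run each step in order, threading each output into the next step."""
    data = None
    for step in recipe["steps"]:
        kwargs = {k: v for k, v in step.items() if k != "operator"}
        data = OPERATORS[step["operator"]](data, **kwargs)
    return data

print(run_recipe(recipe))  # prints 6.0
```

Because each operator does one task and only communicates through its return value, the same operators can be recombined into many different diagnostics simply by editing the recipe.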
-Technically, CSET has been built with portability in mind. It can run on a range of platforms, from laptops to supercomputers, and can be easily installed from conda-forge. It is built on a modern software stack that is underpinned by Cylc (a workflow engine for complex computational tasks) [@cylc8], Python 3, Iris (a Python library for meteorological data analysis) [@iris], and METplus (a verification system for weather and climate models). The toolkit is open source and actively developed in the open on GitHub, with extensive automatic unit and integration testing. It aims to be a community-based toolkit, thus contributing to CSET is made easy and actively encouraged with clear developer guidelines to help.
+This design provides flexible software that scientists can easily adapt to address model evaluation questions while maintaining traceability.
+
+![CSET Workflow](workflow.svg)
+
+The recipes and operators within CSET are well-documented, tested, and peer reviewed, increasing discoverability and giving users confidence.
+The documentation covers the applicability and interpretation of diagnostics, ensuring they are used appropriately.
+
+CSET has been built with portability in mind.
+It can run on a range of platforms, from laptops to supercomputers, and can be easily installed from conda-forge.
+It is built on a modern software stack underpinned by Cylc (a workflow engine for complex computational tasks) [@cylc8], Python 3, and Iris (a Python library for meteorological data analysis) [@scitools_iris].
+CSET is open source under the Apache-2.0 licence and actively developed on GitHub, with extensive automatic unit and integration testing.
+It aims to be a community-based toolkit, so contributing to CSET is made easy and actively encouraged, with clear developer guidelines to help.

## Research usage

<!-- Mention (if applicable) a representative set of past or ongoing research projects using the software and recent scholarly publications enabled by it. -->

-In the Met Office and across the Momentum® Partnership (a cooperative partnership of institutions sharing a seamless modelling framework for weather and climate science and services) [@momentum_partnership], CSET has been the tool of choice for understanding the regional configuration of the next-generation numerical weather prediction and climate model LFRic [@lfric]. It has helped us to characterise the regional configuration and led to improvements in our model.
+Recently, CSET has been the tool of choice in the development and evaluation of the Regional Atmosphere Land Configuration RAL3-LFRic in the Met Office and across the Momentum® Partnership (a cooperative partnership of institutions sharing a seamless modelling framework for weather and climate science and services), as part of the Met Office’s Next Generation Modelling System (NGMS) programme to transition from the Unified Model to LFRic.
+It has helped us to characterise the regional configuration and has led to improvements in our model.

-## Related software packages
-
-<!-- TODO: Discuss alternatives, such as ESMValTool. -->
+## Conclusion
+
+CSET shows the benefits of open source evaluation software.
+It reduces redundant development of evaluation diagnostics and supports easier collaboration across organisations involved in atmospheric model evaluation, helping to build a clear and consistent understanding of model characteristics and the benefits of model improvements.
+Major items on CSET's development roadmap are integrating METplus verification into the workflow and increasing the number of supported observation sources.
+
+The CSET documentation is hosted at <https://metoffice.github.io/CSET>.
## Acknowledgements

@@ -110,6 +143,7 @@ We acknowledge contributions and support from the Met Office and Momentum® Partnership