TY - JOUR A1 - Grundner, Arthur A1 - Beucler, Tom A1 - Gentine, Pierre A1 - Iglesias‐Suarez, Fernando A1 - Giorgetta, Marco A. A1 - Eyring, Veronika T1 - Deep Learning Based Cloud Cover Parameterization for ICON Y1 - 2022-12-14 VL - 14 IS - 12 JF - Journal of Advances in Modeling Earth Systems DO - 10.1029/2021MS002959 PB - N2 - A promising approach to improve cloud parameterizations within climate models and thus climate projections is to use deep learning in combination with training data from storm‐resolving model (SRM) simulations. The ICOsahedral Non‐hydrostatic (ICON) modeling framework permits simulations ranging from numerical weather prediction to climate projections, making it an ideal target to develop neural network (NN) based parameterizations for sub‐grid scale processes. Within the ICON framework, we train NN based cloud cover parameterizations with coarse‐grained data based on realistic regional and global ICON SRM simulations. We set up three different types of NNs that differ in the degree of vertical locality they assume for diagnosing cloud cover from coarse‐grained atmospheric state variables. The NNs accurately estimate sub‐grid scale cloud cover from coarse‐grained data that has similar geographical characteristics as their training data. Additionally, globally trained NNs can reproduce sub‐grid scale cloud cover of the regional SRM simulation. Using the game‐theory based interpretability library SHapley Additive exPlanations, we identify an overemphasis on specific humidity and cloud ice as the reason why our column‐based NN cannot perfectly generalize from the global to the regional coarse‐grained SRM data. The interpretability tool also helps visualize similarities and differences in feature importance between regionally and globally trained column‐based NNs, and reveals a local relationship between their cloud cover predictions and the thermodynamic environment. Our results show the potential of deep learning to derive accurate yet interpretable cloud cover parameterizations from global SRMs, and suggest that neighborhood‐based models may be a good compromise between accuracy and generalizability. N2 - Plain Language Summary: Climate models, such as the ICOsahedral Non‐hydrostatic climate model, operate on low‐resolution grids, making it computationally feasible to use them for climate projections. However, physical processes –especially those associated with clouds– that happen on a sub‐grid scale (inside a grid box) cannot be resolved, yet they are critical for the climate. In this study, we train neural networks that return the cloudy fraction of a grid box knowing only low‐resolution grid‐box averaged variables (such as temperature, pressure, etc.) as the climate model sees them. We find that the neural networks can reproduce the sub‐grid scale cloud fraction on data sets similar to the one they were trained on. The networks trained on global data also prove to be applicable on regional data coming from a model simulation with an entirely different setup. Since neural networks are often described as black boxes that are therefore difficult to trust, we peek inside the black box to reveal what input features the neural networks have learned to focus on and in what respect the networks differ. Overall, the neural networks prove to be accurate methods of reproducing sub‐grid scale cloudiness and could improve climate model projections when implemented in a climate model. N2 - Key Points: Neural networks can accurately learn sub‐grid scale cloud cover from realistic regional and global storm‐resolving simulations. Three neural network types account for different degrees of vertical locality and differentiate between cloud volume and cloud area fraction. Using a game theory based library we find that the neural networks tend to learn local mappings and are able to explain model errors. UR - http://resolver.sub.uni-goettingen.de/purl?gldocs-11858/11260 ER -