%0 Journal article %A Rasp, Stephan %A Dueben, Peter D. %A Scher, Sebastian %A Weyn, Jonathan A. %A Mouatadid, Soukayna %A Thuerey, Nils %T WeatherBench: A Benchmark Data Set for Data-Driven Weather Forecasting %R 10.1029/2020MS002203 %R 10.23689/fidgeo-4726 %J Journal of Advances in Modeling Earth Systems %V 12 %N 11 %X Data-driven approaches, most prominently deep learning, have become powerful tools for prediction in many domains. A natural question to ask is whether data-driven methods could also be used to predict global weather patterns days in advance. First studies show promise but the lack of a common data set and evaluation metrics make intercomparison between studies difficult. Here we present a benchmark data set for data-driven medium-range weather forecasting (specifically 3–5 days), a topic of high scientific interest for atmospheric and computer scientists alike. We provide data derived from the ERA5 archive that has been processed to facilitate the use in machine learning models. We propose simple and clear evaluation metrics which will enable a direct comparison between different methods. Further, we provide baseline scores from simple linear regression techniques, deep learning models, as well as purely physical forecasting models. The data set is publicly available at https://github.com/pangeo-data/WeatherBench and the companion code is reproducible with tutorials for getting started. We hope that this data set will accelerate research in data-driven weather forecasting. %U http://resolver.sub.uni-goettingen.de/purl?gldocs-11858/9072 %~ FID GEO-LEO e-docs