The Short-Term Explicit Prediction (STEP) Program is a multi-NCAR Laboratory activity with the overarching goal to improve the short-term (0-36 hours) forecasting of high-impact weather events such as severe thunderstorms (heavy rain, tornados, downburst, flash flood, lightning and hail), winter storms (snow, freezing rain and drizzle), and hurricanes. The STEP program emphasizes several research areas that are crucial for advancing the science and application of the short-term prediction of high-impact weather, through collaborative effort incorporating national and international scientists, engineers, and operational personnel from universities, government institutions and the private sector. Most of the forecasting/nowcasting systems and analysis tools developed under STEP are available to the communities for the support of research and real-time operations.
In FY19, RAL’s STEP effort emphasized development of nowcasting techniques based on advanced data assimilation and machine learning techniques, improving microphysics scheme for more skillful mail prediction, further improvement of WRF-hydro system. All three research areas are crucial components for the integrated Hydromet Prediction System (Figure 1). The overarching objective of these research efforts are to advance the prediction of heavy rainfall, flash floods and streamflow through the integration of state-of-the-art rainfall estimation, precipitation forecasting/nowcasting, and hydrology modeling techniques into one seamless system.
In the past year, efforts to improve data assimilation and nowcasting systems have been focused on evaluation of data from the recent STEP hydromet test bed experiment and from the PECAN field campaign. Some additional work supported the RELAMPAGO field campaign.
RAL has evaluated the large collection of WRF 0-24 hour forecasts of convective precipitation in comparison to the EOL radar QPE product, and the NOAA MRMS precipitation product. In these evaluations, the Method for Object based Diagnostic Evaluation (MODE) and MODE-Time Domain (MODE-TD) tools have been used to characterize more than just the error in precipitation fields. These tools have been used to identify the spatial structure of the storms and evaluate weaknesses in the prediction of storm size, magnitude, location, and translation. In addition, raw grid-point statistical characteristics (e.g. mean square error and correlation) have been computed over the larger great plains domain. In this region, we identified regions with weaker correlation between model and observed precipitation closer to the Rocky Mountain front range, and maximums in predictability (correlations up to 0.9) occurring in western Kansas. Other patterns identified include a known tendency of WRF to over predict the number of light precipitation events, and under predict the number of heavy (>15mm/hr) events. The MODE object based verification analysis showed that errors in position, translation and intensity, had persistence for up to 6 hours (Figure 2). This implies corrections to the position and translation of a convective system can be derived from comparisons to radar observations and applied to future WRF forecast period.
Working with the streamflow forecasting group, several major case studies were identified for further investigation. These cases include two major floods along Cherry Creek near Parker, one of which (June 13, 2016) resulted in streamflow two orders of magnitude greater than the background flow (Figure 3). Comparisons to streamflow are challenging because river systems are commonly modified by human infrastructure, and finding a basin with minimal human modifications, an operational stream, and major flooding events in the STEP hydromet test period was difficult. The WRF forecasts for this period were shown to predict the evolution of a thunderstorm in the region, but failed to locate it with the correct intensity over the water shed. The WRF-Hydro model was shown to be able to reproduce a major flood when reliable precipitation data were available, although some errors in timing were consistent regardless of the precipitation event, and work is ongoing to examine the tradeoff between hydrologic model parameters, precipitation errors to identify an ensemble of streamflow forecasts that can be used to make a more statistically reliable flood forecast.
In addition, RAL has completed the analysis of the refractivity fields derived from radar observations for the duration of the PECAN experiment and drafted a paper for publications. This work has been presented, at the AMS radar conference in Japan in September 2019, and the paper will be an important contribution to the literature.
In the past year efforts on data assimilation focused on the assimilation of new observations and development of a hybrid system that merges WRFDA with DART based on an ensemble variational data assimilation framework (EnVAR). A prototype of the hybrid system is now ready for further testing. Recently we have started to apply the system to convective-scale radar data assimilation.
Studies on the assimilation of new observations include development of a technique for lightning data assimilation and evaluate the impact of rainfall (QPE) on convective-scale data assimilation and forecasting. Both types of new observations were simultaneously assimilated with radar observations and their added benefits were evaluated.
The lightning data assimilation method attempts to obtain updraft information from the observed lightning flash rate. Since the lightning flash rate is only correlated with the maximum updraft in an air column, a vertical velocity profile is needed to map the updraft in the whole column. We tested a few approaches to obtain the profile and evaluated their impact on the lightning data assimilation. Another challenge in lightning data assimilation is to understand the representativeness of the lightning observations. Since the lightning flash is an instantaneous and point quantity but NWP models have much coarser resolution spatially and temporally, it is important to understand the representative error of the model. We used a method to match the scale of the lightning data to that of the NWP model such that the lightning data can be effectively assimilated into the model. Our results show that the lightning data assimilation in combination with radar data significantly improves the convective-scale analysis and very-short-term forecasting. The new lightning data assimilation method have been implemented to both WRFDA 3DVar and the 4DVar-based nowcasting system FINECAST. Figure 4 shows the positive impact of the lightning data assimilation from FINECAST by comparing the combined radar and lightning data assimilation experiment (RAD+LTN) with the radar alone experiment (RAD). The RAD+LTN experiment successfully predicts the newly initiated storm cell in front of a decaying convective system. The 2 hour forecast skill is significantly increased.
The assimilation of QPE data were conducted using WRFDA 4DVar. A case of MCS embedded within a Meiyu front that affected Taiwan with heavy rain and severe flood was used for the study. Our study shows that while radar observations are crucial to analyze the air motion and its associated convergence through data assimilation, it has not positive effect (sometimes negative effect) on the humidity field. We further demonstrated that the assimilation of QPE data can greatly improve the low- to mid-level humidity analysis, hence result in more skillful short-term heavy rain forecasts. Figure 5 shows that the addition of the rainfall data assimilation to radar improves the pattern, location, and intensity of the rainband over the Taiwan island.
Figure 6 compares the humidity analysis fields between the radar alone experiment and the radar plus rainfall experiment. The impact of the added rainfall assimilation on the humidity analysis in the rainband region is significant, which is the main reason for the improved heavy rain forecast.
In the past year, we also further improved the performance of FINECAST including the improvement of surface data assimilation and terrain scheme, and adding a scheme to blend the advection only forecast and the full-model forecast to address the issue of model spin up.
In FY17-FY18, a multi-moment graupel/hail category was added in the Thompson microphysics scheme. For this multi-moment graupel/hail category, number concentration and bulk volume mixing ratio are now predicted variables, in addition to the bulk mass mixing ratio. This allows for the graupel density to vary in space and time and be diagnosed from the new predicted variables. Based upon initial testing of the new scheme, the code was refined and prepared for use in a real-time forecast model that was run during the RELAMPAGO field campaign in FY19. In addition, hail pads were deployed during RELAMPAGO in order to obtain ground truth hail size measurements.
The original Thompson single-moment graupel/hail category only predicted graupel/hail mass mixing ratio, however it diagnosed the y-intercept of the size distribution allowing it to vary and not be prescribed as a constant value. A recent study was published by Field et al. (2019) that showed observed relationships between the slope and y-intercept parameters of hail size distributions based upon in situ hail measurements from 18 flights of the T-28 armored aircraft between 1995-2003. This study also revealed that the original Thompson single-moment graupel/hail diagnostic y-intercept values were far too large compared to observed values. Therefore, using these data as guidance, new diagnostic relationships were developed and tested. This work in ongoing and in parallel with the multi-moment graupel/hail scheme testing and evaluation, by using the same case studies and observations to compare with both simulations (Figs. 7-8).
In order to evaluate impacts on storm structure, QPF, and, in particular, storm evolution due to physics parameterizations, including updates made to the Thompson microphysics scheme, an object-based evaluation tool that tracks storms over time is needed. The Method for Object-based Diagnostic Evaluation (MODE)-Time Domain (TD) is a tool developed at NCAR for such analyses, allowing users to set several parameters to evaluate storms with certain spatial scales and intensities. Moreover, there are several parameters that users can set that impact how observed and modeled objects are matched for comparative evaluation metrics. In FY18-19, sensitivity tests were performed that varied the matching parameters to quantify the impact of these choices on the evaluation outcomes. This effort has established methods for how to apply MODE-TD to quantify impacts of various physics choices, with specific emphasis on the new multi-moment graupel/hail category, and in order to determine baseline performance metrics.