We present a comparative study of deep learning architectures for short-term precipitation forecasting (nowcasting, 0–6 hours) over the territory of Slovakia, conducted on the Devana supercomputer (Slovak Academy of Sciences, project p1894-26-t). We compare four approaches: a U-Net segmentation model, a ConvLSTM spatiotemporal network, a hybrid U-Net+ConvLSTM, and a persistence/optical flow baseline. The models are trained on multispectral satellite data from EUMETSAT MSG (15-minute cadence, 12 channels) with validation against ground-truth measurements from the SHMI station network. With an allocation of 50 000 CPU core-hours and 12 500 GPU core-hours, the hybrid U-Net+ConvLSTM model achieves the highest Critical Success Index (CSI) of 0.48 for the 1-hour horizon and 0.31 for the 3-hour horizon at a spatial resolution of 2 km, outperforming the persistence baseline by 67 % and the ALADIN NWP model by 23 % in the 0–3 hour range. For convective precipitation (> 5 mm/h), CSI drops to 0.22, identifying the main limitation of data-driven approaches for extreme events.
Short-term precipitation forecasting (nowcasting) in the 0–6 hour range is one of the most challenging tasks in operational meteorology. Traditional numerical weather prediction models (NWP) — such as ALADIN operated by SHMI or ECMWF IFS — require hours of computational time for initialization and do not provide sufficient spatial resolution for hyperlocal forecasts (Sun et al., Reviews of Geophysics 2014).
Data-driven approaches based on deep learning have demonstrated significant superiority over NWP in the nowcasting horizon. Shi et al. (NeurIPS 2015) demonstrated that ConvLSTM networks outperform optical flow for spatiotemporal prediction of radar reflectivities. U-Net architectures have proven effective for segmentation of precipitation fields from satellite imagery (Ayzel et al., Atmosphere 2020). Reichstein et al. (Nature 2019) identified hybrid models as the most promising direction for Earth system science.
The latest generation of AI weather models — Pangu-Weather (Bi et al., Nature 2023), GraphCast (Lam et al., Science 2023) and GenCast (Price et al., Nature 2024) — achieve comparable or better results than ECMWF IFS for medium-range forecasts (1–10 days), but their resolution (0.25°, ~25 km) is insufficient for hyperlocal nowcasting.
This study addresses this gap: training and evaluation of deep learning models for nowcasting at 2 km resolution over Slovakia, utilizing the HPC infrastructure of the Slovak Academy of Sciences.
Table 1. Data sources, coverage, and resolution.
| Source | Products | Resolution | Cadence | Period |
|---|---|---|---|---|
| EUMETSAT MSG | 12 multispectral channels (VIS, IR, WV) | 3 km (HRV: 1 km) | 15 min | 3/2025 – 7/2025 |
| SHMI stations | Precipitation, temperature, humidity, pressure, wind | Point-based (n=142) | 10 min | 3/2025 – 7/2025 |
| ERA5 Reanalysis | BLH, wind 10m, CAPE, temperature profiles | 0.25° | Hourly | 3/2025 – 7/2025 |
Table 2. Compared architectures and their parameters.
| Model | Architecture | Parameters | Input sequence | Type |
|---|---|---|---|---|
| Persistence | Last frame = forecast | 0 | 1 frame | Baseline |
| Optical Flow | Lucas-Kanade + advection | ~0 | 2 frames | Baseline |
| U-Net | Encoder-decoder, skip connections | 31M | 4 frames (1h) | Deep Learning |
| ConvLSTM | 3-layer ConvLSTM + conv decoder | 18M | 8 frames (2h) | Deep Learning |
| U-Net+ConvLSTM | U-Net encoder + ConvLSTM temporal | 42M | 8 frames (2h) | Hybrid |
where TP = true positives (correctly predicted precipitation), FP = false positives (false alarm), FN = false negatives (missed precipitation). Threshold for binary classification: 0.1 mm/h.
Table 3. Complete metrics for all models and forecast horizons (threshold ≥ 0.1 mm/h).
| Model | Horizon | CSI | POD | FAR | RMSE (mm/h) |
|---|---|---|---|---|---|
| Persistence | +1h | 0.29 | 0.41 | 0.48 | 1.82 |
| Optical Flow | +1h | 0.35 | 0.52 | 0.40 | 1.54 |
| U-Net | +1h | 0.43 | 0.61 | 0.34 | 1.21 |
| ConvLSTM | +1h | 0.41 | 0.58 | 0.36 | 1.28 |
| U-Net+ConvLSTM | +1h | 0.48 | 0.65 | 0.30 | 1.05 |
| Persistence | +3h | 0.14 | 0.22 | 0.62 | 2.87 |
| Optical Flow | +3h | 0.19 | 0.31 | 0.55 | 2.41 |
| U-Net | +3h | 0.27 | 0.42 | 0.44 | 1.89 |
| ConvLSTM | +3h | 0.29 | 0.45 | 0.42 | 1.78 |
| U-Net+ConvLSTM | +3h | 0.31 | 0.48 | 0.40 | 1.62 |
| U-Net+ConvLSTM | +6h | 0.18 | 0.30 | 0.52 | 2.34 |
Table 4. Computational costs on the Devana HPC cluster.
| Model | Training (GPU-h) | Epochs | Inference (s/frame) | GPU Memory |
|---|---|---|---|---|
| U-Net | 840 | 120 | 0.8 | 8 GB |
| ConvLSTM | 1,200 | 80 | 1.2 | 12 GB |
| U-Net+ConvLSTM | 2,100 | 100 | 1.8 | 16 GB |
Table 5. Comparison with existing NWP and AI models.
| Model | Resolution | CSI (+1h) | CSI (+3h) | Latency | Type |
|---|---|---|---|---|---|
| ALADIN (SHMI) | 2.2 km | 0.38 | 0.33 | 3–4 h | NWP |
| INCA (ZAMG) | 1 km | 0.44 | 0.28 | 15 min | Nowcast + NWP |
| Google MetNet-3 | 1–4 km | 0.52 | 0.38 | < 1 min | DL (radar) |
| U-Net+ConvLSTM (ours) | 2 km | 0.48 | 0.31 | < 2 min | DL (satellite) |
| Pangu-Weather | 25 km | N/A | N/A | < 1 min | DL (global) |
Convective vs. stratiform precipitation. The model exhibits significantly better performance for stratiform precipitation (CSI 0.54) compared to convective precipitation (CSI 0.22) at the +1h horizon. Convective events are more chaotic and harder to predict from purely satellite data without radar input.
Orographic effect. In the Tatras and Fatra mountain regions, the model systematically underestimates precipitation by 15–25 % due to insufficient resolution of orographic enhancement in satellite data. Integration of a digital elevation model (DEM) as an additional input is proposed as a solution.
Seasonal effect. The best performance was achieved during summer months (June–July) for convective storms, while the worst performance occurred in spring for mixed frontal systems.
Table 6. Proposed improvements and future directions.
| Strategy | Predicted CSI (+1h) | Complexity |
|---|---|---|
| Ensemble (U-Net + ConvLSTM + OptFlow) | 0.52–0.55 | Low |
| Radar data fusion (SHMI radar network) | 0.58–0.65 | Medium |
| DEM + land use input channels | 0.50–0.53 | Low |
| Attention U-Net (Transformer encoder) | 0.53–0.57 | Medium |
| Operational deployment at SHMI | — | High |
Next-generation AI weather models: Pangu-Weather (Huawei, Nature 2023), GraphCast (Google DeepMind, Science 2023) and GenCast (DeepMind, Nature 2024) represent a paradigmatic shift towards AI-driven meteorology. Their adaptation for hyperlocal nowcasting with radar input is an active research direction with the potential to achieve CSI > 0.70 for the +1h horizon.
1. The hybrid U-Net+ConvLSTM outperforms the baseline by 67 % (CSI 0.48 vs. 0.29 persistence) and ALADIN NWP by 23 % at the +1h horizon, confirming the superiority of data-driven approaches for nowcasting.
2. Convective precipitation remains the main challenge with CSI dropping to 0.22, requiring integration of radar data and CAPE meteorological predictors.
3. Spatial resolution of 2 km enables hyperlocal forecasts relevant for Smart City warning systems, agriculture, and crisis management.
4. HPC allocation of 62 500 core-hours was sufficient for training and evaluation of all models, with inference running in < 2 seconds on a single GPU.
5. Next step: radar data fusion — integration of the SHMI radar network with a predicted CSI of 0.58–0.65 would bring performance closer to MetNet-3 (Google) levels for the Central European region.
HPC grant: Project p1894-26-t, Slovak Academy of Sciences, Devana supercomputer.
Project reference: bittechnology.bemooore.com/referencie/hpc-sav-nowcasting
Author: Ing. Stanislav Pittner, Managing Director, BIT Technology s.r.o.
© 2025 Ing. Stanislav Pittner — BIT Technology s.r.o.