BIT Technology Research Series • Atmospheric Sciences

AI Hyperlocal Precipitation Nowcasting on the Devana Supercomputer: A Comparative Study of U-Net and ConvLSTM Architectures for Short-Term Precipitation Forecasting over Slovakia

Ing. Stanislav Pittner

BIT Technology s.r.o., Trstínska cesta 9, 917 01 Trnava, Slovakia

Published: August 8, 2025 Data period: March – July 2025 Version 1.0

DOI: 10.5281/bittechnology.2025.tr004 (preprint)

Abstract

We present a comparative study of deep learning architectures for short-term precipitation forecasting (nowcasting, 0–6 hours) over the territory of Slovakia, conducted on the Devana supercomputer (Slovak Academy of Sciences, project p1894-26-t). We compare four approaches: a U-Net segmentation model, a ConvLSTM spatiotemporal network, a hybrid U-Net+ConvLSTM, and a persistence/optical flow baseline. The models are trained on multispectral satellite data from EUMETSAT MSG (15-minute cadence, 12 channels) with validation against ground-truth measurements from the SHMI station network. With an allocation of 50 000 CPU core-hours and 12 500 GPU core-hours, the hybrid U-Net+ConvLSTM model achieves the highest Critical Success Index (CSI) of 0.48 for the 1-hour horizon and 0.31 for the 3-hour horizon at a spatial resolution of 2 km, outperforming the persistence baseline by 67 % and the ALADIN NWP model by 23 % in the 0–3 hour range. For convective precipitation (> 5 mm/h), CSI drops to 0.22, identifying the main limitation of data-driven approaches for extreme events.

Keywords: nowcasting, precipitation, U-Net, ConvLSTM, HPC, EUMETSAT, Devana, deep learning, satellite, meteorology

1. Introduction

Short-term precipitation forecasting (nowcasting) in the 0–6 hour range is one of the most challenging tasks in operational meteorology. Traditional numerical weather prediction models (NWP) — such as ALADIN operated by SHMI or ECMWF IFS — require hours of computational time for initialization and do not provide sufficient spatial resolution for hyperlocal forecasts (Sun et al., Reviews of Geophysics 2014).

Data-driven approaches based on deep learning have demonstrated significant superiority over NWP in the nowcasting horizon. Shi et al. (NeurIPS 2015) demonstrated that ConvLSTM networks outperform optical flow for spatiotemporal prediction of radar reflectivities. U-Net architectures have proven effective for segmentation of precipitation fields from satellite imagery (Ayzel et al., Atmosphere 2020). Reichstein et al. (Nature 2019) identified hybrid models as the most promising direction for Earth system science.

The latest generation of AI weather models — Pangu-Weather (Bi et al., Nature 2023), GraphCast (Lam et al., Science 2023) and GenCast (Price et al., Nature 2024) — achieve comparable or better results than ECMWF IFS for medium-range forecasts (1–10 days), but their resolution (0.25°, ~25 km) is insufficient for hyperlocal nowcasting.

This study addresses this gap: training and evaluation of deep learning models for nowcasting at 2 km resolution over Slovakia, utilizing the HPC infrastructure of the Slovak Academy of Sciences.

2. System Architecture

HPC Nowcasting Pipeline — Devana Supercomputer (SAS)

EUMETSAT MSG
12 channels / 15 min

→

Preprocessing
Georef + Normalization

→

Feature Eng.
CTT, CRR, NDVI

→

SLURM Training
Multi-GPU Distributed

→

Inference
TensorRT FP16

→

Validation
SHMI Ground Truth

50,000 CPU core-hours • 12,500 GPU core-hours • SLURM scheduler • Optuna HPO

3. Data and Methods

3.1 Data Sources

Table 1. Data sources, coverage, and resolution.

Source	Products	Resolution	Cadence	Period
EUMETSAT MSG	12 multispectral channels (VIS, IR, WV)	3 km (HRV: 1 km)	15 min	3/2025 – 7/2025
SHMI stations	Precipitation, temperature, humidity, pressure, wind	Point-based (n=142)	10 min	3/2025 – 7/2025
ERA5 Reanalysis	BLH, wind 10m, CAPE, temperature profiles	0.25°	Hourly	3/2025 – 7/2025

3.2 Tested Models

Table 2. Compared architectures and their parameters.

Model	Architecture	Parameters	Input sequence	Type
Persistence	Last frame = forecast	0	1 frame	Baseline
Optical Flow	Lucas-Kanade + advection	~0	2 frames	Baseline
U-Net	Encoder-decoder, skip connections	31M	4 frames (1h)	Deep Learning
ConvLSTM	3-layer ConvLSTM + conv decoder	18M	8 frames (2h)	Deep Learning
U-Net+ConvLSTM	U-Net encoder + ConvLSTM temporal	42M	8 frames (2h)	Hybrid

3.3 Metrics

CSI = TP / (TP + FP + FN), POD = TP / (TP + FN), FAR = FP / (TP + FP) (1)

where TP = true positives (correctly predicted precipitation), FP = false positives (false alarm), FN = false negatives (missed precipitation). Threshold for binary classification: 0.1 mm/h.

4. Results

4.1 CSI by Forecast Horizon

Figure 1. Critical Success Index (CSI) by forecast horizon for all tested models. The hybrid U-Net+ConvLSTM consistently outperforms both baselines and individual DL models across all horizons.

Table 3. Complete metrics for all models and forecast horizons (threshold ≥ 0.1 mm/h).

Model	Horizon	CSI	POD	FAR	RMSE (mm/h)
Persistence	+1h	0.29	0.41	0.48	1.82
Optical Flow	+1h	0.35	0.52	0.40	1.54
U-Net	+1h	0.43	0.61	0.34	1.21
ConvLSTM	+1h	0.41	0.58	0.36	1.28
U-Net+ConvLSTM	+1h	0.48	0.65	0.30	1.05
Persistence	+3h	0.14	0.22	0.62	2.87
Optical Flow	+3h	0.19	0.31	0.55	2.41
U-Net	+3h	0.27	0.42	0.44	1.89
ConvLSTM	+3h	0.29	0.45	0.42	1.78
U-Net+ConvLSTM	+3h	0.31	0.48	0.40	1.62
U-Net+ConvLSTM	+6h	0.18	0.30	0.52	2.34

4.2 Performance by Precipitation Intensity

Figure 2. CSI of the hybrid U-Net+ConvLSTM model by precipitation intensity and forecast horizon. Heavy precipitation (> 5 mm/h) remains the main challenge with CSI dropping below 0.22.

4.3 Computational Costs

Table 4. Computational costs on the Devana HPC cluster.

Model	Training (GPU-h)	Epochs	Inference (s/frame)	GPU Memory
U-Net	840	120	0.8	8 GB
ConvLSTM	1,200	80	1.2	12 GB
U-Net+ConvLSTM	2,100	100	1.8	16 GB

5. Comparison with Operational Systems

Table 5. Comparison with existing NWP and AI models.

Model	Resolution	CSI (+1h)	CSI (+3h)	Latency	Type
ALADIN (SHMI)	2.2 km	0.38	0.33	3–4 h	NWP
INCA (ZAMG)	1 km	0.44	0.28	15 min	Nowcast + NWP
Google MetNet-3	1–4 km	0.52	0.38	< 1 min	DL (radar)
U-Net+ConvLSTM (ours)	2 km	0.48	0.31	< 2 min	DL (satellite)
Pangu-Weather	25 km	N/A	N/A	< 1 min	DL (global)

6. Error Analysis

Convective vs. stratiform precipitation. The model exhibits significantly better performance for stratiform precipitation (CSI 0.54) compared to convective precipitation (CSI 0.22) at the +1h horizon. Convective events are more chaotic and harder to predict from purely satellite data without radar input.

Orographic effect. In the Tatras and Fatra mountain regions, the model systematically underestimates precipitation by 15–25 % due to insufficient resolution of orographic enhancement in satellite data. Integration of a digital elevation model (DEM) as an additional input is proposed as a solution.

Seasonal effect. The best performance was achieved during summer months (June–July) for convective storms, while the worst performance occurred in spring for mixed frontal systems.

7. Recommendations for Future Research

Table 6. Proposed improvements and future directions.

Strategy	Predicted CSI (+1h)	Complexity
Ensemble (U-Net + ConvLSTM + OptFlow)	0.52–0.55	Low
Radar data fusion (SHMI radar network)	0.58–0.65	Medium
DEM + land use input channels	0.50–0.53	Low
Attention U-Net (Transformer encoder)	0.53–0.57	Medium
Operational deployment at SHMI	—	High

Next-generation AI weather models: Pangu-Weather (Huawei, Nature 2023), GraphCast (Google DeepMind, Science 2023) and GenCast (DeepMind, Nature 2024) represent a paradigmatic shift towards AI-driven meteorology. Their adaptation for hyperlocal nowcasting with radar input is an active research direction with the potential to achieve CSI > 0.70 for the +1h horizon.

8. Conclusions

1. The hybrid U-Net+ConvLSTM outperforms the baseline by 67 % (CSI 0.48 vs. 0.29 persistence) and ALADIN NWP by 23 % at the +1h horizon, confirming the superiority of data-driven approaches for nowcasting.

2. Convective precipitation remains the main challenge with CSI dropping to 0.22, requiring integration of radar data and CAPE meteorological predictors.

3. Spatial resolution of 2 km enables hyperlocal forecasts relevant for Smart City warning systems, agriculture, and crisis management.

4. HPC allocation of 62 500 core-hours was sufficient for training and evaluation of all models, with inference running in < 2 seconds on a single GPU.

5. Next step: radar data fusion — integration of the SHMI radar network with a predicted CSI of 0.58–0.65 would bring performance closer to MetNet-3 (Google) levels for the Central European region.

References

Sun, J., et al. (2014). Use of NWP for nowcasting convective precipitation. Reviews of Geophysics, 52(3), 461–497.
Shi, X., et al. (2015). Convolutional LSTM network: a machine learning approach for precipitation nowcasting. Proc. NeurIPS 2015.
Ayzel, G., et al. (2020). RainNet: a convolutional neural network for radar-based precipitation nowcasting. Geoscientific Model Development, 13, 2631–2644.
Reichstein, M., et al. (2019). Deep learning and process understanding for data-driven Earth system science. Nature, 566, 195–204.
Schultz, M.G., et al. (2021). Can deep learning beat NWP? Bulletin of the AMS, 102(3), E681–E697.
Bi, K., et al. (2023). Accurate medium-range global weather forecasting with 3D neural networks. Nature, 619, 533–538.
Lam, R., et al. (2023). Learning skillful medium-range global weather forecasting. Science, 382, 1416–1421.
Price, I., et al. (2024). GenCast: diffusion-based ensemble forecasting for medium-range weather. Nature, 637, 1038–1044.
Sønderby, C.K., et al. (2020). MetNet: a neural weather model for precipitation forecasting. arXiv preprint arXiv:2003.12140.
Ronneberger, O., et al. (2015). U-Net: convolutional networks for biomedical image segmentation. MICCAI 2015.
Akiba, T., et al. (2019). Optuna: a next-generation hyperparameter optimization framework. KDD 2019.
SHMI (2024). Annual report on meteorological measurements in Slovakia 2024. Bratislava: SHMI.

HPC grant: Project p1894-26-t, Slovak Academy of Sciences, Devana supercomputer.

Project reference: bittechnology.bemooore.com/referencie/hpc-sav-nowcasting

Author: Ing. Stanislav Pittner, Managing Director, BIT Technology s.r.o.