This review examines recent advances in the application of machine learning to ocean data assimilation, covering contributions published between 2020 and 2025. We identify emerging trends, recurring limitations, and critical open questions, structuring the discussion around four scientific challenges: observation integration, boundary treatment, fine-scale process representation, and physical consistency. While convolutional neural networks remain widely used, particularly in bias correction and super-resolution tasks, recent studies increasingly employ multilayer perceptrons, long short-term memories, transformers and neural operators for error estimation, sequential bias correction, and latent-space assimilation. Despite this architectural diversity, most contributions remain confined to idealized configurations or offline modules, with limited evidence of generalization and integration into operational pipelines. We conclude that hybrid systems combining embedded physical knowledge with systematic validation across different oceanic regimes will be essential to unlock the full potential of machine learning-enhanced ocean data assimilation.
Machine learning in ocean data assimilation: Advances, gaps and the road to operations
Buizza, Roberto;
2026-01-01
Abstract
This review examines recent advances in the application of machine learning to ocean data assimilation, covering contributions published between 2020 and 2025. We identify emerging trends, recurring limitations, and critical open questions, structuring the discussion around four scientific challenges: observation integration, boundary treatment, fine-scale process representation, and physical consistency. While convolutional neural networks remain widely used, particularly in bias correction and super-resolution tasks, recent studies increasingly employ multilayer perceptrons, long short-term memories, transformers and neural operators for error estimation, sequential bias correction, and latent-space assimilation. Despite this architectural diversity, most contributions remain confined to idealized configurations or offline modules, with limited evidence of generalization and integration into operational pipelines. We conclude that hybrid systems combining embedded physical knowledge with systematic validation across different oceanic regimes will be essential to unlock the full potential of machine learning-enhanced ocean data assimilation.| File | Dimensione | Formato | |
|---|---|---|---|
|
Grande_etal_2026_title_abs.png.pdf
accesso aperto
Tipologia:
Documento in Pre-print/Submitted manuscript
Licenza:
Dominio pubblico
Dimensione
379.98 kB
Formato
Adobe PDF
|
379.98 kB | Adobe PDF | Visualizza/Apri |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

