This review examines recent advances in the application of machine learning to ocean data assimilation, covering contributions published between 2020 and 2025. We identify emerging trends, recurring limitations, and critical open questions, structuring the discussion around four scientific challenges: observation integration, boundary treatment, fine-scale process representation, and physical consistency. While convolutional neural networks remain widely used, particularly in bias correction and super-resolution tasks, recent studies increasingly employ multilayer perceptrons, long short-term memories, transformers and neural operators for error estimation, sequential bias correction, and latent-space assimilation. Despite this architectural diversity, most contributions remain confined to idealized configurations or offline modules, with limited evidence of generalization and integration into operational pipelines. We conclude that hybrid systems combining embedded physical knowledge with systematic validation across different oceanic regimes will be essential to unlock the full potential of machine learning-enhanced ocean data assimilation.

Machine learning in ocean data assimilation: Advances, gaps and the road to operations

Buizza, Roberto;
2026-01-01

Abstract

This review examines recent advances in the application of machine learning to ocean data assimilation, covering contributions published between 2020 and 2025. We identify emerging trends, recurring limitations, and critical open questions, structuring the discussion around four scientific challenges: observation integration, boundary treatment, fine-scale process representation, and physical consistency. While convolutional neural networks remain widely used, particularly in bias correction and super-resolution tasks, recent studies increasingly employ multilayer perceptrons, long short-term memories, transformers and neural operators for error estimation, sequential bias correction, and latent-space assimilation. Despite this architectural diversity, most contributions remain confined to idealized configurations or offline modules, with limited evidence of generalization and integration into operational pipelines. We conclude that hybrid systems combining embedded physical knowledge with systematic validation across different oceanic regimes will be essential to unlock the full potential of machine learning-enhanced ocean data assimilation.
2026
File in questo prodotto:
File Dimensione Formato  
Grande_etal_2026_title_abs.png.pdf

accesso aperto

Tipologia: Documento in Pre-print/Submitted manuscript
Licenza: Dominio pubblico
Dimensione 379.98 kB
Formato Adobe PDF
379.98 kB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11382/588612
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
social impact