Marine Ecology Progress Series

MEPS 664:1-22 (2021)  -  DOI:

Using machine learning to link spatiotemporal information to biological processes in the ocean: a case study for North Sea cod recruitment

Bernhard Kühn*, Marc H. Taylor, Alexander Kempf

Thünen Institute of Sea Fisheries, Herwigstraße 31, 27570 Bremerhaven, Germany
*Corresponding author:

ABSTRACT: Marine organisms are subject to environmental variability on various temporal and spatial scales, which affects processes related to growth and mortality of different life stages. Marine scientists are often faced with the challenge of identifying environmental variables that best explain these processes, which, given the complexity of the interactions, can be like searching for a needle in the proverbial haystack. Even after initial hypothesis-based variable selection, a large number of potential candidate variables can remain if different lagged and seasonal influences are considered. To tackle this problem, we propose a machine learning framework that incorporates important steps in model building, ranging from environmental signal extraction to automated variable selection and model validation. Its modular structure allows for the inclusion of both parametric and machine learning models, such as random forest. Unsupervised feature extraction via empirical orthogonal functions (EOFs) or self-organising maps (SOMs) is demonstrated as a way to summarize spatiotemporal fields for inclusion in predictive models. The proposed framework offers a robust way to reduce model complexity through a multi-objective genetic algorithm (NSGA-II) combined with rigorous cross-validation. We applied the framework to recruitment of the North Sea cod stock and investigated the effects of sea surface temperature (SST), salinity and currents on the stock via a modified version of random forest. The best model (5-fold CV r² = 0.69) incorporated spawning stock biomass and EOF-derived time series of SST and salinity anomalies acting through different seasons, likely relating to differing environmental effects on specific life-history stages during the recruitment year.
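
As an illustration of the EOF-based feature extraction mentioned above, the following is a minimal Python sketch and not the authors' implementation. It assumes a gridded field already flattened to a (time x space) matrix, uses synthetic data in place of real SST, and omits steps a real analysis would likely need (for example area weighting, detrending or handling of land cells). The function name `eof_decompose` and all variable names are hypothetical.

```python
import numpy as np


def eof_decompose(field, n_modes=3):
    """Hypothetical helper: EOF analysis of a (n_time, n_space) field via SVD.

    Returns the leading principal-component time series, the matching
    spatial patterns (EOFs), and the fraction of variance each explains.
    """
    # Anomalies: remove the temporal mean at every grid cell
    anomalies = field - field.mean(axis=0, keepdims=True)

    # Thin SVD: anomalies = u @ diag(s) @ vt
    u, s, vt = np.linalg.svd(anomalies, full_matrices=False)

    pcs = u[:, :n_modes] * s[:n_modes]      # time series, shape (n_time, n_modes)
    eofs = vt[:n_modes]                     # spatial patterns, shape (n_modes, n_space)
    explained = (s ** 2) / np.sum(s ** 2)   # variance fraction per mode
    return pcs, eofs, explained[:n_modes]


# Synthetic stand-in for an SST field: one spatial pattern whose amplitude
# varies from year to year, plus weak noise (assumed data, not real SST).
rng = np.random.default_rng(0)
n_years, n_cells = 40, 200
pattern = np.sin(np.linspace(0.0, np.pi, n_cells))
signal = rng.normal(size=n_years)
sst = np.outer(signal, pattern) + 0.1 * rng.normal(size=(n_years, n_cells))

pcs, eofs, explained = eof_decompose(sst, n_modes=3)
```

Each column of `pcs` is a single time series summarizing the whole field, which is the form in which such a signal could enter a recruitment model as a candidate predictor alongside spawning stock biomass.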

KEY WORDS: Machine learning · Multi-objective genetic algorithm · Empirical orthogonal function · EOF · Self-organising map · SOM · Random forest · Extreme randomized trees · Environmental stock-recruitment relationships · North Sea

Cite this article as: Kühn B, Taylor MH, Kempf A (2021) Using machine learning to link spatiotemporal information to biological processes in the ocean: a case study for North Sea cod recruitment. Mar Ecol Prog Ser 664:1-22.
