Knowledge informed hybrid machine learning in agricultural yield prediction

Malte von Bloh, David Lobell, Senthold Asseng

Research output: Contribution to journalArticlepeer-review

Abstract

Research on yield predictions is dominated by two approaches: machine learning and process-based models. Machine learning has shown impressive results in capturing complex relationships but is often limited by data availability in agriculture. Conversely, process-based models, with over 60 years of research history, simulate crop growth processes using biophysical equations. Here, we present a method to transfer domain knowledge from the Decision Support System for Agrotechnology Transfer framework (DSSAT) using the Nwheat crop simulation process-model into neural networks and random forest for predicting wheat yield at field scale. Expanding the feature and distribution space involved simulating crop parameters and synthetic samples through the utilization of observed and historical weather recordings, as well as future climate projections. We demonstrated that neural networks can learn both general crop growth and yield processes and then effectively adapt to regional, field-specific growth patterns using synthetic and high-resolution field data. This approach boosts overall performance and reduces model error by 8 % compared to a purely data-centric model without process-knowledge transfer and solely trained on observed field data and features. Synthetic samples generated from warmer conditions were the greatest driver for improvements and we showed that the climate scenario for data generation is more important than the actual synthetic data set size. The proposed method shows the potential of combining process-based and machine-learning models, highlighting the potential to leverage the strengths of both methods in a collaborative manner.

Original languageEnglish
Article number109606
JournalComputers and Electronics in Agriculture
Volume227
DOIs
StatePublished - Dec 2024

Keywords

  • Agriculture
  • Crop yield
  • DSSAT
  • Hybrid machine learning
  • Process-based models
  • Wheat

Fingerprint

Dive into the research topics of 'Knowledge informed hybrid machine learning in agricultural yield prediction'. Together they form a unique fingerprint.

Cite this