Abstract
Research on yield prediction is dominated by two approaches: machine learning and process-based models. Machine learning has shown impressive results in capturing complex relationships but is often limited by data availability in agriculture. Process-based models, by contrast, build on more than 60 years of research and simulate crop growth using biophysical equations. Here, we present a method to transfer domain knowledge from the Nwheat crop simulation model, run within the Decision Support System for Agrotechnology Transfer (DSSAT) framework, into neural networks and random forests for predicting wheat yield at field scale. To expand the feature and distribution space, we simulated crop parameters and generated synthetic samples from observed and historical weather records as well as future climate projections. We demonstrated that neural networks can learn general crop growth and yield processes and then adapt effectively to regional, field-specific growth patterns, using synthetic and high-resolution field data. This approach reduces model error by 8 % compared to a purely data-centric model trained solely on observed field data and features, without process-knowledge transfer. Synthetic samples generated under warmer conditions were the greatest driver of improvement, and we showed that the climate scenario used for data generation matters more than the size of the synthetic data set. The proposed method demonstrates the potential of combining process-based and machine-learning models to leverage the strengths of both.
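One common way to realize this kind of knowledge transfer is to pretrain a network on simulated samples and then fine-tune it on a small set of field observations. The sketch below illustrates that idea only and is not the authors' implementation: `toy_simulator` is a hypothetical stand-in for a process-based model (it is not DSSAT-Nwheat), the two features, the "field" relationship, the network size, and all hyperparameters are assumptions, and every number is made up for the illustration.

```python
import numpy as np

rng = np.random.default_rng(0)


def toy_simulator(x):
    """Hypothetical stand-in for a process-based crop model (not DSSAT-Nwheat).

    x[:, 0] is a temperature-like feature and x[:, 1] a rainfall-like
    feature, both scaled to [0, 1]. Returns a made-up yield in t/ha.
    """
    return 4.0 + 3.0 * x[:, 1] - 2.0 * (x[:, 0] - 0.5) ** 2


def field_yield(x):
    """Assumed 'real' field response: a shifted, noisy version of the simulator."""
    return 0.8 * toy_simulator(x) - 0.5 + rng.normal(0.0, 0.2, len(x))


def init_params(n_in, n_hidden):
    """Random weights for a network with one tanh hidden layer."""
    return {
        "W1": rng.normal(0.0, 0.5, (n_in, n_hidden)),
        "b1": np.zeros(n_hidden),
        "W2": rng.normal(0.0, 0.5, (n_hidden, 1)),
        "b2": np.zeros(1),
    }


def predict(p, x):
    h = np.tanh(x @ p["W1"] + p["b1"])
    return (h @ p["W2"] + p["b2"]).ravel()


def train(p, x, y, lr, epochs):
    """Full-batch gradient descent on mean squared error; returns updated copies."""
    p = {k: v.copy() for k, v in p.items()}
    n = len(y)
    for _ in range(epochs):
        h = np.tanh(x @ p["W1"] + p["b1"])
        pred = (h @ p["W2"] + p["b2"]).ravel()
        g = (2.0 / n) * (pred - y)[:, None]        # dLoss/dpred, shape (n, 1)
        grads = {"W2": h.T @ g, "b2": g.sum(axis=0)}
        gh = (g @ p["W2"].T) * (1.0 - h ** 2)      # backpropagate through tanh
        grads["W1"] = x.T @ gh
        grads["b1"] = gh.sum(axis=0)
        for k in p:
            p[k] -= lr * grads[k]
    return p


def rmse(p, x, y):
    return float(np.sqrt(np.mean((predict(p, x) - y) ** 2)))


# Large synthetic set from the simulator, small observed field set.
x_syn = rng.uniform(0.0, 1.0, (2000, 2))
y_syn = toy_simulator(x_syn)
x_field = rng.uniform(0.0, 1.0, (20, 2))
y_field = field_yield(x_field)
x_test = rng.uniform(0.0, 1.0, (200, 2))
y_test = field_yield(x_test)

p0 = init_params(n_in=2, n_hidden=8)
rmse_init_syn = rmse(p0, x_syn, y_syn)

# Step 1: pretrain on simulated samples to learn the general response.
p_pre = train(p0, x_syn, y_syn, lr=0.05, epochs=1000)
rmse_pre_syn = rmse(p_pre, x_syn, y_syn)
rmse_pre_field = rmse(p_pre, x_test, y_test)

# Step 2: fine-tune on the few field samples with a smaller learning rate.
p_ft = train(p_pre, x_field, y_field, lr=0.01, epochs=500)
rmse_ft_field = rmse(p_ft, x_test, y_test)

print("RMSE on held-out field data, pretrained only:", round(rmse_pre_field, 3))
print("RMSE on held-out field data, after fine-tuning:", round(rmse_ft_field, 3))
```

The smaller learning rate in the second step is a design choice intended to keep the weights close to what was learned from the simulated data while still correcting for the field-specific shift. The toy example says nothing about the size of the gain on real data.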
Original language | English |
---|---|
Article number | 109606 |
Journal | Computers and Electronics in Agriculture |
Volume | 227 |
State | Published - Dec 2024 |
Keywords
- Agriculture
- Crop yield
- DSSAT
- Hybrid machine learning
- Process-based models
- Wheat