Using gene expression programming to discover macroscopic governing equations hidden in the data of molecular simulations

Haoyun Xing, Jun Zhang, Wenjun Ma, Dongsheng Wen

Research output: Contribution to journalArticlepeer-review

8 Scopus citations

Abstract

The unprecedented amount of data and the advancement of machine learning methods are driving the rapid development of data-driven modeling in the community of fluid mechanics. In this work, a data-driven strategy is developed by the combination of the direct simulation Monte Carlo (DSMC) method and the gene expression programming (GEP) method. DSMC is a molecular simulation method without any assumed macroscopic governing equations a priori and is employed to generate data of flow fields, while the enhanced GEP method is leveraged to discover governing equations. We first validate our idea using two benchmarks, such as the Burgers equation and Sine-Gordon equation. Then, we apply the strategy to discover governing equations hidden in the complex fluid dynamics. Our results demonstrate that in the continuum regime, the discovered equations are consistent with the traditional ones with linear constitutive relations, while in the non-continuum regime such as shock wave, the discovered equation comprises of high-order constitutive relations, which are similar to those in the Burnett equation but with modified coefficients. Compared to the Navier-Stokes-Fourier equations and the Burnett equation, the prediction of the viscous stress and heat flux in the shock wave via the presented data-driven model has the best match to the DSMC data. It is promising to extend the proposed data-driven strategy to more complex problems and discover hidden governing equations which may be unknown so far.

Original languageEnglish
Article number057109
JournalPhysics of Fluids
Volume34
Issue number5
DOIs
StatePublished - 1 May 2022
Externally publishedYes

Fingerprint

Dive into the research topics of 'Using gene expression programming to discover macroscopic governing equations hidden in the data of molecular simulations'. Together they form a unique fingerprint.

Cite this