Section 1: Introduction

Abstract

Accurate modelling of chemical engineering processes is essential for reliable prediction and decision-making, yet traditional analytical models are often limited by incomplete process understanding, leading to oversimplified representations of complex dynamics. Machine learning (ML) offers an alternative by learning input-output relationships directly from data, potentially capturing physical phenomena that analytical models do not represent. However, ML is constrained by limited data availability and a lack of interpretability. Physics-Informed Machine Learning (PIML) addresses these challenges by embedding a priori physical knowledge, enabling more accurate and interpretable models from limited data. This work therefore presents a comparative study on modelling limited experimental data from dynamic CO₂ chemisorption in a lab-scale packed column, through both architectural modifications and the incorporation of governing equations. In particular, architectural variants of Recurrent Neural Networks (RNN) and Neural Ordinary Differential Equations (NODE) were explored. NODE-ED-Uncoupled was identified as the best-performing variant through 100-repeat K-fold cross-validation and bootstrap aggregating, achieving a mean R² of 0.689 ± 0.101 and an MSE of 0.245 ± 0.036 under K-fold cross-validation. Governing equations were subsequently integrated as equation constraints, learnable kinetic parameters, and collocation-based supervision across the full operating space. Collocation-based supervision yielded the best results, with an R² of 0.773 ± 0.0882 and an MSE of 0.239 ± 0.070 under bootstrap aggregating.

1 Overview

Chemical engineering lies at the intersection of chemistry, physics and engineering, and is concerned with designing, optimising and scaling processes that transform raw materials into valuable products¹. As global demand for these products intensifies, chemical manufacturers are compelled to pursue more efficient process routes and better operational control². Accurate process modelling has therefore become essential for reliable prediction and informed decision-making to enhance process performance.

1.1 First Principles Modelling (FPM)

Analytical models, also referred to as First Principles Modelling (FPM), are conventionally used to help engineers understand and design chemical processes³. For example, Wang et al. (2023)⁴ employed FPMs, including linear spring-dashpot formulations alongside mass, momentum, energy, and species balances, to simulate the hydrodynamics of coal-biomass co-gasification. This enabled prediction of system behaviour in response to variations in particle size due to velocity and temperature disturbances. Similarly, Bo et al. (2025)⁵ employed FPMs such as Newton's second law, the Navier-Stokes equations, and momentum and energy balances to predict the heat transfer effects arising from wear mechanisms. Additionally, Kashid et al. (2007)⁶ applied the Navier-Stokes and convection-diffusion equations to investigate the effects of operating conditions such as viscosity on circulation patterns and mass transfer in a liquid-liquid slug flow microreactor. Collectively, these studies demonstrate moderately strong agreement between FPM predictions and experimental observations.

While these studies demonstrate the effectiveness of FPMs, chemical engineering processes tend to be complex, so rigorous models of them are often analytically intractable and computationally demanding⁷. Consequently, simplified representations are frequently adopted, leading to discrepancies between model predictions and actual system behaviour⁸. These deviations are further exacerbated by unknown variables, idealised assumptions and imposed boundary conditions. Returning to the studies above, Wang et al. (2023)⁴ reported a discrepancy of approximately 15% in outlet gas mole fraction predictions due to simplifications in bed configuration and chemical reaction modelling, while Kashid et al. (2007)⁶ observed deviations of about 30% in titration time predictions at low velocities, largely attributed to the assumption of a flat interface affecting the velocity profile. Separately, in their review of non-Newtonian fluid flow, Chhabra et al. (2001)⁹ reported average errors of 20-25% in predicting minimum fluidisation velocity, with deviations reaching up to 60% in the creeping flow regime. These papers highlight the inherent limitations of FPM in accurately capturing certain process behaviours.

1.2 Machine Learning (ML)

To address these challenges, Machine Learning (ML) has emerged as a promising alternative. It serves as a surrogate model capable of capturing complex, non-linear relationships that are difficult to describe analytically¹⁰. By learning directly from data, ML models can implicitly account for unknown or unmodelled physics¹¹. For example, Serrano et al. (2020)¹² applied Artificial Neural Networks (ANN) with Levenberg-Marquardt and Bayesian Regularisation algorithms to predict the gas composition and yield of biomass gasification as biomass properties and operating conditions varied. Similarly, Cu et al. (2025)¹³ explored ANN coupled with Particle Swarm Optimisation (ANN-PSO) to predict variations in heat transfer coefficients under changing operating conditions. Additionally, Dahlan and Suhaimi (2025)¹⁴ employed ANN in conjunction with response surface methodology and various training algorithms to model and optimise CO₂ removal processes.

Despite these advantages, ML has not gained much traction in industrial chemical engineering applications. First, ML methods are inherently data-intensive⁸, yet obtaining high-quality experimental data in chemical engineering is often costly and time-consuming. Second, ML models are frequently criticised for their lack of interpretability: they function as black boxes³, which reduces trust and hinders their direct application in process design. Third, the absence of explicit physical constraints may result in predictions that are not physically realisable¹⁵. Returning to the studies above, Serrano et al. (2020)¹² reported poor predictions for H₂ gas composition, with deviations of up to 90% from experimental data, likely due to the model's inability to distinguish measurement noise from underlying physical behaviour. Dahlan and Suhaimi (2025)¹⁴ reported that ANN was unable to quantify the individual influence of input variables, highlighting the limited interpretability of purely data-driven approaches.

1.3 Physics-Informed Machine Learning (PIML)

Given the complementary strengths and weaknesses of FPM and ML, Physics-Informed Machine Learning (PIML) has emerged as an approach that constrains the ML model to obey long-established physical laws. By unifying physics-based and data-driven approaches, PIML can potentially deliver accurate predictions that remain physically consistent while generalising beyond conventional assumptions and boundary conditions. Moreover, PIML has been reported to reduce the amount of data required for training, as demonstrated by Velioglu et al. (2025)¹⁶ in their studies on a Van de Vusse continuous stirred tank reactor and a liquid-liquid extractor.
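As a rough illustration of how a physical law can constrain a data-driven model, the sketch below combines a data misfit with a penalty on the residual of a governing equation. It is not the formulation used in this work: the first-order decay law dy/dt = -k·y, the function name `physics_informed_loss`, the `weight` parameter and the synthetic measurements are all assumptions chosen for brevity.

```python
import numpy as np

def physics_informed_loss(t, y_pred, obs_idx, y_obs, k, weight=1.0):
    """Return (total, data, physics) losses for a candidate trajectory y_pred.

    Hypothetical governing equation (an assumption for illustration):
    dy/dt = -k * y.
    """
    # Data term: mean squared error at the grid points that have measurements.
    data_loss = np.mean((y_pred[obs_idx] - y_obs) ** 2)
    # Physics term: mean squared ODE residual on the full grid, with dy/dt
    # estimated by finite differences.
    residual = np.gradient(y_pred, t) + k * y_pred
    physics_loss = np.mean(residual ** 2)
    return data_loss + weight * physics_loss, data_loss, physics_loss

# Synthetic example: four "measurements" taken from the exact solution.
k = 0.8
t = np.linspace(0.0, 5.0, 501)
obs_idx = np.array([0, 100, 250, 500])
y_obs = np.exp(-k * t[obs_idx])

# Candidate A obeys the assumed governing equation everywhere.
y_exact = np.exp(-k * t)
# Candidate B passes through the measurements but joins them with straight lines.
y_linear = np.interp(t, t[obs_idx], y_obs)

total_a, data_a, phys_a = physics_informed_loss(t, y_exact, obs_idx, y_obs, k)
total_b, data_b, phys_b = physics_informed_loss(t, y_linear, obs_idx, y_obs, k)
print(f"exact:  data={data_a:.2e}, physics={phys_a:.2e}")
print(f"linear: data={data_b:.2e}, physics={phys_b:.2e}")
```

Both candidates match the sparse measurements, so the data term alone cannot separate them; only the physics term penalises the piecewise-linear candidate between measurements. This is the sense in which physical knowledge can compensate for limited data.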

Despite its promising capabilities, PIML remains a relatively nascent field in chemical engineering, with recent studies demonstrating its applicability in only a handful of cases. For instance, the previously mentioned study by Velioglu et al. (2025)¹⁶ investigated the generalisation, state estimation and extrapolation capabilities of PIML models for stirred tank reactors and liquid-liquid extractors. Jalili et al. (2024)¹⁷ applied PIML to model the hydrodynamics and heat transfer of two-phase flow under varying flow configurations. Carranza-Abaid and Jakobsen (2022)¹⁸ incorporated FPM into structured ANN models to predict distillate and bottoms purity in flash and distillation columns. Several papers have studied the application of PIML to reaction kinetics and polymer systems: Alizadeh et al. (2025)¹⁹ analysed kinetic modelling in heavy oil hydrocracking, Wu et al. (2024)²⁰ studied a plate reactor system with a heating cylinder, and Ma et al. (2025)²¹ evaluated physics-informed neural networks for food kinetic modelling. Additionally, Ghaderi et al. (2020)²² applied a feedforward Physics-Informed Neural Network to predict inelasticity in cross-linked polymers.

While these studies demonstrate the promise of PIML in chemical engineering applications, several limitations constrain its broader adoption in this field. First, approximately 70% of reported results come from models trained and validated primarily on simulated data. This suggests that the learning process only reflects known physical relationships rather than genuinely uncovering unknown physical phenomena. Existing evaluations may therefore overstate the effectiveness of PIML relative to its performance on real systems. Second, even studies that use experimental data provide limited comparative analysis, if any, between the PIML models explored. These papers often focus solely on the ability of PIML to fit the data, rather than rigorously evaluating the reproducibility of their results through statistical significance testing. Third, the majority of existing studies focus on simple case studies that predominantly involve a single governing physical principle. For instance, many of the aforementioned works primarily examine the application of PIML to reaction kinetic modelling. In contrast, systems characterised by strong interdependencies between multiple coupled physical phenomena remain significantly underexplored, despite being representative of most real-world chemical engineering processes²³. This gap is likely attributable to the limited physical understanding of such systems. Further research into PIML capabilities in this area is therefore warranted, especially where conventional FPM approaches are inadequate.

1.4 Objective & Scope

Accordingly, the scope of this study is guided by the central research question:

“To what extent can Physics-Informed Machine Learning (PIML) models trained on limited experimental data, with varying degrees of physics incorporation, inherently improve the accuracy and precision of modelling chemical engineering systems with limited physical knowledge?”

To address this research question, the study is organised into four sections as shown in Figure 1.1.

Figure 1.1 Study Scope

First, an appropriate case study is rigorously selected to anchor the discussion (Section 2). This is followed by the design of a data acquisition methodology to ensure that the collected dataset captures key manipulated and response variables that reflect realistic process variability (Section 3). An architectural evaluation is subsequently conducted on two key model classes: Recurrent Neural Networks (RNN) and Neural Ordinary Differential Equations (NODE). Variants of these architectures are explored and compared to identify the most suitable model architecture for subsequent development (Section 4). Finally, the selected architecture is extended to incorporate the limited set of governing physical equations currently known, with varying levels of enforcement (Section 5).
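The abstract states that the architectural comparison relies on repeated K-fold cross-validation, reported as a mean ± spread of R² and MSE. The sketch below shows one plausible way such an evaluation loop could be set up. It is a minimal illustration under stated assumptions, not the procedure used in this study: the fold count `k=5`, the least-squares stand-in model, the synthetic dataset and all function names are hypothetical, and whether the reported spread is taken over folds or over repeats is not specified in this section.

```python
import numpy as np

def repeated_kfold_scores(X, y, fit, predict, k=5, repeats=100, seed=0):
    """Collect per-fold R² and MSE over `repeats` reshuffled K-fold splits."""
    rng = np.random.default_rng(seed)
    r2_scores, mse_scores = [], []
    for _ in range(repeats):
        order = rng.permutation(len(y))  # fresh shuffle for each repeat
        for test_idx in np.array_split(order, k):
            train_idx = np.setdiff1d(order, test_idx)
            model = fit(X[train_idx], y[train_idx])
            y_hat = predict(model, X[test_idx])
            ss_res = np.sum((y[test_idx] - y_hat) ** 2)
            ss_tot = np.sum((y[test_idx] - y[test_idx].mean()) ** 2)
            r2_scores.append(1.0 - ss_res / ss_tot)
            mse_scores.append(ss_res / len(test_idx))
    return np.array(r2_scores), np.array(mse_scores)

# Stand-in model (assumption): ordinary least squares with an intercept.
def fit_linear(X, y):
    A = np.column_stack([X, np.ones(len(X))])
    coef, *_ = np.linalg.lstsq(A, y, rcond=None)
    return coef

def predict_linear(coef, X):
    return np.column_stack([X, np.ones(len(X))]) @ coef

# Synthetic data (assumption): 60 samples, 3 inputs, mild noise.
rng = np.random.default_rng(42)
X = rng.normal(size=(60, 3))
y = X @ np.array([1.5, -2.0, 0.5]) + 0.3 + 0.1 * rng.normal(size=60)

# 20 repeats keep the example fast; the abstract reports 100.
r2, mse = repeated_kfold_scores(X, y, fit_linear, predict_linear, k=5, repeats=20)
print(f"R2  = {r2.mean():.3f} ± {r2.std():.3f}")
print(f"MSE = {mse.mean():.4f} ± {mse.std():.4f}")
```

Reshuffling before every repeat means each data point is held out under many different train/test partitions, which makes the reported spread less sensitive to any single split when the dataset is small.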

References & Notes

1 Wu, Z.; Wang, H.; He, C.; Zhang, B.; Xu, T.; Chen, Q. The Application of Physics-Informed Machine Learning in Multiphysics Modeling in Chemical Engineering. Industrial & Engineering Chemistry Research 2023, 62 (44), 18178-18204. https://doi.org/10.1021/acs.iecr.3c02383.
2 Mitsos, A.; Asprion, N.; Floudas, C. A.; Bortz, M.; Baldea, M.; Bonvin, D.; Caspari, A.; Schäfer, P. Challenges in Process Optimization for New Feedstocks and Energy Sources. Computers & Chemical Engineering 2018, 113, 209-221. https://doi.org/10.1016/j.compchemeng.2018.03.013.
3 Dobbelaere, M. R.; Plehiers, P. P.; Van de Vijver, R.; Stevens, C. V.; Van Geem, K. M. Machine Learning in Chemical Engineering: Strengths, Weaknesses, Opportunities, and Threats. Engineering 2021, 7 (9). https://doi.org/10.1016/j.eng.2021.03.019.
4 Du, S.; Wang, J.; Yu, Y.; Zhou, Q. https://www.sciencedirect.com/science/article/abs/pii/S096014812201713X (accessed 2026-04-05).
5 Bo, H.; Fu, Y.; Shao, Y.; Zhong, W. Effects of Particle Size Reduction due to Wear on Heat Transfer in a Fluidized Bed: A CFD-DEM Study. Particuology 2025, 103, 176-192. https://doi.org/10.1016/j.partic.2025.05.017.
6 Kashid, M. N.; Agar, D. W.; Turek, S. CFD Modelling of Mass Transfer with and without Chemical Reaction in the Liquid-Liquid Slug Flow Microreactor. Chemical Engineering Science 2007, 62 (18-20), 5102-5109. https://doi.org/10.1016/j.ces.2007.01.068.
7 de Munck, M. J. A.; Peters, E. A. J. F.; Kuipers, J. A. M. Fluidized bed-gas-solid heat transfer using a CFD-DEM coarse-graining technique. https://www.sciencedirect.com/science/article/pii/S0009250923006048 (accessed 2026-01-11).
8 Schweidtmann, A. M.; Esche, E.; Fischer, A.; Kloft, M.; Repke, J.; Sager, S.; Mitsos, A. Machine Learning in Chemical Engineering: A Perspective. Chemie Ingenieur Technik 2021, 93 (12), 2029-2039. https://doi.org/10.1002/cite.202100083.
9 Chhabra, R. P.; Comiti, J.; Machač, I. Flow of Non-Newtonian Fluids in Fixed and Fluidised Beds. Chemical Engineering Science 2001, 56 (1), 1-27. https://doi.org/10.1016/s0009-2509(00)00207-4.
10 Thebelt, A.; Wiebe, J.; Kronqvist, J.; Tsay, C.; Misener, R. Maximizing Information from Chemical Engineering Data Sets: Applications to Machine Learning. Chemical Engineering Science 2022, 252, 117469. https://doi.org/10.1016/j.ces.2022.117469.
11 Chen, Z.; Liu, Y.; Sun, H. Physics-Informed Learning of Governing Equations from Scarce Data. Nature Communications 2021, 12 (1). https://doi.org/10.1038/s41467-021-26434-1.
12 Serrano, D.; Golpour, I.; Sánchez-Delgado, S. Predicting the Effect of Bed Materials in Bubbling Fluidized Bed Gasification Using Artificial Neural Networks (ANNs) Modeling Approach. Fuel 2020, 266, 117021. https://doi.org/10.1016/j.fuel.2020.117021.
13 Cu, W.; Fang, J.; Guo, X.; Chen, K.; Zheng, N.; Xiao, B.; Wei, J. Prediction and Analysis of Bed-To-Tube Heat Transfer in Fluidized Bed Heat Exchangers Based on ANN-PSO Hybrid Approach and CFD Simulation. International Journal of Heat and Mass Transfer 2025, 252, 127490. https://doi.org/10.1016/j.ijheatmasstransfer.2025.127490.
14 Dahlan, I.; Suhaimi, M. H. M. Adsorption of CO₂ Using NaOH-Modified Nanoclay Montmorillonite Adsorbent: Comparative Analysis of RSM-Based Central Composite Design and ANN-Based Models in Modelling and Optimization. Arabian Journal for Science and Engineering 2025, 50 (24), 21011-21027. https://doi.org/10.1007/s13369-025-10401-9.
15 Wang, R.; Yu, R. Physics-Guided Deep Learning for Dynamical Systems: A Survey. arXiv.org. https://arxiv.org/abs/2107.01272.
16 Velioglu, M.; Zhai, S.; Rupprecht, S.; Mitsos, A.; Jupke, A.; Dahmen, M. Physics-Informed Neural Networks for Dynamic Process Operations with Limited Physical Knowledge and Data. Computers & Chemical Engineering 2024, 192, 108899. https://doi.org/10.1016/j.compchemeng.2024.108899.
17 Jalili, D.; Jang, S.; Jadidi, M.; Giustini, G.; Keshmiri, A.; Mahmoudi, Y. Physics-informed neural networks for heat transfer prediction in two-phase flows. https://www.sciencedirect.com/science/article/pii/S0017931023012346 (accessed 2026-01-20).
18 Carranza-Abaid, A.; Jakobsen, J. P. Neural network programming: Integrating first principles into machine learning models. https://www.sciencedirect.com/science/article/pii/S009813542200196X#sec0008 (accessed 2026-02-03).
19 Alizadeh, S.; Ta, S.; Ray, A. K.; Samavedham, L. Physics-Informed Neural Network with NSGA II and Levenberg-Marquardt Method for Kinetic Modeling in Heavy Oil Hydrocracking. Industrial & Engineering Chemistry Research 2025, 64 (40), 19624-19640. https://doi.org/10.1021/acs.iecr.5c02581.
20 Wu, Z.; Li, M.; He, C.; Zhang, B.; Ren, J.; Yu, H.; Chen, Q. Physics-Informed Learning of Chemical Reactor Systems Using Decoupling-Coupling Training Framework. AIChE Journal 2024, 70 (7). https://doi.org/10.1002/aic.18436.
21 Ma, Y.; Turan, D.; Schutyser, M. Evaluating Physics-Informed Neural Networks for Food Kinetic Modeling. 2025. https://doi.org/10.26434/chemrxiv-2025-t2r5k.
22 Ghaderi, A.; Morovati, V.; Dargazany, R. A Physics-Informed Assembly of Feed-Forward Neural Network Engines to Predict Inelasticity in Cross-Linked Polymers. Polymers 2020, 12 (11), 2628. https://doi.org/10.3390/polym12112628.
23 Narayanamurthi, M.; Sandu, A. Partitioned Exponential Methods for Coupled Multiphysics Systems. arXiv.org. https://arxiv.org/abs/1908.09434 (accessed 2026-04-05).