Overview

In this section, NODE-ED-Uncoupled, identified in Section 4 is further refined through the incorporation of physics-guided enhancements for CO2 prediction, as outlined in Table 5.1. The training procedure follows the methodology described in Section 4.5.

Table 5.1 Physics-guided Enhancements Evaluated
NODE-ED-Uncoupled
Steady State Constraint with Parameter Discovery (SS-PD)
Steady State Collocation with Parameter Discovery (SS-C-PD)

Steady State Constraint with Parameter Discovery (SS-PD)

To address the first hypothesis that NODE-ED-Uncoupled converges to an incorrect final state, SS-PD approach is employed. At final state, the system can be expressed using 3 algebraic equations (3 different zones mentioned in Section 2): h1 (SRZ), h2 (IRZ) , h3 (PAZ) without excessive simplifying assumptions. By incorporating the final-state constraint, the model is guided towards physically consistent end states, thereby potentially putting lesser emphasis on the noises in earlier time steps.

However, the final state governing equation contains mass transfer empirical parameters: K0GA, k0LA, kgA that are not directly measurable. To address this, a parameter discovery framework is adopted, wherein these parameters are treated as learnable variables and optimised during training.

To implement this, an additional loss term, referred to as the SS-PD loss, LSS-PD is incorporated alongside the existing MSE loss, LMSE , at time = 16 mins as defined below. The governing equations are described in Figure 5.1 and Table 5.2.

At time = 16 mins,

Heights of Zones in Chemisorption Column diagram
Figure 5.1 Heights of Zones in Chemisorption Column
Table 5.2 Definition of Governing Equations
Definition of Governing Equations Table

Given that heights of all zones = height of entire column = 42.5cm,

Equation Working

Steady State Collocation with Parameter Discovery (SS-C-PD)

To address the second hypothesis that NODE-ED-Uncoupled has limited experimental data to learn the full range of interactions of the zones, SS-C-PD approach is employed. Figure 5.2 showcases the training procedure of SS-C-PD.

Flowchart of SS-C-PD Training Procedure Diagram
Figure 5.2 Flowchart of SS-C-PD Training Procedure

In this approach, collocation points are generated across the same input space. Specifically, 2.500 points (arranged as a 50 x 50 grid) are sampled within the defined sample range to provide a dense representation of the zones at fine resolution. These generated inputs are then evaluated using the governing equation with parameter discovery described in Section 5.1 for SS-PD. During training, at every epoch, the model is evaluated on a randomly sampled mini-batch of 100 collocation points from the bulk 2,500 points to reduce computational costs. This stochastic sampling strategy progressively improves domain coverage, where certain zones are under-represented while preventing overfitting to a fixed set of collocation locations as shown in Figure 5.3.

Experimental VS Collocation Sampling Diagram
Figure 5.3 Experimental VS Collocation Sampling
Equation Working Diagram

Analysis of results for Physics Guided Enhancements

Table 5.3 Performance of Physics Guided Enhancement with 100 K-fold
Metric Physics Guided Enhancement NODE-ED-Uncoupled (Basecase)
SS-PD SS-C-PD
0.694 ± 0.101 0.697 ± 0.100 0.689 ± 0.101
MSE 0.239 ± 0.035 0.233 ± 0.0314 0.245 ± 0.034
Table 5.4 Statistical Test of Physics Guided Enhancement with 100 K-fold
Model Pair Metric Significance Test
Paired t-test Cohen's d *
NODE-ED-Uncoupled Vs SS-PD 1.32 x 10-1 -0.152
MSE 5.14 x 10-3 0.286
NODE-ED-Uncoupled Vs SS-C-PD 6.45 x 10-3 -0.278
MSE 3.96 x 10-8 0.596

Note: The sign of Cohen's d is defined with respect to the first model listed in each pair. Positive value indicates higher metric value for 1st model, negative value indicates a higher value for 2nd model

As shown in Table 5.3, SS-PD only achieved a marginal improvement over NODE-ED-Uncoupled, with a mean R2 of 0.694 against 0.689 with comparable standard deviations of 0.101 across both models and a slightly lower MSE of 0.239 ± 0.035 against 0.245 ± 0.034. The statistical tests conducted afterwards in Table 5.4 highlight, however, inconclusive results. The paired t-test for R2 yields a p-value above 0.05, indicating that the difference in explained variance is not distinguishable from random model initialisations. Notably, the MSE comparison between the model does have a significant difference at a p-value of 5.14 x 10-3, though the Cohen's d of 0.286 suggests only a small practical improvement. The difference in these two results can be attributed to each of the metrics' intrinsic method of calculation. R2 is a normalised measure of variance that is inherently less sensitive to small absolute improvements while MSE captures the absolute error incurred between the model's prediction and the test samples. Hence, the statistically significant MSE reduction suggests a true, albeit minor, reduction in absolute prediction error that R2 is not sensitive enough to detect.

Despite the minor reduction in MSE, the performance of SS-PD is practically still comparable to NODE-ED-Uncoupled. It was previously recognised that the NODE-ED-Uncoupled architecture exhibits inaccuracy in mapping the final state of the trajectory. While the incorporation of governing equations with a better prediction is expected to enhance predictive performance, this improvement is not observed. It is hypothesised that although the model may achieve a better fit at the final time point (t = 16), the trajectory leading to this state may exhibit increased discrepancies at earlier time steps. Consequently, these opposing effects offset one another, resulting in no significant overall improvement in model performance.

Meanwhile, the benefit of broader coverage from collocation points is reflected in the SS-C-PD results. The model achieved an R2 of 0.697 ± 0.100, with a paired t-test confirming the small improvement over NODE-ED-Uncoupled is statistically significant at a p-value of 6.45 x 10-3 with a Cohen's d of 0.278. The reduction in MSE is more pronounced, with SS-C-PD achieving 0.233 ± 0.031, a reduction in both the absolute error and variability. The MSE reduction is also supported via statistically significant p-value of 3.96 x 10-8 and a moderate Cohen's d of 0.596. The stronger effect size on MSE relative to R2 suggests that SS-C-PD is able to better generalise on more difficult test samples. This progression from SS-PD to SS-C-PD therefore demonstrates that it is not the physics constraint itself that is limiting, but rather the narrow operating range over which the physics is applied to and that extending physics enhancement across the entirety of the operating space is able to produce meaningful generalisation improvement.

A possible explanation lies in the limited representation of the three distinct zones (SRZ, IRZ, and PAZ) occurring simultaneously within the column, which is captured in only 6 out of the 30 data points. The majority of the data instead reflects conditions where only one or two zones are present. Consequently, when evaluating test cases involving the coexistence of all three zones, the model exhibits poor predictive performance due to this underrepresentation. The introduction of collocation points addresses this limitation by enriching the dataset with the missing three regime information, thereby enabling improved model training and predictive capability.

Table 5.5 Bagging Ensemble Performance with & without SS-C-PD for NODE-ED-Uncoupled
Metric Model
Without SS-C-PD With SS-C-PD
0.765 ± 0.0800 0.773 ± 0.0882
MSE 0.241 ± 0.106 0.239 ± 0.070
Table 5.6 Statistical Tests comparing with & without SS-C-PD for NODE-ED-Uncoupled with 100 Bagging Models
Model Pair Metric Significance Test Model Selection
Paired t-test Cohen's d *
With VS without SS-C-PD for NODE-ED-Uncoupled 6.41 x 10-5 0.588 NODE-ED-Uncoupled with SS-C-PD
MSE 5.59 x 10-3 -0.564

Note: The sign of Cohen's d is defined with respect to the first model listed in each pair. Positive value indicates higher metric value for 1st model, negative value indicates a higher value for 2nd model

Finally, the bagging results in Table 5.5 and 5.6 indicate that SS-C-PD improves performance over NODE-ED-Uncoupled, achieving R2 = 0.773 and MSE = 0.239, compared to without SS-C-PD of 0.765 and 0.241, respectively. Statistical testing confirms that this improvement is not due to chance, with a moderate Cohen's d indicating a meaningful effect size