2,120
Views
0
CrossRef citations to date
0
Altmetric
Research Article

Physics-based and data-driven hybrid modeling in manufacturing: a review

, , ORCID Icon, ORCID Icon, ORCID Icon & ORCID Icon
Article: 2305358 | Received 30 Jun 2023, Accepted 04 Jan 2024, Published online: 18 Jan 2024

ABSTRACT

Manufacturing, an industry set in the physical world, is undergoing its digital transformation, also known as the fourth industrial revolution. Sensors, connectivity, and platforms provide unprecedented access to quantify the quality and diversity of manufacturing data. Progress in data-driven modeling is exponential across all industries. This leads to the question of how physics-based and data-driven modeling can be utilized in a hybrid modeling approach to advance our understanding of processes, materials, and systems in manufacturing. In this review, we focus on discrete manufacturing based on the understanding that hybrid modeling is more mature in process manufacturing. This paper aims to provide an overview of projects where hybrid modeling was used in manufacturing and introduce various ways of composing hybrid models. We provide examples highlighting the implementation of models, structure and expand on metrics to test and validate hybrid models, discuss challenges, and future research directions of hybrid modeling in manufacturing.

1. Introduction

Manufacturing as an industry has embraced data and machine learning applications for a long time Harding et al. (Citation2006). Manufacturing operations and their dynamics are diverse and complex across the supply chain. Thus, various machine learning (ML) approaches should be adapted for different processes and problems to enable smart manufacturing systems. ML models rely on available data to produce reliable, high-quality, value-adding results. These data-driven models have proven especially useful and powerful across industries and are incredibly transformative in data-rich (high quantity) and knowledge-sparse areas.

However, manufacturing is centered around the physical transformation of materials at its core. There is a long history of dedicated work to understand the physics behind manufacturing operations.

For example, we can understand the significance of parameters influencing grinding processes using detailed first-principle equations to determine specific grinding energy Sutherland et al. (Citation2018). Physics-based models make certain assumptions, thus making them challenging to work with (e.g. insignificance of parameters, idealized constants), while data-driven methods have issues with data volume, variety, velocity, and veracity. Data-driven models also require a sufficient amount of data from all classes for training. This can be problematic in manufacturing as operators work to eliminate defects, so defect data may be hard to come by. Many machine learning algorithms perform better with balanced datasets (e.g. 50% good, 50% bad parts) Wuest et al. (Citation2016). Yet, this is not a realistic approach in all situations, as a manufacturing process resulting in 50% scrap is far from competitive.

Physics-informed hybrid machine learning approach or hybrid modeling, which is defined as the combined use of physics-based and data-driven models to achieve more accurate and physically consistent predictions by leveraging the advantages of both physics-based and data-driven methods, Karpatne et al. (Citation2017); Kapusuzoglu and Mahadevan (Citation2020) could help address these issues by leveraging the strengths of both approaches. Hybrid models often outperform purely data-driven models when the physics components do not contain oversimplifications, incomplete modeling, insufficient parameters, or incorrect assumptions.

While prior reviews have surveyed standalone physics-based Cubillo et al. (Citation2016); Bayat et al. (Citation2021) or data-driven models Wang et al. (Citation2022); Cerquitelli et al. (Citation2021) in manufacturing, this paper specifically focuses on reviewing applications of the hybrid of physics and data-driven modeling to provide manufacturing researchers and practitioners with a comprehensive overview of how hybrid modeling can be leveraged as an emerging tool. This analysis focuses specifically on combinations of physics-based and data-driven models, not combinations of multiple physics or data-driven models.

The primary aim of this paper is to outline the current practices in manufacturing optimization and introduce readers to why a physics-informed hybrid machine learning approach might outperform traditional ML optimization procedures. It offers a general perspective on the integration of physics and data-driven models, followed by a more detailed exploration of specific objectives and how the reviewed papers align with these objectives. Additionally, it discusses quantifiable measures for evaluating the efficiency of hybrid models and presents challenges encountered in manufacturing, along with suggested solutions. This article aims to inform readers about the possibilities when using these physics and data-driven hybrid combinations in manufacturing optimization. Some papers discuss hybrid modeling in general and in other industries. However, our review concentrates on applications within the manufacturing domain, focusing on the specific objectives the researchers accomplished and how they fit into the broader context of physics and data-driven hybrid combinations. We categorize and discuss these papers under their respective sections, although several papers align with multiple integration objectives.

The remainder of this paper is organized as follows: Section Two provides a brief background on the topic and explains the methodology of the review. Section Three presents the main results of the review and provides different hybrid modeling approaches and applications in manufacturing systems. In Section Four, challenges and future research directions are discussed. Finally, we present conclusions drawn from the previous sections in Section Five. This paper aims to cohesively synthesize learning and best practices to guide further research and the real-world deployment of hybrid manufacturing models.

2. Background and review methodology

Understanding the underlying physics of manufacturing processes is the key to enabling optimization. Several traditional tools exist that can be used for manufacturing process optimization. Heuristics Chauvet et al. (Citation2003), simulation Aksarayli and Yildiz (Citation2011), lower-dimensional modeling Wattamwar et al. (Citation2008), Six Sigma Naeem et al. (Citation2016), lean manufacturing Sabadka et al. (Citation2017), and the design of experiments Beg et al. (Citation2019) are a few examples of such methods. These methods aim to optimize manufacturing processes, supply chains, and pipelines. With improved hardware and software, data accessibility across processes has increased and continues to improve. However, supply chain optimization often focuses on minimizing variation for maximum stability. This limits data variety, as experiments that introduce variability are avoided due to the costs of time, resources, and effort. Limited data variety makes modeling difficult.

In this article, we define ‘physics’ as any model, heuristic, or algorithm that is explainable, avoiding the black-box nature of machine learning models. Thus, the papers presented here do not consider hybrid black box, data-driven, or physics-based models in isolation. Instead, we consider a combination of an explainable physics model and a black-box data-driven model. With this focus, we also demonstrate how the integration of physics and data-driven models can work together to achieve various objectives. The authors of this paper have worked on hybrid modeling for energy-efficient CNC grinding and have effectively applied some of these techniques. With minimal access to the machine elsewhere, we achieved the objective of at least a 15% energy reduction in a CNC grinding machine using these methods. Additionally, for a particular part, we achieved a 41% reduction in time and a 37% reduction in energy consumption.

Scopus and Web of Science (WoS) serve as curated sources of high-quality academic articles, and for this reason, we selected these two databases for our literature study. To capture all relevant instances of published research featuring combinations of physics and data-driven hybrid modeling, we began by including synonyms of the involved terms, as illustrated in . Our search specifically targeted hybrid modeling scenarios based on the combination of physics and data-driven approaches. Thus, we considered terms that could signify the same concept. Our approach was informed by a previously conducted bibliometric analysis Kasilingam et al. (Citation2021) that focused on hybrid modeling in manufacturing, related terminology, and the most commonly used methods and processes. This analysis helped define the review methodology, constraints, time range, and search string used in this article.

Figure 1. Overview of the published hybrid models in the manufacturing domain.

Figure 1. Overview of the published hybrid models in the manufacturing domain.

The defined search term returned 251 unique articles from WoS and Scopus. Following a comprehensive manual review, we checked if these articles indeed featured a combination of physics and data-driven models, as required by our inclusion criteria. This process reduced the final number of articles to 81. Any articles lacking sufficient information on the physics portion or how the integration occurred were eliminated. Furthermore, generic digital twin articles that did not provide details on implementing manufacturing processes were also excluded. An overview of the article review methodology is presented in .

Figure 2. Methodology of review of hybrid modeling in manufacturing articles.

Figure 2. Methodology of review of hybrid modeling in manufacturing articles.

3. Hybrid modeling approaches and applications

Integrating physics knowledge into data-driven models has numerous applications in the manufacturing domain, and there are multiple methods to achieve this integration. In this section, we will explore various approaches found in the literature, which can be employed to infuse physics knowledge into data-driven models and create hybrid models. We will also highlight different studies that have utilized these methods within the manufacturing domain. All references in this section, whether cited in the text or presented in table format, constitute the main results extracted from our literature review of 81 papers.

In our reviewed paper, we have identified the top five manufacturing categories with the highest frequencies where hybrid models were applied, as illustrated in . It comes as no surprise that machine monitoring is the field with the most applications published. It should be pointed out that additive manufacturing and welding have some overlapping similarities when it comes to the models used in metal additive manufacturing, another indicator that the availability and quality of physics models is a crucial factor for the successful application of hybrid models. Below is a high-level overview of how the physics and data-driven models can be integrated.

Table 1. Top five manufacturing categories across reviewed papers.

3.1. Integrating physics knowledge into machine learning models

Manufacturing, as an industry that is fundamentally centered around the manipulation of physical objects, has a long history of dedicated work to understand the physics of its processes and operations. The performance of hybrid models, when compared to purely data-driven models, is often better when the physics model component does not contain oversimplifications, partial modeling, insufficient parameters/representation, or incorrect assumptions. Mechanistic models are typically derived from conservation laws (material, momentum, and energy), as well as thermodynamic, kinetic, and/or transport laws forming the foundation of many physics-based models in a manufacturing context. Below are examples of how physics and data-driven models can be integrated.

Physics-Guided Loss Function is the first approach that can be employed. This can be done by incorporating physics knowledge into data-driven models by integrating physical knowledge into the ML loss functions. This helps ML models capture dynamic patterns that align with established physical laws. The addition of physics-based loss aims to ensure consistency with physical laws. It is weighted by a hyperparameter, γ, which is determined alongside other ML hyperparameters through validation data or a nested cross-validation setup. Regularization by physical constraints allows the model to learn even with unlabeled data, as the computation of the physics-based loss does not require observational data.

Physics-guided initialization is another approach. Neural network weights are often initialized according to a random distribution before training. Poor initialization can cause models to become stuck in local minima, especially for deep neural networks. However, if physical or other contextual knowledge is used to inform weight initialization, model training can be accelerated, and fewer training samples may be required. One possibility is to use simulated data from the physics-based model to pre-train the ML model. Physics-guided initialization can also be accomplished using a self-supervised learning method. One approach involves using a physics-based model to simulate intermediate physical variables, which can then be used to pre-train ML models by adding supervision to hidden layers.

Finally, we can employ a physics-guided design of ML architecture. New ML architectures can be designed to leverage the specific characteristics of the problem at hand. The modular and flexible nature of neural networks makes them ideal candidates for architecture modification. For example, domain knowledge can be used to specify node connections that capture physics-based dependencies among variables. One way to incorporate physical principles into neural network design is to assign physical meaning to specific neurons in the neural network. It is also possible to explicitly declare physically relevant variables. Another related approach is to fix one or more weights within the neural network to physically meaningful values or parameters and make them non-modifiable during training. Recurrent neural networks (RNNs) encode temporal invariance, and convolutional neural networks (CNNs) can implicitly encode spatial translation, rotation, and scale invariance. Similarly, scientific modeling tasks may require other invariances based on physical laws. Architecture modifications incorporating symmetry are also commonly seen in dynamic systems research involving differential equations. Raissi and Em Karniadakis (Citation2018) demonstrated that the covariance function can explicitly encode the underlying physical laws expressed by differential equations to solve Partial Differential Equations (PDE) and learn with smaller datasets.

3.2. Hybrid models applications in manufacturing

Here we will discuss the utilization of available fundamental knowledge to account for the impact of certain factors and provide structure to a comprehensive model. This knowledge enables a clear distinction between established, validated relationships and new variables essential for data-driven or statistical models, in addition to physics-based models. We will elaborate on the integration of physics and data-driven models, offering real-world examples from manufacturing. Hybrid models find applications in various scenarios, including refining existing models, enhancing output resolution, replacing interconnected component models, optimizing computational efficiency, solving partial differential equations, uncovering static characteristics, generating synthetic data, and estimating output uncertainty based on input. This categorization highlights the diverse applications of physics models in the papers under consideration. Interested readers are referred to the seminal book about hybrid modeling in process industries Glassey and Von Stosch (Citation2018). Below, we will go over each application and provide examples of each category in our literature review.

3.2.1. Improving or replacing physics model

While physics models are grounded in established physical laws, they often serve as approximations of reality due to our incomplete understanding of certain processes, resulting in inherent biases. Furthermore, these models frequently rely on a multitude of parameters, the values of which must be estimated using limited observational data, further compromising their performance, particularly when dealing with the heterogeneity of processes across different spatial and temporal scales. The limitations of physics-based models transcend disciplinary boundaries and are widely acknowledged in the scientific community. In contrast, machine learning models, such as neural networks, can discern intricate structures and patterns even in complex scenarios where explicit programming of a system’s exact physical characteristics is unfeasible, provided they are supplied with a sufficient volume of data. Given this ability to automatically extract complex relationships from data, ML models appear promising for scientific problems with physical processes that are not fully understood but have data of adequate quality and quantity available. However, the black-box application of ML has met with limited success in scientific domains for several reasons. Integrated physics-ML models are expected to better capture the dynamics of scientific systems and advance the understanding of underlying physical processes Nouri et al. (Citation2022); Hermann et al. (Citation2022); Zhou et al. (Citation2021); Guo et al. (Citation2021); Huang et al. (Citation2022); Kim et al. (Citation2022). . Shows more examples of improving physics-based models using data-driven techniques.

Table 2. Examples where hybrid models used to improve or replace physics models.

3.2.2. Downscaling

Complex mechanistic models can more precisely capture physical reality than simpler models. This enhanced precision is often achieved by incorporating a wider array of components that account for a greater number of processes at finer spatial or temporal resolutions. However, due to the computational costs and modeling complexities involved, many models are operated at coarser resolutions than what is necessary for accurately representing underlying physical processes. To address this, downscaling techniques have been widely employed to capture physical variables that require modeling at a finer resolution. These downscaling techniques can be categorized into two groups: statistical downscaling and dynamical downscaling.

Statistical downscaling involves using empirical models to predict finer-resolution variables based on coarser-resolution data. This mapping between different resolutions can encompass complex non-linear relationships that traditional empirical models may struggle to precisely represent. Given the significant time required to run these complex models, there is a growing interest in utilizing ML models as surrogate models – models that approximate simulation-driven input-output data – to predict target variables at a higher resolution. While state-of-the-art ML methods can be applied in both statistical and dynamical downscaling, ensuring that the learned ML component remains consistent with established physical laws and enhances overall simulation performance remains a challenge. . Summarize the studies we found in the literature that used downscaling techniques.

Table 3. Examples where downscaling used in hybrid models.

3.2.3. Parameterization

In parameterization, specific complex dynamical processes are simplified into static parameters using physical approximations. One common approach to estimating these parameters involves conducting a grid search across various combinations of parameter values to find the best match with observations, a process known as parameter calibration. Failing to achieve correct parameterization can reduce the model’s robustness, and errors stemming from imperfect parameterization can adversely impact other components of the entire physics-based model, ultimately affecting the modeling of critical physical processes.

An increasingly considered alternative is to replace overly complex processes that cannot be physically represented in the model with simplified dynamic or statistical/ML processes. This allows for the direct learning of new parameterizations from observations and/or high-resolution model simulations using ML methods. This approach has found applications in diverse domains, including additive manufacturing Aljarrah et al. (Citation2022); Liu et al. (Citation2022); Liu and Wang (Citation2022), biopharmaceutical manufacturing Gerogiorgis and Castro-Rodriguez (Citation2021), monitoring Yang et al. (Citation2022), robotics Aivaliotis et al. (Citation2021), smelting Liang et al. (Citation2021), machining Zotov and Kadirkamanathan (Citation2021), and chemical processes Cai et al. (Citation2021). . shows more examples of parametrization in hybrid models.

Table 4. Examples where parameterization used in hybrid models.

3.2.4. Reduced-order models

Reduced-Order Models (ROMs) are computationally efficient representations of complex models. Typically, constructing ROMs involves dimensionality reduction to capture the essential dynamical characteristics of often large, high-fidelity simulations and physical systems models. However, this reduction to a lower-dimensional subspace naturally results in a loss of accuracy.

One recent focus in the realm of ML-based ROMs is approximating the dominant modes of the Koopman (or composition) operator as a dimensionality reduction technique. The Koopman operator is an infinite-dimensional linear operator that encodes the temporal evolution of the system state through nonlinear dynamics. While dynamic mode decomposition is the most common technique for approximating the Koopman operator, recent efforts have been made to approximate Koopman operator embeddings using deep learning models, which have shown superior performance compared to existing methods. Incorporating physics-based knowledge into the learning of the Koopman operator can enhance generalizability and interpretability, aspects that current ML methods in this field often lack. Consequently, integrating principles from physics-based models has the potential to reduce the search space, making ROM training more robust and allowing the model to be trained with less data in various scenarios Roland et al. (Citation2021); Farlessyost and Singh (Citation2022). More examples of using ROMs in hybrid modeling are shown in .

Table 5. Examples where reduced-order models used in hybrid models.

3.2.5. Forward solving partial differential equations

Finite Element Method (FEM) and Finite Difference Method (FDM) based models are often prohibitively expensive. In such cases, traditional methods may not be ideal or, in some instances, even feasible. A common approach is to employ an ML model as a surrogate for the solution to reduce computation time. Neural networks can significantly reduce the computational demands of traditional numerical methods to a single forward pass of a neural network. Notably, solutions obtained through neural networks are naturally differentiable. They possess a closed analytic form that can be easily applied to subsequent calculations, a feature not typically found in more traditional solving methods.

Taking it a step further, deep neural network models have demonstrated success in approximating solutions for high-dimensional physics-based Partial Differential Equations (PDEs) that were previously considered unsuitable for approximation by ML, as observed in the works of Chen and Ahmadi (Citation2022); Teren et al. (Citation2022); Yang and Özel (Citation2021); Ren et al. (Citation2022). More examples of using forward solving PDEs in hybrid modeling are shown in .

Table 6. Examples where hybrid models used to solve PDEs.

3.2.6. Inverse modeling

The forward modeling of a physical system involves utilizing the system’s physical parameters, such as mass, temperature, charge, physical dimensions, or structure, to predict the system’s next state or its output effects. In contrast, inverse modeling employs a system’s potentially noisy output to deduce the underlying physical parameters or inputs. Inverse problems often hold great significance in physics-based modeling communities because they can reveal valuable information that cannot be directly observed.

The solution to an inverse problem can often be computationally expensive due to the potential need for millions of forward model evaluations for estimator assessment or the characterization of posterior distributions of physical parameters. ML-based surrogate models, among other methods like reduced-order models, are emerging as viable options because they can represent high-dimensional phenomena with abundant data and operate considerably faster than most physical simulations. Applying physics-based constraints and stopping conditions based on material properties can guide the optimization process, enhancing data efficiency and the ability to address ill-posed inverse problems. Examples of using inverse modeling in hybrid modeling are shown in .

Table 7. Examples where inverse modeling used in hybrid models.

3.2.7. Discovering governing equations

When the governing equations of a dynamical system are known explicitly, they allow for more robust forecasting, control, and the opportunity for analysis of system stability and bifurcations through increased interpretability. If a mathematical model accurately describes the processes governing the observed data, it can generalize to data outside of the training domain. Advances in machine learning for discovering these governing equations have become an active research area with rich potential to integrate applied mathematics and physics principles with modern machine learning methods Mekarthy (Citation2019). Early works on the data-driven discovery of physical laws relied on heuristics and expert guidance. They were focused on rediscovering known, non-differential laws in different scientific disciplines from artificial data. Recently, work has used sparse regression built on a dictionary of functions and partial derivatives to construct governing equations. These sparse identification techniques are based on the principle of Occam’s Razor, where the goal is to use only a few equation terms to describe any given nonlinear system. . shows examples of this application in the literature.

Table 8. Examples of discovering governing equations usage in hybrid models.

3.2.8. Data generation

Data generation approaches are valuable for creating virtual simulations of scientific data under specific conditions. Traditional physics-based approaches for data generation often depend on running physical simulations or conducting physical experiments, which can be exceedingly time-consuming. Moreover, these approaches are constrained by what can be produced by physics-based models. Therefore, there is a growing interest in generative ML approaches that learn data distributions in unsupervised settings, offering the potential to generate novel data beyond the capabilities of traditional methods Sarishvili et al. (Citation2021); Relan et al. (Citation2021).

The concept behind generative models is to capture the underlying probabilistic distribution to generate similar data. Recent advances in deep learning have led to the development of new generative models, such as Generative Adversarial Networks (GANs) and Variational Autoencoders (VAEs). These models have demonstrated significantly improved performance in learning non-linear relationships and extracting representative latent embeddings from observational data. An emerging area of research is the engineering of GANs that can incorporate prior knowledge of physics, including physical laws and invariance properties. The physics-based constraints ensure that the generated artificial samples exhibit the same morphology distribution as authentic data, thus greatly reducing the vast design space required for data generation. Examples of data generation applications in hybrid models are illustrated in .

Table 9. Examples of using data generation in hybrid models.

3.2.9. Uncertainty quantification

Uncertainty Quantification (UQ) necessitates the accurate characterization of the entire distribution rather than merely making a point prediction. This approach enables various analyses, such as assessing the proximity of predictions to acceptable values and conducting sensitivity analyses of input features. Within the ML community, Gaussian processes have often served as the primary technique for quantifying uncertainty in simulating physical processes. In contrast, traditional methods like Monte Carlo (MC) are typically unfeasible due to the substantial number of forward model evaluations required to obtain statistically meaningful results when applying uncertainty quantification to physics models.

The integration of physics knowledge into ML for UQ holds the potential to provide a more robust characterization of uncertainty Schoeneberger et al. (Citation2021); Wang et al. (Citation2022); McGowan et al. (Citation2022). For instance, ML surrogate models run the risk of producing physically inconsistent predictions, and the incorporation of physics principles could mitigate this issue. Additionally, it’s worth noting that the reduced data requirements of ML, driven by the necessity to adhere to known physical laws, may alleviate some of the high computational costs associated with bayesian neural networks for UQ. Examples of uncertainty quantification applications in hybrid models are illustrated in .

Table 10. Examples of using uncertainty quantification in hybrid models.

3.3. Evaluation metrics for hybrid modeling in manufacturing

Evaluation metrics are used to qualify the efficiency and prediction accuracy of the statistical or machine learning model. This part provides a summary of commonly used performance metrics that are appropriate for hybrid models in the manufacturing domain. The summary provided here will enhance the general comprehension of metrics for hybrid modeling and make it easier to choose them for use in various industrial applications. This section will cover the existing evaluation metrics for these classification and regression tasks individually. We will also provide examples of using the mentioned metrics in our review.

3.3.1. Metrics for classification problems

Based on our review, the classification problem in the manufacturing domain includes chatter detection Yang and Rai (Citation2019); Rahimi et al. (Citation2021), bearing fault detection Tian et al. (Citation2015); Sun et al. (Citation2017), turbine blades crack detection Pillai et al. (Citation2016). Model classification performance can be assessed using various evaluation criteria, including accuracy, precision, recall, and F1 score.

Accuracy is the most commonly used metric for classification problems, quantifying the ratio of correctly predicted samples to the total number of prediction samples. The performance of hybrid models has been assessed in various manufacturing applications using accuracy measures Yumei et al. (Citation2017); Sadoughi and Chao (Citation2019); Sun et al. (Citation2017). However, accuracy is most effective when the dataset has an equal distribution among classes. In the manufacturing process, collected datasets typically suffer from imbalanced class issues. Specifically, fault data in manufacturing industries is often overwhelmed by instances of faultless data Fathy et al. (Citation2020). To address these accuracy shortcomings and enhance assessment robustness in the presence of an imbalanced dataset, precision and recall metrics are proposed. Precision calculates the percentage of true positive samples within all predicted positive samples, while recall measures the machine learning model’s ability to correctly identify true positives within all actual positive samples. These two metrics are better suited to handle imbalanced datasets and offer a more objective description of hybrid model classification performance compared to current state-of-the-art methods. In a study by Yazdi et al. (Citation2020), these metrics were used to confirm the superiority of the hybrid model over conventional VGG and SVM models for additive manufacturing process monitoring tasks. However, evaluating a model’s performance can be challenging when precision is high and recall is low, or vice versa. The F1 score addresses this by measuring both precision and recall simultaneously, providing a more comprehensive assessment of the predictive hybrid model’s performance Yazdi et al. (Citation2020).

3.3.2. Metrics for regression problems

The regression modeling establishes a mapping between input variables and continuous hybrid model output variables where both input and output variables can be multivariate. Regression problems encompass a range of applications, including molten steel temperature prediction Ahmad et al. (Citation2014); Okura et al. (Citation2013), additive manufacturing melt pool size prediction Olleak and Zhimin (Citation2020); Mondal et al. (Citation2020), remaining useful life estimation Aivaliotis et al. (Citation2021); H. Sun et al. (Citation2018), and more.

provides a detailed summary of evaluation metrics for regression problems. Among these metrics, mean squared error (MSE) and mean absolute error (MAE) are the most commonly used to assess hybrid model performance. MSE represents the mean of the sum of squares of the differences between actual and predicted values and is utilized in studies such as Z. Liu et al. (Citation2019), Reinhart et al. (Citation2017), and Ambrogio et al. (Citation2008).On the other hand, MAE quantifies the absolute error between predictions and ground truth values, as demonstrated by Cho et al. (Citation1997), Swischuk et al. (Citation2019), and Haghighi and Lin (Citation2020). In addition to these widely used metrics, there are unique metrics tailored to specific problems. In Hong et al. (Citation2019), a new metric called ‘score’ is defined to assess remaining useful life prediction results for gas turbine engines, which may be more tolerant and less critical than RUL prediction for turbofan engines.

Table 11. Regression evaluation metrics for hybrid models.

4. Challenges and future research directions

Despite recent advancements in the field of hybrid modeling techniques within the manufacturing industry, as presented in this paper, several challenges and knowledge gaps persist. Further research is necessary to address these current issues and to expand the use and functionality of hybrid models. In this section, we will discuss potential future research directions related to hybrid modeling in manufacturing and the manufacturing industry.

4.1. Small datasets and data limitations

Sufficient and accurate (high-quality) data are required to characterize, develop, validate, and assess hybrid models in the manufacturing domain. Unfortunately, the complexity of manufacturing data collection and processing often limits access to sufficiently large and high-quality datasets. Setting up the required connectivity and pre-processing is an expensive and time-consuming process. The raw manufacturing data may come from different distributed sources in various formats, such as RGB and depth images from the camera Yazdi et al. (Citation2020); Wang et al. (Citation2018), acoustic signals from microphones Yang and Rai (Citation2019); Rahimi et al. (Citation2021), and numerical values from vibration sensors Sadoughi and Chao (Citation2019). These raw datasets generally have variable sampling rates, necessitating additional processing with image/signal preprocessing and data fusion methods before being fed into the hybrid model.

Furthermore, the data labeling work for most manufacturing problems typically requires domain knowledge. For example, detecting minor flaws or bearing issues always necessitates the involvement of skilled personnel or experts. Otherwise, the training of the hybrid model may be complicated by inconsistent and erroneous labels. The amount of qualifying production data for hybrid model training is constrained by the aforementioned problems. However, most machine learning models, especially deep learning models, are data-hungry algorithms, as they perform better with a large amount of data. Deep learning models trained with small datasets need better generalization. Developing data-efficient and robust hybrid models with fewer labels is imperative, not limited to labeled data constraints, and can expand their applications.

Semi-supervised learning, already utilized in semiconductor production Kang et al. (Citation2016) and laser powder-bed fusion fault detection Okaro et al. (Citation2019), is one option that requires fewer labeled data and a vast amount of unlabeled data. Unlabeled data, containing only input variables, can be collected through various sensors with less effort than labeled data. The reduction in the demand for labeled data can significantly enhance its feasibility in the manufacturing sector. However, semi-supervised learning-based hybrid models have yet to be deployed in the industrial sector and should be a focus of future development.

Moreover, knowledge sharing accumulates and maintains knowledge learned from previous tasks and seamlessly applies it to learning new tasks and solving new problems with minimal data and effort Adadi (Citation2021). Transfer Learning is a representative knowledge-sharing technique, that has been applied to machine fault diagnostics Wang and Gao (Citation2020) and the tool’s remaining useful life prediction Sun et al. (Citation2018). Merging physics models with transfer learning architectures provides a promising solution to the data limitation and warrants more research in the area.

Apart from hybrid models, data augmentation techniques can also create additional data by modifying the original data without changing its distribution Perez and Wang (Citation2017). In the manufacturing domain, data augmentation methods include variational autoencoders Yun et al. (Citation2020), Gaussian Noise Li et al. (Citation2020), and synthetic data augmentation through generative adversarial networks Jain et al. (Citation2020).While good prediction results can be obtained with each augmentation technique independently, a combined augmentation technique can be a future research direction, expected to leverage the advantages of different approaches and further enhance the hybrid model’s accuracy and efficiency.

4.2. Imbalanced data in manufacturing applications

Imbalanced data is a term used to describe the uneven representation of different classes in a classification task. In manufacturing, imbalanced data is more likely to occur in defect/fault detection situations Saqlain et al. (Citation2020); Lee et al. (Citation2020). Compared with the standard sample, the defects sample generally occupies a small portion of the overall dataset, which can lead to biased learning and misclassification issues. This issue is challenging to overcome, given the manufacturing objective of producing high-quality parts and continually improving processes. For example, in the study by Saqlain et al. (Citation2020); Lee et al. (Citation2020) the eight wafer faults account for only 14.8% of all labeled datasets. This data imbalance and the resulting misclassifications may lead to issues from the analytic point.

While we cannot change the nature of the data due to the manufacturing objective, we can address the problem in other ways. Three strategies show promise for addressing the issue of data imbalance in the manufacturing sector: data-driven methods, algorithmic-based strategies, and hybrid approaches Fathy et al. (Citation2020). Since the selection of a solution depends heavily on the dataset’s characteristics, there is no one-size-fits-all solution and no clear guidance for solution selection. Developing well-defined guidance is critical to improving the performance of hybrid models when dealing with imbalanced data.

Furthermore, most solutions for imbalanced data issues have traditionally focused on image data, such as image rotation, flipping, and noise addition. Research addressing imbalanced data issues in non-image manufacturing data is limited and should be expanded upon Johnson and Khoshgoftaar (Citation2019).

4.3. Benchmark problems for hybrid modeling in manufacturing

Benchmark problems enable the performance evaluation of different machine learning algorithms and serve as a common platform for addressing challenging domain problems, which have been shared by many research groups over an extended time frame Dueben et al. (Citation2022). In the field of computer vision, benchmark problems are well-established and include datasets like COCO Lin et al. (Citation2014), CIFAR-10 Krizhevsky and Hinton (Citation2009), and KITTI Geiger et al. (Citation2012). However, the manufacturing domain has a limited number of benchmark problems Farahani, McCormick, Gianinny, et al. (Citation2023). The Case Western Reserve University (CWRU) bearing dataset Smith and Randall (Citation2015), has been the most extensively utilized benchmark problem in the diagnostics category for bearing defect identification. Consequently, most diagnostics applications in the selected articles primarily focus on defect detection. In contrast, there are relatively few articles that address chatter and crack detection due to a lack of suitable benchmark problems. Some publicly available manufacturing datasets from NASA and others that can be useful in this regard were introduced in Farahani, McCormick, Harik, et al. (Citation2023).

Manufacturing encompasses various categories beyond diagnostics, including manufacturing process control and human-machine interaction, among others. Ideally, each type of application should have a distinct benchmark dataset to stimulate competitive progress. In general, the growth of state-of-the-art techniques is directly associated with the complexity of the benchmark problems Rai and Sahu (Citation2020). Future work could involve formulating benchmark problems for hybrid models across various manufacturing processes, significantly accelerating research in hybrid models, and expanding their practical applications.

4.4. Hybrid model selection

Proper model selection is a crucial precondition for the successful implementation of a model. The manufacturing industry has achieved remarkable success with data-driven models, particularly deep learning, owing to the intuitive model selection guidelines that are already in place. For instance, CNN has been widely used for analyzing visual images Imoto et al. (Citation2018), the LSTM model has demonstrated its ability to learn from sequential data Lindemann et al. (Citation2020), and Autoencoder architecture is capable of feature extraction without label information J. Wang et al. (Citation2018).

In manufacturing, hybrid models can be classified into three main categories based on their interaction methods: combination Tian et al. (Citation2015), constraint Kapusuzoglu and Mahadevan (Citation2020), and embedding approaches Slatteke and Astrom (Citation2005). However, the criteria for selecting relatively novel hybrid models are less standardized compared to predominantly data-driven approaches. This selection process is primarily influenced by the domain expert’s opinion at the current stage. For example, the combination approach is commonly used in most bearing defect detection problems Sun et al. (Citation2017); Sadoughi and Chao (Citation2019); Tian et al. (Citation2015); Junbo et al. (Citation2015); Chen et al. (Citation2017). However, none of these papers address the rationale for selecting the combination technique or demonstrate its superiority over the other two approaches. Therefore, it is essential to determine how to choose the appropriate components from both physics and data-driven methods, as well as the interface style, based on the various challenges at hand. Ineffective implementation, resulting from a lack of clear direction, negatively impacts the model’s performance. It is imperative to develop a universal hybrid model selection approach to address manufacturing problems, as this can significantly enhance performance and broaden the model’s applicability.

5. Conclusion and Limiations

This paper provides an overview of hybrid modeling in manufacturing. Our literature analysis focuses on the clear combination of physics-based and data-driven modeling in a manufacturing context. We have observed that the reviewed literature covers only 10 use cases related to traditional manufacturing processes, such as machining, milling, rolling, smelting, drilling, forming, and sintering.

In this review, we have gathered and presented information on how new hybrid models can be developed and designed. We also offer an overview of alternative architectures for hybrid models and have collected common testing and evaluation metrics for such models. Furthermore, we discuss key challenges associated with hybrid models in a manufacturing context, which are crucial for those interested in developing and implementing hybrid models on the manufacturing shop floor and beyond. Additive manufacturing, machine monitoring, tool wear, and welding are the predominant areas where hybrid models have been implemented in the reviewed literature to date. When reflecting on these dominant areas, we notice that readily available access to data, compared to other areas, exists for both additive manufacturing and monitoring. Additionally, the availability of sophisticated physics models, either as simulations or through experimentation, stands out. Thus, for future work, it is essential to investigate how to make data and physics models for traditional manufacturing processes more readily available, enabling the implementation of hybrid models and the ensuing positive results.

It is worth noting that manufacturing challenges go beyond optimization problems. This article explores the various objectives that can be pursued in this context, including modeling, data generation, and uncertainty quantification. Manufacturing researchers and practitioners must consider whether new projects can be initiated to create innovative models that enhance our understanding of processes and interactions in unprecedented ways.

The limitations of this work should be acknowledged. The literature review, while following a strict methodology, may still exhibit certain biases. For example, the term ‘production’ is often used internationally to refer to manufacturing activities, and this usage may have been missed by our defined search terms. Similarly, the variant spelling of ‘optimization’ was not considered, potentially affecting the results and leading to more use cases that could have been presented. Thermal sciences are well-documented in physics and are relatively common among hybrid modeling use cases, whether in additive manufacturing or welding. Moreover, a substantial portion of the data and analyses presented in this paper are inherently subjective and influenced by our perceptions. Recognizing that the complete elimination of subjectivity is impossible and acknowledging that such an endeavor may not be entirely desirable, our commitment is to be transparent about the process and methodology employed. We aim to ensure that our audience is well-informed about our biases, intent, understanding, and the consequential impact on the content of this paper. Finally, some additional papers may get indexed after our initial database search so they may not be considered here, and the result may differ under different assumptions.

Acknowledgments

The authors thank the anonymous reviewers for their constructive comments that improved the paper substantially.

Disclosure statement

No potential conflict of interest was reported by the author(s).

Correction Statement

This article has been corrected with minor changes. These changes do not impact the academic content of the article.

Additional information

Funding

This material is based upon the study supported by (i) the U.S. Department of Energy’s Office of Energy Efficiency and Renewable Energy (EERE) under the Advanced Manufacturing Office Award Number DE-EE0007613 (Disclaimer: This report was prepared as an account of the work sponsored by an agency of the United States Government. Neither the United States government nor any agency thereof, nor any of its employees, makes any warranty, express or implied, or assumes any legal liability or responsibility for the accuracy, completeness, or usefulness.); and (ii) the National Science Foundation under Grant No. 2119654. Any opinions, findings, and conclusions or recommendations expressed in this material are those of the author(s) and do not necessarily reflect the views of the National Science Foundation.

References

  • Adadi, A. (2021). A survey on data-efficient algorithms in big data era. Journal of Big Data, 8(1), 1–27. https://doi.org/10.1186/s40537-021-00419-9
  • Agana, N. A., Oleka, E., Awogbami, G., & Homaifar, A. (2018). Short-term load forecasting based on a hybrid deep learning model. In SoutheastCon 2018, St. Petersburg, Florida, USA (pp. 1–6). IEEE.
  • Ahmad, I., Kano, M., Hasebe, S., Kitada, H., & Murata, N. (2014). Gray-box modeling for prediction and control of molten steel temperature in tundish. Journal of Process Control, 24(4), 375–382. https://doi.org/10.1016/j.jprocont.2014.01.018
  • Aivaliotis, P., Arkouli, Z., Georgoulias, K., & Makris, S. (2021). Degradation curves integration in physics-based models: Towards the predictive maintenance of industrial robots. Robotics and Computer-Integrated Manufacturing, 71, 102177. https://doi.org/10.1016/j.rcim.2021.102177
  • Aksarayli, M., & Yildiz, A. (2011). Process optimization with simulation modeling in a manufacturing system. Research Journal of Applied Sciences, Engineering & Technology, 3(4), 318–328.
  • Aljarrah, O., Jun, L., Heryudono, A., Huang, W., & Jing, B. (2022). Predicting part distortion field in additive manufacturing: A data-driven framework. Journal of Intelligent Manufacturing, 34(4), 1975–1993. https://doi.org/10.1007/s10845-021-01902-z
  • AlThobiani, F., Ball, A., & Ball, A. (2014). An approach to fault diagnosis of reciprocating compressor valves using teager–Kaiser energy operator and deep belief networks. Expert Systems with Applications, 41(9), 4113–4122. https://doi.org/10.1016/j.eswa.2013.12.026
  • Ambrogio, G., Filice, L., Shivpuri, R., & Umbrello, D. (2008). Application of NN technique for predicting the in-depth residual stresses during hard machining of AISI 52100 steel. International Journal of Material Forming, 1(1), 39–45. https://doi.org/10.1007/s12289-008-0004-5
  • Bayat, M., Dong, W., Thorborg, J., To, A. C., & Hattel, J. H. (2021). A review of multi-scale and multi-physics simulations of metal additive manufacturing processes with focus on modeling strategies. Additive Manufacturing, 47, 102278. https://doi.org/10.1016/j.addma.2021.102278
  • Beg, S., Swain, S., Rahman, M., Saquib Hasnain, M., & Sarim Imam, S. (2019). Application of design of experiments (DoE) in pharmaceutical product and process optimization. In S. Beg & H. Saquib (Eds.), Pharmaceutical quality by design (pp. 43–64). Elsevier.
  • Bhutani, N., Rangaiah, G. P., & Ray, A. K. (2006). First-principles, data-based, and hybrid modeling and optimization of an industrial hydrocracking unit. Industrial & Engineering Chemistry Research, 45(23), 7807–7816. https://doi.org/10.1021/ie060247q
  • Cai, J., Yan, C., Yuzhe, M., Bei, Y., Zhou, D., & Zeng, X. (2021). “NeurFill: Migrating full-chip CMP simulators to neural networks for model-based dummy filling synthesis.” In 2021 58th ACM/IEEE Design Automation Conference (DAC), San Francisco, California, USA (pp. 187–192). IEEE.
  • Cardillo, A. G., Monica Castellanos, M., Desailly, B., Dessoy, S., Mariti, M., Portela, R. M., Scutella, B., von Stosch, M., Tomba, E., & Varsakelis, C. (2021). Towards in silico process modeling for vaccines. Trends in Biotechnology, 39(11), 1120–1130. https://doi.org/10.1016/j.tibtech.2021.02.004
  • Carrascal, A., & Alberdi, A. (2010.) “Evolutionary industrial physical model generation.” In International Conference on Hybrid Artificial Intelligence Systems, San Sebastián, Spain (pp. 327–334). Springer.
  • Cerquitelli, T., Jahier Pagliari, D., Calimera, A., Bottaccioli, L., Patti, E., Acquaviva, A., & Poncino, M. (2021). Manufacturing as a data-driven practice: Methodologies, technologies, and tools. Proceedings of the IEEE, 109(4), 399–422. https://doi.org/10.1109/JPROC.2021.3056006
  • Chauvet, F., Herrmann, J. W., & Proth, J.-M. (2003). Optimization of cyclic production systems: A heuristic approach. IEEE Transactions on Robotics and Automation, 19(1), 150–154. https://doi.org/10.1109/TRA.2002.807529
  • Chen, H., & Ahmadi, K. (2022). Estimating pose-dependent FRF in machining robots using multibody dynamics and Gaussian process regression. Robotics and Computer-Integrated Manufacturing, 77, 102354. https://doi.org/10.1016/j.rcim.2022.102354
  • Chen, Z., Deng, S., Chen, X., Chuan, L., Sanchez, R.-V., & Qin, H. (2017). Deep neural networks-based rolling bearing fault diagnosis. Microelectronics Reliability, 75, 327–333. https://doi.org/10.1016/j.microrel.2017.03.006
  • Cho, S., Jang, M., Yoon, S., Chot, Y., & Cho, H. (1997). A hybrid neural-network/mathematical prediction model for tandem cold mill. Computers & Industrial Engineering, 33(3–4), 453–456. https://doi.org/10.1016/S0360-8352(97)00166-6
  • Cubillo, A., Perinpanayagam, S., & Esperon-Miguez, M. (2016). A review of physics-based models in prognostics: Application to gears and bearings of rotating machinery. Advances in Mechanical Engineering, 8(8), 1687814016664660. https://doi.org/10.1177/1687814016664660
  • Destro, F., Facco, P., Garca Muñoz, S., Bezzo, F., & Barolo, M. (2020). A hybrid framework for process monitoring: Enhancing data-driven methodologies with state and parameter estimation. Journal of Process Control, 92, 333–351. https://doi.org/10.1016/j.jprocont.2020.06.002
  • Dueben, P. D., Schultz, M. G., Chantry, M., John Gagne, D., Matthew Hall, D., & McGovern, A. (2022). Challenges and benchmark datasets for machine learning in the atmospheric sciences: Definition, status, and outlook. Artificial Intelligence for the Earth Systems, 1(3), e210002. https://doi.org/10.1175/AIES-D-21-0002.1
  • Elkhashap, A., Meier, R., & Abel, D. (2020). A grey box distributed parameter model for a continuous vibrated fluidized bed dryer in pharmaceutical manufacturing. In 2020 European Control Conference (ECC), Saint Petersburg, Russia (pp. 1415–1421). IEEE.
  • Farahani, M. A., McCormick, M. R., Gianinny, R., Hudacheck, F., Harik, R., Liu, Z., & Wuest, T. (2023). Time-series pattern recognition in smart manufacturing systems: A literature review and ontology. Journal of Manufacturing Systems, 69, 208–241. https://doi.org/10.1016/j.jmsy.2023.05.025
  • Farahani, M. A., McCormick, M. R., Harik, R., & Wuest, T. (2023). Time-series classification in smart manufacturing systems: An experimental evaluation of state-of-the-art machine learning algorithms. arXiv preprint arXiv:2310.02812.
  • Farlessyost, W., & Singh, S. (2022). Reduced order dynamical models for complex dynamics in manufacturing and natural systems using machine learning. Nonlinear Dynamics, 110(2), 1613–1631. https://doi.org/10.1007/s11071-022-07695-x
  • Fathy, Y., Jaber, M., & Brintrup, A. (2020). Learning with imbalanced data in smart manufacturing: A comparative analysis. Institute of Electrical and Electronics Engineers Access, 9, 2734–2757. https://doi.org/10.1109/ACCESS.2020.3047838
  • Gaikwad, A., Yavari, R., Montazeri, M., Cole, K., Bian, L., & Rao, P. (2020). Toward the digital twin of additive manufacturing: Integrating thermal simulations, sensing, and analytics to detect process faults. IISE Transactions, 52(11), 1204–1217. https://doi.org/10.1080/24725854.2019.1701753
  • Geiger, A., Lenz, P., & Urtasun, R. (2012). Are we ready for autonomous driving? The KITTI vision benchmark suite. In Conference on Computer Vision and Pattern Recognition (CVPR), Providence, Rhode Island, USA.
  • Gerogiorgis, D. I., & Castro-Rodriguez, D. (2021). A digital twin for process optimisation in pharmaceutical manufacturing. In M. Türkay & R. Gani (Eds.), Computer aided chemical engineering (Vol. 50, pp. 253–258). Elsevier.
  • Glassey, J., & Von Stosch, M. (2018). Hybrid modeling in process industries. CRC Press.
  • Guo, J., Yang, Z., Chen, C., Luo, W., & Wei, H. (2021). Real-time prediction of remaining useful life and preventive maintenance strategy based on digital twin. Journal of Computing and Information Science in Engineering, 21(3), 3. https://doi.org/10.1115/1.4049153
  • Haghighi, A., & Lin, L. (2020). A hybrid physics-based and data-driven approach for characterizing porosity variation and filament bonding in extrusion-based additive manufacturing. Additive Manufacturing, 36, 101399. https://doi.org/10.1016/j.addma.2020.101399
  • Hanachi, H., Wennian, Y., Yong Kim, I., Liu, J., & Mechefske, C. K. (2019). Hybrid data-driven physics-based model fusion framework for tool wear prediction. The International Journal of Advanced Manufacturing Technology, 101(9), 2861–2872. https://doi.org/10.1007/s00170-018-3157-5
  • Harding, J., Srinivas Shahbaz, M., Kusiak, A., & Kusiak, A. (2006). Data mining in manufacturing: A review. Journal of Manufacturing Science and Engineering, 128(4), 969–976. https://doi.org/10.1115/1.2194554
  • Hermann, F., Chen, B., Ghasemi, G., Stegmaier, V., Ackermann, T., Reimann, P., Vogt, S., Graf, T., & Weyrich, M. (2022). A digital twin approach for the prediction of the geometry of single tracks produced by laser metal deposition. Procedia CIRP, 107, 83–88. https://doi.org/10.1016/j.procir.2022.04.014
  • Hong, J., Wang, Q., Qiu, X., & Leng Chan, H. (2019). “Remaining useful life prediction using time-frequency feature and multiple recurrent neural networks.” In 2019 24th IEEE International Conference on Emerging Technologies and Factory Automation (ETFA), Zaragoza, Spain (pp. 916–923). IEEE.
  • Huang, J., Chang, Q., & Arinez, J. (2020). Product completion time prediction using a hybrid approach combining deep learning and system model. Journal of Manufacturing Systems, 57, 311–322. https://doi.org/10.1016/j.jmsy.2020.10.006
  • Huang, W., Zhang, X., Chaoqun, W., Cao, S., & Zhou, Q. (2022). Tool wear prediction in ultrasonic vibration-assisted drilling of CFRP: A hybrid data-driven physics model-based framework. Tribology International, 174, 107755. https://doi.org/10.1016/j.triboint.2022.107755
  • Hu, J., Min, W., Chen, X., Cao, W., & Pedrycz, W. (2019). Multi-model ensemble prediction model for carbon efficiency with application to iron ore sintering process. Control Engineering Practice, 88, 141–151. https://doi.org/10.1016/j.conengprac.2019.05.009
  • Hürkamp, A., Lorenz, R., Ossowski, T., Behrens, B.-A., & Dröder, K. (2021). Simulation-based digital twin for the manufacturing of thermoplastic composites. Procedia CIRP, 100, 1–6. https://doi.org/10.1016/j.procir.2021.05.001
  • Hwang, R., Hyeontae, J., Seok Kim, K., & Ju Hwang, H. (2020). Hybrid model of mathematical and neural network formulations for rolling force and temperature prediction in hot rolling processes. Institute of Electrical and Electronics Engineers Access, 8, 153123–153133. https://doi.org/10.1109/ACCESS.2020.3016725
  • Imoto, K., Nakai, T., Ike, T., Haruki, K., & Sato, Y. (2018). “A CNN-based transfer learning method for defect classification in semiconductor manufacturing.” In 2018 international symposium on semiconductor manufacturing (ISSM), Toyko, Japan (pp. 1–3). IEEE.
  • Islam, F., Wanigasekara, C., Rajan, G., Swain, A., & Gangadhara Prusty, B. (2022). An approach for process optimisation of the automated fibre placement (AFP) based thermoplastic composites manufacturing using machine learning, photonic sensing and thermo-mechanics modelling. Manufacturing Letters, 32, 10–14. https://doi.org/10.1016/j.mfglet.2022.01.002
  • Jain, S., Seth, G., Paruthi, A., Soni, U., & Kumar, G. (2020). Synthetic data augmentation for surface defect detection and classification using deep learning. Journal of Intelligent Manufacturing, 33(4), 1007–1020. https://doi.org/10.1007/s10845-020-01710-x
  • Johnson, J. M., & Khoshgoftaar, T. M. (2019). Survey on deep learning with class imbalance. Journal of Big Data, 6(1), 1–54. https://doi.org/10.1186/s40537-019-0192-5
  • Junbo, T., Weining, L., Juneng, A., & Xueqian, W. (2015). Fault diagnosis method study in roller bearing based on wavelet transform and stacked auto-encoder. In The 27th Chinese control and decision conference (2015 CCDC), Qingdao, China (pp. 4608–4613). IEEE.
  • Kang, P., Kim, D., & Cho, S. (2016). Semi-supervised support vector regression based on self-training with label uncertainty: An application to virtual metrology in semiconductor manufacturing. Expert Systems with Applications, 51, 85–106. https://doi.org/10.1016/j.eswa.2015.12.027
  • Kapusuzoglu, B., & Mahadevan, S. (2020). Physics-informed and hybrid machine learning in additive manufacturing: Application to fused filament fabrication. Jom, 72(12), 4695–4705. https://doi.org/10.1007/s11837-020-04438-4
  • Karpatne, A., Watkins, W., Read, J., & Kumar, V. (2017). Physics-guided neural networks (pgnn): An application in lake temperature modeling. arXiv preprint arXiv:1710.11431 2.
  • Kasilingam, S., Keepers, M., & Wuest, T. (2021). A bibliometric analysis of physics-based and data-driven hybrid modeling. Procedia CIRP, 103, 49–54. https://doi.org/10.1016/j.procir.2021.10.007
  • Kharwar, P. K., Kumar Verma, R., & Singh, A. (2022). Neural network modeling and combined compromise solution (CoCoso) method for optimization of drilling performances in polymer nanocomposites. Journal of Thermoplastic Composite Materials, 35(10), 1604–1631. https://doi.org/10.1177/0892705720939165
  • Kim, K. M., Hurley, P., & Pacheco Duarte, J. (2022). Physics-informed machine learning-aided framework for prediction of minimum film boiling temperature. International Journal of Heat and Mass Transfer, 191, 122839. https://doi.org/10.1016/j.ijheatmasstransfer.2022.122839
  • Krizhevsky, A., & Hinton, G. (2009). Learning multiple layers of features from tiny images.
  • Lee, J., Chul Lee, Y., & Tae Kim, J. (2020). Fault detection based on one-class deep learning for manufacturing applications limited to an imbalanced database. Journal of Manufacturing Systems, 57, 357–366. https://doi.org/10.1016/j.jmsy.2020.10.013
  • Liang, H., Yang, C., Huang, K., Yonggang, L., & Gui, W. (2021). A hybrid first principles and data-driven process monitoring method for zinc smelting roasting process. IEEE Transactions on Instrumentation and Measurement, 70, 1–14. https://doi.org/10.1109/TIM.2021.3126390
  • Lindemann, B., Jazdi, N., & Weyrich, M. (2020). Anomaly detection and prediction in discrete manufacturing based on cooperative LSTM networks. In 2020 IEEE 16th International Conference on Automation Science and Engineering (CASE), Hong Kong (pp. 1003–1010). IEEE.
  • Lin, T.-Y., Maire, M., Belongie, S., Hays, J., Perona, P., Ramanan, D., Dollár, P., & Lawrence Zitnick, C. (2014). “Microsoft coco: Common objects in context.” In European conference on computer vision, Zurich, Switzerland (pp. 740–755). Springer.
  • Liu, Z., Guo, S., Wang, L., Du, B., & Pang, S. (2019). A multi-objective service composition recommendation method for individualized customer: Hybrid MPA-GSO-DNN model. Computers & Industrial Engineering, 128, 122–134. https://doi.org/10.1016/j.cie.2018.12.042
  • Liu, X., Kan, C., Zehao, Y., & Liu, B. (2022). In-process multiscale performance evaluation of FDM-based honeycomb structures with geometric defects. In AIAA SCITECH 2022 forum (pp. 1425). The American Institute of Aeronautics and Astronautics (AIAA).
  • Liu, D., & Wang, Y. (2022). Metal additive manufacturing process design based on physics constrained neural networks and multi-objective bayesian optimization. Manufacturing Letters, 33, 817–827. https://doi.org/10.1016/j.mfglet.2022.07.101
  • Li, X., Zhang, W., Ding, Q., & Sun, J.-Q. (2020). Intelligent rotating machinery fault diagnosis based on deep learning using data augmentation. Journal of Intelligent Manufacturing, 31(2), 433–452. https://doi.org/10.1007/s10845-018-1456-1
  • Lotfi, M., Farid, A. A., & Soleimanimehr, H. (2016). A new hybrid model based on the radius ratio for prediction of effective cutting limit of chip breakers. Proceedings of the Institution of Mechanical Engineers, Part B: Journal of Engineering Manufacture, 230(8), 1417–1427.
  • Luo, W., Tianliang, H., Yingxin, Y., Zhang, C., & Wei, Y. (2020). A hybrid predictive maintenance approach for CNC machine tool driven by digital twin. Robotics and Computer-Integrated Manufacturing, 65, 101974. https://doi.org/10.1016/j.rcim.2020.101974
  • McGowan, E., Gawade, V., & Guo, W. (2022). A physics-informed convolutional neural network with custom loss functions for porosity prediction in laser metal deposition. Sensors, 22(2), 494. https://doi.org/10.3390/s22020494
  • Mei, Z., Ding, J., Chen, L., Ting, P., & Mei, Z. (2019). Hybrid multi-domain analytical and data-driven modeling for feed systems in machine tools. Symmetry, 11(9), 1156. https://doi.org/10.3390/sym11091156
  • Mekarthy, S. M. R. (2019). A Combined Finite Element and Machine Learning Approach for Predicting Specific Cutting Forces and Maximum Tool Temperatures During Orthogonal Machining [PhD diss]. The University of North Carolina at Charlotte.
  • Moges, T., Yang, Z., Jones, K., Feng, S., Witherell, P., & Yan, L. (2021). Hybrid modeling approach for melt-pool prediction in laser powder bed fusion additive manufacturing. Journal of Computing and Information Science in Engineering, 21(5). https://doi.org/10.1115/1.4050044
  • Mondal, S., Gwynn, D., Ray, A., & Basak, A. (2020). Investigation of melt pool geometry control in additive manufacturing using hybrid modeling. Metals, 10(5), 683. https://doi.org/10.3390/met10050683
  • Naeem, K., Ullah, M., Tariq, A., Maqsood, S., Akhtar, R., Nawaz, R., & Hussain, I. (2016). Optimization of steel bar manufacturing process using six sigma. Chinese Journal of Mechanical Engineering, 29(2), 332–341. https://doi.org/10.3901/CJME.2015.1225.155
  • Nami, Z., Misman, O., Erbil, A., & May, G. S. (1997). Semi-empirical neural network modeling of metal-organic chemical vapor deposition. IEEE Transactions on Semiconductor Manufacturing, 10(2), 288–294. https://doi.org/10.1109/66.572084
  • Nannapaneni, S., & Mahadevan, S. (2016). Manufacturing process evaluation under uncertainty: A hierarchical bayesian network approach. In International Design Engineering Technical Conferences and Computers and Information in Engineering Conference, Charlotte, North Carolina, USA (Vol. 50084, pp. V01BT02A026). American Society of Mechanical Engineers.
  • Nouri, M., Artozoul, J., Caillaud, A., Ammar, A., Chinesta, F., & Köser, O. (2022). Shrinkage porosity prediction empowered by physics-based and data-driven hybrid models. International Journal of Material Forming, 15(3), 25. https://doi.org/10.1007/s12289-022-01677-5
  • Okaro, I. A., Jayasinghe, S., Sutcliffe, C., Black, K., Paoletti, P., & Green, P. L. (2019). Automatic fault detection for laser powder-bed fusion using semi-supervised machine learning. Additive Manufacturing, 27, 42–53. https://doi.org/10.1016/j.addma.2019.01.006
  • Okura, T., Ahmad, I., Kano, M., Hasebe, S., Kitada, H., & Murata, N. (2013). High-performance prediction of molten steel temperature in tundish through gray-box model. ISIJ International, 53(1), 76–80. https://doi.org/10.2355/isijinternational.53.76
  • Olleak, A., & Zhimin, X. (2020). Calibration and validation framework for selective laser melting process based on multi-fidelity models and limited experiment data. Journal of Mechanical Design, 142(8), 081701. https://doi.org/10.1115/1.4045744
  • Papacharalampopoulos, A., Sabatakakis, K., & Stavropoulos, P. (2021). Incorporating process physics phenomena in formation of digital twins: Laser welding case. Procedia CIRP, 99, 490–495. https://doi.org/10.1016/j.procir.2021.03.069
  • Pasandideh, S. H. R., Niaki, S. T. A., & Atyabi, S. M. (2014). A new approach to solve multi-response statistical optimization problems using neural network, genetic algorithm, and goal attainment methods. International Journal of Advanced Manufacturing Technology, 75(5–8), 1149–1162. https://doi.org/10.1007/s00170-014-6206-8
  • Perez, L., & Wang, J. (2017). The effectiveness of data augmentation in image classification using deep learning. arXiv preprint arXiv:1712.04621.
  • Pillai, P., Kaushik, A., Bhavikatti, S., Roy, A., & Kumar, V. (2016). A hybrid approach for fusing physics and data for failure prediction. International Journal of Prognostics and Health Management, 7(4). https://doi.org/10.36001/ijphm.2016.v7i4.2463
  • Pittner, A., Weiss, D., Schwenk, C., & Rethmeier, M. (2008). Methodology to improve applicability of welding simulation. Science and Technology of Welding and Joining, 13(6), 496–508. https://doi.org/10.1179/136217108X329322
  • Qi, B., & Park, H. S. (2020). Data-driven digital twin model for predicting grinding force. In IOP Conference Series: Materials Science and Engineering, Chennai, India (Vol. 916, pp. 012092). IOP Publishing.
  • Rahimi, M. H., Nam Huynh, H., & Altintas, Y. (2021). On-line chatter detection in milling with hybrid machine learning and physics-based model. CIRP Journal of Manufacturing Science and Technology, 35, 25–40. https://doi.org/10.1016/j.cirpj.2021.05.006
  • Rai, R., & Sahu, C. K. (2020). Driven by data or derived through physics? a review of hybrid physics guided machine learning techniques with cyber-physical system (cps) focus. Institute of Electrical and Electronics Engineers Access, 8, 71050–71073. https://doi.org/10.1109/ACCESS.2020.2987324
  • Raissi, M., & Em Karniadakis, G. (2018). Hidden physics models: Machine learning of nonlinear partial differential equations. Journal of Computational Physics, 357, 125–141. https://doi.org/10.1016/j.jcp.2017.11.039
  • Rajesh, R., Dev Anand, M., & Benny, K. N. (2017). Prediction of EDM process parameters for AISI 1020 steel using RSM, GRA and ANN. International Journal of Mechanical Engineering and Technology, 8(5), 924–940.
  • Reinhart, R. F., Shareef, Z., & Jakob Steil, J. (2017). Hybrid analytical and data-driven modeling for feed-forward robot control †. Sensors, 17(2), 311. https://doi.org/10.3390/s17020311
  • Relan, R., Grønborg Junker, R., Nauheimer, M., Høgsbro Thygesen, U., Lindstrom, E., & Madsen, H. (2021). Physics informed stochastic grey-box model of the flow-front in a vacuum assisted resin transfer moulding process with missing data. IFAC-Papersonline, 54(7), 797–802. https://doi.org/10.1016/j.ifacol.2021.08.459
  • Ren, J.-C., Liu, D., & Wan, Y. (2022). Data-driven and mechanism-based hybrid model for semiconductor silicon monocrystalline quality prediction in the czochralski process. IEEE Transactions on Semiconductor Manufacturing, 35(4), 658–669. https://doi.org/10.1109/TSM.2022.3202610
  • Ren, Y., Wang, Q., & Michaleris, P. (2019). Machine-learning based thermal-geometric predictive modeling of laser powder bed fusion additive manufacturing. In Dynamic Systems and Control Conference, Park City, Utah, USA (Vol. 59148, pp. V001T10A001). American Society of Mechanical Engineers.
  • Roland, W., Marschik, C., Kommenda, M., Haghofer, A., Dorl, S., & Winkler, S. (2021). Predicting the non-linear conveying behavior in single-screw extrusion: A comparison of various data-based modeling approaches used with CFD simulations. International Polymer Processing, 36(5), 529–544. https://doi.org/10.1515/ipp-2020-4094
  • Sabadka, D., Molnar, V., & Fedorko, G. (2017). The use of lean manufacturing techniques–SMED analysis to optimization of the production process. Advances in Science and Technology Research Journal, 11(3), 187–195. https://doi.org/10.12913/22998624/76067
  • Sadoughi, M., & Chao, H. (2019). Physics-based convolutional neural network for fault diagnosis of rolling element bearings. IEEE Sensors Journal, 19(11), 4181–4192. https://doi.org/10.1109/JSEN.2019.2898634
  • Saqlain, M., Abbas, Q., & Yun Lee, J. (2020). A deep convolutional neural network for wafer defect identification on an imbalanced dataset in semiconductor manufacturing processes. IEEE Transactions on Semiconductor Manufacturing, 33(3), 436–444. https://doi.org/10.1109/TSM.2020.2994357
  • Sarishvili, A., Just, D., Moser, K., Wirsen, A., Diemert, J., & Jirstrand, M. (2021). Plastic extrusion process optimization by digital twins. Chemie Ingenieur Technik, 93(12), 1949–1954. https://doi.org/10.1002/cite.202100093
  • Schoeneberger, J. C., Aker, B., & Fricke, A. (2021). Explaining and integrating machine learning models with rigorous simulation. Chemie Ingenieur Technik, 93(12), 1998–2009. https://doi.org/10.1002/cite.202100089
  • Sim, J. (2018). Hybrid nugget diameter prediction for resistance spot welding. Procedia Manufacturing, 17, 395–402. https://doi.org/10.1016/j.promfg.2018.10.062
  • Slatteke, O., & Astrom, K. J. (2005). Modeling of a steam heated rotating cylinder-a grey-box approach. In Proceedings of the 2005, American Control Conference, 2005, Portland, Oregon, USA (pp. 1449–1454). IEEE.
  • Smith, S., & Randall, R. B. (2015). Rolling element bearing diagnostics using the Case Western Reserve University data: A benchmark study. Mechanical systems and signal processing, 64, 100–131.
  • Song, L., Wang, L., Jun, W., Liang, J., & Liu, Z. (2021). Integrating physics and data driven cyber-physical system for condition monitoring of critical transmission components in smart production line. Applied Sciences, 11(19), 8967. https://doi.org/10.3390/app11198967
  • Spiegel, I. A., Sammons, P., & Barton, K. (2019). Hybrid modeling of electrohydrodynamic jet printing. IEEE Transactions on Control Systems Technology, 28(6), 2322–2335. https://doi.org/10.1109/TCST.2019.2939963
  • Śpiewak, S., Jitpraphai, T., & Nickel, T. (1999). “Grey box” model based visualisation of vibrations in mechanical systems. The International Journal of Advanced Manufacturing Technology, 15(10), 743–748. https://doi.org/10.1007/s001700050127
  • Sun, H., Cao, D., Zhao, Z., & Kang, X. (2018). A hybrid approach to cutting tool remaining useful life prediction based on the wiener process. IEEE Transactions on Reliability, 67(3), 1294–1303. https://doi.org/10.1109/TR.2018.2831256
  • Sun, C., Meng, M., Zhao, Z., Tian, S., Yan, R., & Chen, X. (2018). Deep transfer learning based on sparse autoencoder for remaining useful life prediction of tool in manufacturing. IEEE Transactions on Industrial Informatics, 15(4), 2416–2425.
  • Sun, B., Yang, C., Wang, Y., Gui, W., Craig, I., & Olivier, L. (2020). A comprehensive hybrid first principles/machine learning modeling framework for complex industrial processes. Journal of Process Control, 86, 30–43. https://doi.org/10.1016/j.jprocont.2019.11.012
  • Sun, J., Yan, C., & Wen, J. (2017). Intelligent bearing fault diagnosis method combining compressed data acquisition and deep learning. IEEE Transactions on Instrumentation and Measurement, 67(1), 185–195. https://doi.org/10.1109/TIM.2017.2759418
  • Sutherland, J., Dornfeld, D., & Linke, B. (2018). Energy efficient manufacturing: Theory and applications. Wiley.
  • Swischuk, R., Mainini, L., Peherstorfer, B., & Willcox, K. (2019). Projection-based model reduction: Formulations for physics-based machine learning. Computers & Fluids, 179, 704–717. https://doi.org/10.1016/j.compfluid.2018.07.021
  • Tabar, R. S., Wärmefjord, K., & Söderberg, R. (2020). A new surrogate model–based method for individualized spot welding sequence optimization with respect to geometrical quality. The International Journal of Advanced Manufacturing Technology, 106(5), 2333–2346. https://doi.org/10.1007/s00170-019-04706-x
  • Teren, T., Penter, L., Peukert, C., & Ihlenfeldt, S. (2022). A finite-difference based parallel solver algorithm for online-monitoring of resistance spot welding. Materials, 15(18), 6348. https://doi.org/10.3390/ma15186348
  • Tian, J., Morillo, C., Azarian, M. H., & Pecht, M. (2015). Motor bearing fault detection using spectral kurtosis-based feature extraction coupled with K-nearest neighbor distance analysis. IEEE Transactions on Industrial Electronics, 63(3), 1793–1803. https://doi.org/10.1109/TIE.2015.2509913
  • Tidriri, K., Chatti, N., Verron, S., & Tiplica, T. (2016). Bridging data-driven and model-based approaches for process fault diagnosis and health monitoring: A review of researches and future challenges. Annual Reviews in Control, 42, 63–81. https://doi.org/10.1016/j.arcontrol.2016.09.008
  • Umbrello, D., Ambrogio, G., Filice, L., & Shivpuri, R. (2008). A hybrid finite element method–artificial neural network approach for predicting residual stresses and the optimal cutting conditions during hard turning of AISI 52100 bearing steel. Materials & Design, 29(4), 873–883. https://doi.org/10.1016/j.matdes.2007.03.004
  • Vorkov, V., Rodrigues, G. C., & Duflou, J. R. (2019). Gray box approach for prediction of air bending. Procedia Manufacturing, 27, 45–50. https://doi.org/10.1016/j.promfg.2018.12.042
  • Wang, P., & Gao, R. X. (2020). Transfer learning for enhanced machine fault diagnosis in manufacturing. CIRP Annals, 69(1), 413–416. https://doi.org/10.1016/j.cirp.2020.04.074
  • Wang, R., Mohanty, I., Srivastava, A., Kumar Roy, T., Gupta, P., & Chattopadhyay, K. (2022). Hybrid method for endpoint prediction in a basic oxygen furnace. Metals, 12(5), 801. https://doi.org/10.3390/met12050801
  • Wang, Z., Rosa, S., Yang, B., Wang, S., Trigoni, N., & Markham, A. (2018). 3d-physnet: Learning the intuitive physics of non-rigid object deformations. arXiv preprint arXiv:1805.00328.
  • Wang, Z., Yang, W., Liu, Q., Zhao, Y., Liu, P., Dazhong, W., Banu, M., & Chen, L. (2022). Data-driven modeling of process, structure and property in additive manufacturing: A review and future directions. Journal of Manufacturing Processes, 77, 13–31. https://doi.org/10.1016/j.jmapro.2022.02.053
  • Wang, J., Yulin, M., Zhang, L., Gao, R. X., & Dazhong, W. (2018). Deep learning for smart manufacturing: Methods and applications. Journal of Manufacturing Systems, 48, 144–156. https://doi.org/10.1016/j.jmsy.2018.01.003
  • Wattamwar, S., Weiland, S., & Backx, T. (2008). Identification of low dimensional models for slow geometric parameter variation in an industrial glass manufacturing process. In 2008 IEEE International Conference on Control Applications, San Antonio, Texas, USA (pp. 989–994). IEEE.
  • Wuest, T., Weimer, D., Irgens, C., & Thoben, K.-D. (2016). Machine learning in manufacturing: Advantages, challenges and applications. Production Manufacturing Research, 4(1), 23–45. https://doi.org/10.1080/21693277.2016.1192517
  • Xie, J., Chai, Z., Luming, X., Ren, X., Liu, S., & Chen, X. (2022). 3D temperature field prediction in direct energy deposition of metals using physics informed neural network. The International Journal of Advanced Manufacturing Technology, 119(5–6), 3449–3468. https://doi.org/10.1007/s00170-021-08542-w
  • Xiong, Q., & Jutan, A. (2002). Grey-box modelling and control of chemical processes. Chemical Engineering Science, 57(6), 1027–1039. https://doi.org/10.1016/S0009-2509(01)00439-0
  • Yang, Z., Eddy, D., Krishnamurty, S., Grosse, I., Denno, P., Yan, L., & Witherell, P. (2017). Investigating grey-box modeling for predictive analytics in smart manufacturing. In International Design Engineering Technical Conferences and Computers and Information in Engineering Conference, Cleveland, Ohio, USA (Vol. 58134, pp. V02BT03A024). American Society of Mechanical Engineers.
  • Yang, L., & Özel, T. (2021). Physics-based simulation models for digital twin development in laser powder bed fusion. International Journal of Mechatronics and Manufacturing Systems, 14(2), 143–163. https://doi.org/10.1504/IJMMS.2021.119155
  • Yang, R.-Y., & Rai, R. (2019). Machine auscultation: enabling machine diagnostics using convolutional neural networks and large-scale machine audio data. Advances in Manufacturing, 7(2), 174–187. https://doi.org/10.1007/s40436-019-00254-5
  • Yang, X., Ran, Y., Zhang, G., Wang, H., Zongyi, M., & Zhi, S. (2022). A digital twin-driven hybrid approach for the prediction of performance degradation in transmission unit of CNC machine tool. Robotics and Computer-Integrated Manufacturing, 73, 102230. https://doi.org/10.1016/j.rcim.2021.102230
  • Yazdi, R. M., Imani, F., & Yang, H. (2020). A hybrid deep learning model of process-build interactions in additive manufacturing. Journal of Manufacturing Systems, 57, 460–468. https://doi.org/10.1016/j.jmsy.2020.11.001
  • Yucesan, Y. A., & Viana, F. (2020). A hybrid model for main bearing fatigue prognosis based on physics and machine learning. In AIAA Scitech 2020 forum (p. 1412). The American Institute of Aeronautics and Astronautics (AIAA).
  • Yucesan, Y. A., & Viana, F. A. (2020). A physics-informed neural network for wind turbine main bearing fatigue. International Journal of Prognostics and Health Management, 11(1). https://doi.org/10.36001/phmconf.2019.v11i1.807
  • Yumei, Q., You, W., Shen, C., Jiang, X., Huang, W., & Zhu, Z. (2017). Hierarchical diagnosis network based on sparse deep neural networks and its application in bearing fault diagnosis. In 2017 Prognostics and System Health Management Conference (PHM-Harbin), Harbin, China (pp. 1–7). IEEE.
  • Yun, J. P., Crino Shin, W., Koo, G., Su Kim, M., Lee, C., & Jun Lee, S. (2020). Automated defect inspection system for metal surfaces based on deep learning and data augmentation. Journal of Manufacturing Systems, 55, 317–324. https://doi.org/10.1016/j.jmsy.2020.03.009
  • Zhang, X., Liu, L., Wan, X., & Feng, B. (2021). Tool wear online monitoring method based on DT and SSAE-PHMM. Journal of Computing and Information Science in Engineering, 21(3), 3. https://doi.org/10.1115/1.4050531
  • Zhang, J., Mengcheng, Q., & Li, B. (2016). Hybrid diagnosis of fault tree and Bayesian network in BIW automatic welding production line.” In 2016 IEEE Advanced Information Management, Communicates, Electronic and Automation Control Conference (IMCEC), Xian, China (pp. 297–300). IEEE.
  • Zhang, Y., Xie, M., Yihai, H., & Han, X. (2022). Capability-based remaining useful life prediction of machining tools considering non-geometry and tolerancing features with a hybrid model. International Journal of Production Research, 61(21), 7540–7556. https://doi.org/10.1080/00207543.2022.2152126
  • Zhou, H., Kun-Ming, Y., & Hsu, H.-P. (2021). Hybrid modeling method for soft sensing of key process parameters in chemical industry. Sensors and Materials, 33(8), 2789. https://doi.org/10.18494/SAM.2021.3436
  • Zhou, H., Zhang, H., & Yang, C. (2019). Hybrid-model-based intelligent optimization of ironmaking process. IEEE Transactions on Industrial Electronics, 67(3), 2469–2479. https://doi.org/10.1109/TIE.2019.2903770
  • Zotov, E., & Kadirkamanathan, V. (2021). CycleStyleGAN-based knowledge transfer for a machining digital twin. Frontiers in Artificial Intelligence, 4, 175. https://doi.org/10.3389/frai.2021.767451