Prévia do material em texto
Reliability Engineering and System Safety 226 (2022) 108700 Available online 2 July 2022 0951-8320/© 2022 Elsevier Ltd. All rights reserved. A real-time probabilistic risk assessment method for the petrochemical industry based on data monitoring Rui He a,*, Jingyu Zhu b, Guoming Chen b, Zhigang Tian a a Department of Mechanical Engineering, University of Alberta, Edmonton, AB, Canada b Centre for Offshore Engineering and Safety Technology (COEST), China University of Petroleum (East China), No.66, Changjiang West Road, Qingdao, China A R T I C L E I N F O Keywords: Probabilistic risk assessment (PRA) Dynamic Bayesian network (DBN) Data monitoring Petrochemical industry A B S T R A C T Safety is of high societal concern in the petrochemical industry. With advancing digitalization, the techniques of probabilistic risk assessment (PRA), which provide a system-level perspective for industrial risk analysis, have become increasingly dynamic. This paper provides a further advancement in this direction by proposing a risk updating method based on the dynamic Bayesian network (DBN) to incorporate data monitoring into PRA in real- time. We update the probabilistic risk for basic events in Bayesian networks based on their prior probability and the probability of their online monitoring signals exceeding the alarm threshold. The key idea behind this approach is that if the signal of a basic event exceeds the alarm threshold, the corresponding risk should increase. Otherwise, the basic event should have a low risk. The contribution in this paper is twofold. First, residual methods are developed to estimate signal probabilities exceeding abnormal thresholds. Second, a DBN model is proposed to integrate prior risk with data monitoring for risk analysis. The proposed DBN model does not require additional expert knowledge and historical accident data to define the conditional relationship between data monitoring and prior risk. The proposed approach is validated using the RT 580 experimental setup and managed pressure drilling operations. 1. Introduction Modern petrochemical processes are complex socio-technical sys- tems that are susceptible to catastrophic accidents due to an abnormal process, instrumentation, or human error [1]. Probabilistic risk assess- ment (PRA) provides a well-established family of techniques to address safety, risk, and reliability of these complex petrochemical systems [2]. To equip PRA with the capability of investigating accident propagation mechanisms, various methodologies have been proposed over the past decades, including fault tree analysis (FTA) [3], event tree analysis (ETA) [4], bow-tie (BT) [5], and Bayesian network (BN) [6]. These traditional PRA techniques consider possible accident propagation with chain rules and use pre-set deterministic failure probabilities to assess the likelihood of accidents. However, traditional PRA methods are limited by the use of static failure probabilities and thus is hard to capture deviations and changes in system status. As the modern industry moves towards a data-driven “Industry 4.0”, advances in sensor tech- nology have made PRA techniques more and more dynamic, but effec- tively utilizing data information in PRA remains a challenge [7]. Most petrochemical processes are intermittent systems with multiple working phases. During system operations, the risk of accidents may change over time, and the accident propagation mechanism during phase shifts may be different. To account for the time-dependent behavior of accidents, engineers started to consider additional time slices in PRA methods. OREDA [8] provides a reference for exponential failure rates of most petrochemical equipment, and thus it is possible to develop static PRA into dynamic forms based on failure rates. Typically, constant failure rates have been widely used for extending BN into a dynamic Bayesian network (DBN) to consider time-dependent opera- tions and equipment degradations [9–11]. When statistical accident data or precursor data is available, the risk can also be updated in PRA by Bayesian inference. Exponential distribution and Beta distribution can be assumed as the basic probability distribution of possible accidents in ETA or BN, and the failure probabilities of safety barriers are updated by counting the number of accidents [12,13]. Since the available acci- dent data is often limited, hierarchical Bayesian analysis (HBA) is pro- posed to use scarce statistical data to update the failure probability distribution of safety barriers [14,15]. Other graph-based methods, such as directed acyclic graphs (DAG) that assign a deterministic value to each path for the risk propagation analysis also use statistical accident * Corresponding author. E-mail address: rhe5@ualberta.ca (R. He). Contents lists available at ScienceDirect Reliability Engineering and System Safety journal homepage: www.elsevier.com/locate/ress https://doi.org/10.1016/j.ress.2022.108700 Received 20 August 2021; Received in revised form 18 May 2022; Accepted 26 June 2022 mailto:rhe5@ualberta.ca www.sciencedirect.com/science/journal/09518320 https://www.elsevier.com/locate/ress https://doi.org/10.1016/j.ress.2022.108700 https://doi.org/10.1016/j.ress.2022.108700 https://doi.org/10.1016/j.ress.2022.108700 http://crossmark.crossref.org/dialog/?doi=10.1016/j.ress.2022.108700&domain=pdf Reliability Engineering and System Safety 226 (2022) 108700 2 data to update risk by assuming distributions of risk factors [16]. One issue is that statistical data are often collected from a population of similar systems and cannot fully interpret the specific features of target systems. In addition, the accident data is commonly hard to collect in the short term, and thus these PRA techniques can only be updated regularly over long periods, such as yearly or weekly [17]. Due to the real-time characteristics of operating parameters in petrochemical industries, the credibility of statistical data is always in question [18]. Advances in sensor technology provide online data sup- port for PRA techniques, and therefore, PRA methods have begun to update estimated risk in real-time using online measurements. Unlike process monitoring, PRA uses risk to measure the lack of safety from system-level perspectives and thus can identify possible problems and make corrective decisions before accidents occur [19]. Commonly, an accident is caused by a combination of several basic events, including abnormal operating conditions and safety barrier failures. Condition data can be incorporated in degradation models based on reliability assessment methods when considering the dynamic mechanism of safety barriers. Initially, a condition fault tree is proposed to update the failure rates of components according to equipment condition data [20]. Model-based methods, like Kalman filter and particle filtering (PF), are developed to update the degradation state of safety barriers based on real-time measurements [21–23]. In addition, Gaussian mixture model (GMM) is developed in [24] to estimate the degradation of safety bar- riers and identify health status in real-time. Furthermore, in [7], a sys- tematical framework is presented to incorporate the degradation model into PRA techniques. Even though the degradation characteristics of safety barriers are effectively modeled, the reliability-based methods cannot fully meet the requirement of petrochemical industries because the instantaneous changes in operations cannot be considered. In addi- tion, the process equipment will not be significantly degraded in a short operating period. To equip PRA with data monitoring, some works construct data- driven risk models based on large amounts of historical or experi- mental accident data. Commonly, as in [25–27], BN structural and parameter learning are performed to learn probabilistic relationships between operating parameters andaccidents using historical event data, and the learned BN can be used to predict the probability of accidents when the online data is available. In [28–30], data monitoring is incorporated into BN models for risk analysis, where BN parameters are determined using statistical methods such as maximum likelihood esti- mation (MLE) based on sufficient accident-related data. Also in [31,32], a risk-based warning model is constructed for real-time risk analysis using historical data and expert knowledge. Oftentimes, accident-related data are challenging to collect, and thus statistical and data-driven approaches may lead to biased and inconsistent estimates in PRA models. Other related works determine different failure probabili- ties for basic events under varying operations. In [33,34], different risk-based loss functions are defined to use monitoring data to assess accident probabilities in real-time. In addition, in [35–37], the online monitoring signal is divided into multiple discrete levels (e.g., high, medium, low), and corresponding failure probabilities are set for basic events under different groups according to expert knowledge. However, despite no use of historical data, these PRA techniques determine the risks of basic events based only on the online information while ignoring their prior probabilities in real-time risk estimations. If the prior prob- ability is not considered, the risk of an event will depend entirely on the data monitoring and will be unrelated to its own accident mechanism. Only a few works, for example [38,39], consider both data monitoring and prior probabilities in PRA without using historical accident data, but expert knowledge is still required to determine the effect of different monitoring parameters on the probability of basic events or accidents. In this work, we provide a further advancement in this direction by proposing a real-time PRA approach that embeds both data monitoring and prior knowledge in risk analysis. We start with the estimates of the probability that online signals exceed the alarm threshold. Thereafter, these estimated probabilities are incorporated into PRA to update the risk of basic events based on their prior knowledge. The significant contribution of this work is twofold. First, residual-based methods are developed for environmental and process monitoring to estimate the probability of their online signals exceeding abnormal thresholds. Sec- ond, a DBN model is proposed to integrate prior risk with data moni- toring for risk analysis. The proposed DBN model does not require additional expert knowledge and historical accident data for each basic event to determine the conditional relationship (conditional probability table) between data monitoring and prior risk so that the proposed PRA is practical, especially when historical data is scarce. The rest of the paper is organized as follows. In Section 2, the pre- liminaries and engineering motivations are briefly introduced, and the critical problems are formulated. In Section 3, the proposed real-time PRA method is described. Section 4 uses two cases, including the RT 580 experimental setup and managed pressure drilling (MPD) operation, to demonstrate and validate the proposed method. Finally, conclusions are made in Section 5. 2. Preliminaries and problem formulation This work aims to predict the probability of accidents and to search for critical events with potential accident risks. Let TE represents the top event (main accident) of the Fault Tree (FT) and suppose that there are N basic events to determine the accident, denoted by BEi, i = 1, 2, …, N. Based on the logical relationship of FT gates, the probability of an ac- cident is a function of the prior probability of basic events PTE = fFT(PBE1 ,PBE2 ,…,PBEN ), (1) wherefFT(⋅) denotes the FT model and PBEi is the prior probability of the i-th basic event obtained by FTA. The mapping algorithm in [6] can be used to set conditional prob- ability tables (CPTs) to map FT to BN. In order to discover the potential basic events that may lead to accidents, the risk index is defined as the conditional probability of BEi given that TE has already occurred. The conditional probabilities can be estimated based on the rules of BN P(BEi|TE) = P{BEioccurs|TEhasoccured} = P(TE|BEi)PBEi∑ BEi P(TE,BEi) , i = 1, 2,…N. (2) To meet the timing requirement of petrochemical industries, data monitoring is considered in PRA. Fig. 1 shows a simple process (feeding tank) and its FT with high-level incident [40]. The top event, i.e., high level, results from the combined effect of both the abnormal operating flow and the failure of safety barriers. In this case, the condition of input and output flow and the level control logic can be respectively observed from FI01, FI02, and LIC01, and the pump status can be assessed based on the accelerometers. The online-monitoring data can reflect the sys- tem operating status. For example, if FI01 is high, the risk of increased feed should also rise. However, since different events have different prior probabilities, the updated risk should not only depend on data monitoring but also on the prior risk of the event. When a BN model is constructed, the nodes of BN are categorized into static nodes and dynamic nodes. The real-time probability of a static node is always equal to its prior probability, while the probability of dynamic nodes will be continuously updated based on data monitoring. Dynamic nodes can be further classified into environmental nodes and process nodes according to signal characteristics. Environmental nodes refer to various environmental variables that are not affected by changes in operating conditions, such as wind speed and formation pressure. Process nodes, such as input flow, tank level, and pump vibration, are associated with process variables affected by varying working condi- tions. We assume that the risk of m environmental nodes can be updated based on environmental monitoring, and the risk of n process nodes can R. He et al. Reliability Engineering and System Safety 226 (2022) 108700 3 be updated based on process monitoring. In this work, the probability of online signals beyond the alarm threshold is first estimated for each dynamic node, which can be regarded as the probability that the event deviates from the normal status, denoted byPMONi , i = 1, 2,…, m+n. Afterward, the risks of dynamic nodes are estimated in real-time by integrating PMON and their prior probability PBE. The PRA tasks are formally defined as: (1) forward risk updating: update the probability of basic events at the current time t = tk, k = 1, 2, …,T, based on data monitoring available up to tk, to predict the probability of top event at time tk; (2) posterior risk estimation: given that the top event has occurred, estimate the risk index (posterior probability) of basic events at time tk to discover the critical event with potential accident risks. Fig. 1. Process diagram and Fault Tree model for a feeding tank. Fig. 2. Procedures for real-time PRA based on data monitoring R. He et al. Reliability Engineering and System Safety 226 (2022) 108700 4 3. Proposed methodology Fig. 2 summarizes the major steps of the proposed real-time PRA method for petrochemical industries. Before conducting a risk analysis, the event type of each basic event is first determined. After that, the probability of the online signal exceeding the alarm thresholdPMON is estimated for each dynamic node. We show how to evaluatePMONi , i = 1, 2,…, m, for environmental nodes in Section 3.1.1, and present how to estimate PMONi , i = m+1, m+2,…, m+n, for process nodes in Section 3.1.2. Finally, in Section 3.2, a novel PRA method is presented to inte- grate PMON with prior probability PBE based on the idea of DBN for risk updating and posterior estimations in real-time. At time t = tk, the for- ward risk updating is performed first, then the posterior probability of basic events can be estimated based on the updated probability. 3.1. Probability calculation for online signal exceeding alarm threshold 3.1.1. Probability estimations for environmental nodes As mentioned before, the environmental nodes in this section denote the dynamic nodes that cannot be affected by changes in system oper- ating conditions, such as increased wind speed and deduced ambient temperature. This section aims to estimate the probability that the on- line signal of the environmental node exceeds the alarm threshold. In general, the historical data of environment-related events is scarce for a new project, but the normal reference value and the alarm threshold of monitored parameters are often known in advance. Let AEi represent the i-th environmental node, i = 1, 2,…, m, and assume that there are M monitoring signals to determine the status of AEi denoted by xi,j, j = 1, 2, …, M. The probability estimation problems for an environmental node can be formulated as: given the reference signal value and the alarm threshold of xi,j, and the process operating measurements xi(t) = [xi,1(t), …, xi,M(t)]T, to estimate the probability of its monitoring parameters exceeding the alarm threshold PMONi (t)at time t. In order to combine multiple signals, the signal residual of the i-th environmental node is calculated by a multivariate logic solver ri,j(t) = ⃒ ⃒xi,j(t) − x0,i,j ⃒ ⃒ xa,i,j , ri(t) = 1 M ∑M j=1 ri,j(t), (3) where x0,i,j is the reference signal value of xi,j, xa,i,j is a positive value equal to the modulus of the difference between the alarm threshold and reference value, and therefore, ri(t) > 1 means that the residual value of AEi is over a threshold value, that is, xi(t) < xi,0-xi,a or xi(t) > xi,0 + xi,a. Let xU,i,j and xL,i,j represent the upper limit and lower limit of xi,j, respectively. Oftentimes, only one threshold (upper or lower limit) exists for a basic event. Therefore, when only xU,i,j exists and xi,j(t) > x0,i,j, or only xL,i,j exists and xi,j(t) < x0,i,j, the value of ri,j(t) is calculated by Eq. (3); otherwise, ri,j(t) is equal to 0. The probability PMON increases when the monitored residual moves further away from the normal zone. This behavior can be captured using a cumulative normal distribution [41]. Therefore, the probabilityPMONof AEi at time t is defined as PMONi (t) = φ ( ri(t) − 1 σ ) = ∫ ri(t) − ∞ 1̅̅̅ ̅̅ 2π √ σ e (r− 1)2 2σ2 dr, (4) where σ = 1/3 up to the three-sigma rule. Fig. 3 shows a visual depiction of the probability estimation. The probability PMON for a point on the line ri = 0 is 0, and for a given point on the threshold ri = 1 is equal to 0.5. When the online residual ri(t) is at the threshold (ri =1), the probability of monitoring data exceeding the alarm threshold is 0.5 as it can either go back to the normal zone or keep growing and lead to the abnormal event with the same probability. In this way, it is possible to estimate the probability that the moni- toring parameters of an environmental event exceed the alarm threshold based on data monitoring in real-time. However, PMON is not equal to the probability of the occurrence of environmental events. In Section 3.2, we combine PMON and prior knowledge to estimate the real-time risk for all dynamic events. 3.1.2. Probability estimations for process nodes Unlike in environmental nodes, the monitoring parameters in pro- cess nodes will be affected by varying working conditions. As operating conditions vary over time, the reference signal values and alarm thresholds of a process node may also change accordingly. Often, many fault-free monitoring data can be collected from systems under normal operating conditions. Therefore, let yi,j with j = 1, 2, …, M denotes the j- th process monitoring signal corresponding to the i-th process node. The probability estimation problems for process nodes can be formulated as: given historical fault-free data and online signalszi,t = [zi,1,t ,zi,2,t ,...,zi,M,t], i = m+1, m+2,…, m+n, to construct a nonlinear residual model for the i- th process node to estimate the probability of its residual signal exceeding the alarm threshold PMONi (t) at time t. Gaussian mixture model (GMM) can represent different operation scenarios in model building [42]. Therefore, GMM is applied to construct the residual model in this work. The probability density of a given set of training sample yi can then be approximated by GMM as P(yi|Λi) = ∑K k=1 βi,kg ( yi|λi,k ) , ∑K k=1 βi,k = 1, (5) where yi = [yi,1, …, yi,M]T is the given M-dimensional measurement sample, K is the number of Gaussian components, λi,k is a parameter vector containing mean value vector μi,k and covariance matrix Σi,k, and βi,k denotes the prior probability of the k-th Gaussian component. Conceptually, we denote the GMM compactly as Λi = {βi, μi, Σi}, where βi = [βi,1, …, βi,K], μi = [μi,1, …, μi,K], and Σi = [Σi,1, …, Σi,K]. In the offline stage, GMM parameters Λi = {βi, μi, Σi} are learned based on the Expectation Maximization (EM) algorithm by maximizing the log-likelihood of observing historical data. The minimum message length (MML) criterion is used to search the optimal number of Gaussian components. More details to train GMM can be found in [43]. When new signals zi,t are measured at time t in the online stage, the GMM model can be transformed into a dynamic form, and the proba- bility distribution p(yi,t+1| zi,t) at the time step (t+1) can be predicted via the Chapman-Kolmogorov equation P ( yi,t+1 ⃒ ⃒zi,t ) = ∫ P ( yi,t+1 ⃒ ⃒yi,t ) P ( yi,t ⃒ ⃒zi,t ) dyi,t ≈ ∑K k=1 βi,t,k ∫ g ( yi,t|μi,t,k ,Σi,t,k ) P ( yi,t+1 ⃒ ⃒yi,t ) dyi,t, (6) where μtn, Σtn, and βtn denotes the updated mean value vector, Fig. 3. Changes in the probability of signals exceeding alarm threshold with monitored residual. R. He et al. Reliability Engineering and System Safety 226 (2022) 108700 5 covariance matrix, and prior probability of the n-th Gaussian component at the time step t, respectively. According to the idea of particle filter (PF) [44], the posterior probability distribution P(yi,t| zi,t) can be represented by a set of random particles y(p)i,t , p = 1, 2, …, L, with corresponding weight ω (p) i,t , and thus Eq. (6) can be represented as P ( yi,t+1 ⃒ ⃒zi,t ) ≈ ∑L p=1 ω(p)i,t δ ( yi,t − y(p)i,t ) P ( yi,t+1 ⃒ ⃒yi,t ) = ∑L p=1 ω(p)i,t P ( yi,t+1 ⃒ ⃒ ⃒y(p)i,t ) , (7) whereδ(⋅)stands for the Dirac delta function. Initially, the weight of each particle is set as 1/L, and the weight is recursively updated based on the likelihood of the observation zi,t. To address the degeneracy problem, resampling described in [44] is applied in each step to eliminate the particles with small weights and duplicate the particles with high weights. After that, we can obtain a new set of particles y(p)i,t+1, p = 1, 2, …, L, with the same weight. The parameters of GMM can then be updated by a one-step EM algorithm [45] γ(p)i,t+1,k = βi,t,kg ( y(p)i,t+1|μi,t,k ,Σi,t,k ) ∑K k=1βi,t,kg ( y(p)i,t+1|μi,t,k ,Σi,t,k ), (8) μi,t+1,k = ( ∑L p=1 γ(p)i,t+1,k )− 1 ∑L p=1 y(p)i,t+1γ (p) i,t+1,k , (9) Σi,t+1,k = ( ∑L p=1 γ(p)i,t+1,k )− 1 ∑L p=1γ(p)i,t+1,k ( y(p)i,t+1 − μi,t+1,k )( y(p)i,t+1 − μi,t+1,k )T , (10) βi,t+1,k = 1 L ∑L p=1 γ(p)i,t+1,k. (11) When the parameters of GMM at the time step (t+1) are available, the residual signal at the time step (t+1) can be defined with the updated parameters of Gaussian components as [46] ri(t+ 1) = ∑K k=1 P ( Ci,k ⃒ ⃒zi,t+1 ) PMn ( zi,t+1 ) , 0 < ri(t+ 1) < 1, (12) with P ( Ci,k ⃒ ⃒zi,t+1 ) = βi,t+1,kg ( zi,t+1 ⃒ ⃒λi,t+1,k ) ∑K k=1βi,t+1,kg ( zi,t+1 ⃒ ⃒λi,t+1,k ), (13) PMn ( zi,t+1 ) = P { Dr ( yi,t+1,Ci,k ) ≤ Dr ( zi,t+1,Ci,k )} , (14) where P(Ci,k|zi,t+1) denotes the posterior probability of the monitored zi,t+1 belonging to the k-th Gaussian component, and PMn(zi,t+1) is the Mahalanobis distance of zi,t+1 from the center μi,t+1,k, which can be estimated as Dr ( zi,t+1,Ci,k ) = ( zi,t+1 − μi,t+1,k )T ( Σi,t+1,k + εI )− 1( zi,t+1 − μi,t+1,k ) , (15) where ε is a small positive number to avoid an ill-conditioned covariance matrix. Since Dr follows χ2 distribution with an appropriate degree of freedom, the threshold of the monitored residual signal should be set as the value of the confidence level (1-α), then the probability of monitored signals exceeding the alarm threshold for the i-th process node at time step (t+1) can be defined as PMONi (t+ 1) = φ ( ri(t + 1) − 1 + α σm ) = ∫ ri(t+1) − ∞ 1 ̅̅̅̅̅ 2π √ σm e (r− 1+α)2 2σ2m dr, (16) where σm = (1-α)/3 according to the three-sigma rule. It is worth noting that the monitored residuals larger than the threshold (1-α) should be scaled to ensure that when ri(t+1) is equal to 1, PMONi (t + 1)is also approximately equal to 1. The whole procedure for estimating the probability of process data exceeding the alarm threshold is summarized in Algorithm 1 below. 3.2. Integrated real-time risk analysis When the temporal dependencies are introduced, the probability function of the top event (accident) can be transformed to the dynamic form from Eq. (1), and the accident probability at time tk is predicted as PTE(tk) = fFT(PBE1 (tk),PBE2 (tk),…,PBEN (tk)). (17) As shown in Fig. 4, DBN is used to present the dynamic behavior of risk models. Each BN node has two functional states, i.e., Occurrence (Occ) and Non-occurrence (Non). Since the state of basic events is observed firmly based on data monitoring at each time, the conditional probability distribution of P(BEi(tk)|BEi(tk − 1)) is neglected [47]. Therefore, the risk index (posterior probability) of each basic event can be estimated based on the Bayes rule P(BEi(tk) = Occ|TE = Occ) = P(TE = Occ|BEi(tk) = Occ)PBEi (tk)∑ BEi(tk)=Occ,Non P(BEi(tk))P(TE = Occ|BEi(tk)) , i = 1, 2,…N. (18) From the above equations, we can see that the key point of PRA is to estimate the probability of BEi at time tk. In this paper, a BN (as shown in the red frame in Fig. 4) is developed to describe the dependencies be- tween prior probability PBEi and the corresponding alarm probability PMONi for the i-th event, and then we can haveP(BEi, MONi) = P(MONi|BEi)PBEi . Since PMONi obtained in Section 3.1 is a continuous value between 0 and 1, the continuous probability needs to be dis- cretized to construct the CPT of node MONi. Assume that the state of the node MONi is divided into Q discretization states S = {S1, S2, …, SQ}, where Q should be an even number and the state SQ corresponds to the probability from (Q − 1)/Q to 1. According to the definition in Section 3.1, the probability PMON is 0.5 for a given point on the alarm threshold. As shown in Fig. 4, we assume that the monitored signal of an event obeys a normal distribution, in which its probability density function (PDF) is denoted byf(⋅); the upper and lower limits of thef(⋅) are denoted by TU and TL, respectively; and τ is the alarm threshold set for the Algorithm 1 Probability estimation for process nodes based on GMM and PF Offline Inputs: historical fault-free data yi Outputs: Λi = {βi, μi, Σi} Step: Train GMM using the EM algorithm. Online (at time t+1) Inputs: Λi,t = {βi,t, μi,t, Σi,t}, zi,t, zi,t+1 Outputs: Λi,t+1 = {βi,t+1, μi,t+1, Σi,t+1}, PMONi(t+1) Step 1: Generate a set of random particlesy(p)i,t , p = 1, 2, …, L, according to the Λi,t = {βi,t, μi,t, Σi,t}. Step 2: Calculate the weight of particles based on zi,t. Step 3: Resample to eliminate particles with small weights and obtain a new set of particlesy(p)i,t+1, p = 1, 2, …, L. Step 4: Update the GMM parameters by Eq. (8)-(11) and get Λi,t+1 = {βi,t+1, μi,t+1, Σi,t+1}. Step 5: Calculate the monitored residual ri(t+1) based on Λi,t+1 and zi,t+1 by Eq. (12)- (15). Step 6: Calculate the probability PMONi(t+1) by Eq. (16) R. He et al. Reliability Engineering and System Safety 226 (2022) 108700 6 monitored signal. If the state of basic event is Non, the monitored signal will be in the range from TL to τ (corresponding to PMON from 0 to 0.5) with the probability of ∫ τ TL f(⋅) and will be in the range from τ to TU (corresponding to PMON from 0.5 to 1) with the probability of ∫ TU τ f(⋅), and vice versa. Using this idea, the distribution of the monitored signal can be scaled into CPT of BN. The conditional probability P(Sj |BE) can be estimated with a pre-set threshold τ and PDFf(⋅)as P ( Sj ⃒ ⃒BE ) = ⎧ ⎪⎪⎪⎪⎪⎪⎪⎨ ⎪⎪⎪⎪⎪⎪⎪⎩ ∫ 2τj/Q 2τ(j− 1)/Q f (x)dx ∫ TU TL f (x)dx , 0 <j ≤ Q 2 ∫ τ+(2j− Q)(TU − τ)/Q τ+(TU − τ)(2j− Q− 2)/Q f (x)dx ∫ TU TL f (x)dx , Q 2 < j ≤ Q . (19) When the historical data is available, for example, when the histor- ical fault-free data is useful, the kernel density estimation (KDE) can be used to estimate the PDF of the residual signal under normal conditions, and the estimated PDF can be applied as the f(⋅) for P(Sj ⃒ ⃒BE = Non). The ∑j=Q j=Q/2+1P(Sj ⃒ ⃒BE= Non) can denote the false alarm rate (FAR), and the ∑j=Q j=Q/2+1P(Sj ⃒ ⃒BE= Occ) can represent the fault detection rate (FDR). When no historical data is available, thef(⋅)can be defined as the PDF of normal distribution N(μ,σ), and the TU and TL can be set as (μ+3σ) and (μ-3σ), separately. The value of μ and σ can be adjusted arbitrarily to ensure TL = 0. A high value of FAR indicates many false alarms will be triggered when the process is normal, while a low FDR implies few alarms can be raised when the process is abnormal. The threshold τ is set to determine the value of FAR and FDR, which can represent the un- certainty in the risk monitoring model. For the case of BE = Non, the threshold τ can be set as (μ+2.5σ) to guarantee nearly 95% alarms are correct for the fault-free case; for the case of BE = Occ, the threshold τ can be set as (μ-2.5σ) to ensure only 5% alarms can not be triggered for the faulty case. When PMONi is observed, the corresponding state of the node MONi, denoted by S∗i , can be obtained, and PBEi (tk) can then be computed as follows PBEi (tk) = P ( S∗i ⃒ ⃒BEi ) PBEi ∑ BEi=Occ,NonP(BEi)P ( S∗i ⃒ ⃒BEi ). (20) The CPT in Fig. 4 shows an example where Q and μ are 10 and 3, respectively. As an example, for a basic event, if the alarm probability PMON is 0.25 and prior probability PBE is 0.05, the S∗ will be S3, and the updated probability will be 9.14× 10− 4 × 0.05/(9.14 × 10− 4 × 0.05 + 0.4072 × 0.95) = 1.18× 10− 4. If PMON is less than 0.5, the updated probability will be less than the prior probability. Furthermore, based on the idea of discretization, the P(MONi(tk)| BEi) at time tk can be computed directly by P(MONi(tk)|BEi) = ⎧ ⎪⎪⎪⎪⎪⎪⎪⎨ ⎪⎪⎪⎪⎪⎪⎪⎩ ∫ 2τPMONi (tk)+Δ 2τPMONi (tk)− Δ f (x)dx ∫ TU TL f (x)dx , 0 ≤ PMONi (tk) ≤ 0.5 ∫ 2τ− TU+2PMONi (tk)(TU − τ)+Δ 2τ− TU+2PMONi (tk)(TU − τ)− Δ f (x)dx ∫ TU TL f (x)dx , 0.5 < PMONi (tk) ≤ 1 , (21) where Δ is a small positivenumber representing the discrete interval. By Eq. (21), we can directly input the value of PMONi into the model without identifying its corresponding state S∗i , so that we can get continuously updated probability based on different PMONi . It is worth mentioning that both the first and the second equations of P(MONi(tk)|BEi) will be equal to zero when PMONi (tk) is equal to zero or one. Therefore, in this work, we compress the boundary of PMONi (tk) to a new boundary from 0.2 to 0.8 by PMONi (tk) = 0.6× PMONi (tk)+ 0.2. The figure on the left side of CPT in Fig. 4 (as shown in the blue frame) shows the value of P(MON(tk)| BE) with Δ equal to 0.2. Finally, thePBEi (tk)can be computed based on the value ofP(MONi(tk)|BEi = Occ)andP(MONi(tk)|BEi = Non) as follows Fig. 4. Diagram of the proposed PRA method. R. He et al. Reliability Engineering and System Safety 226 (2022) 108700 7 PBEi (tk) = P(MONi(t)|BEi = Occ)PBEi∑ BEi=Occ,NonP(BEi)P(MONi(t)|BEi) . (22) 4. Implementation and model discussion In this section, two case studies are used to demonstrate the benefits of the proposed method brought to PRA. In Section 4.1, an experimental study with comparative analysis is presented. Then in Section 4.2, data monitoring-based PRA is performed for MPD operations to evaluate the proposed model. 4.1. Case study 1: RT 580 experimental setup 4.1.1. System description In the first case, the RT 580 experimental setup (shown in Fig. 5) [48], situated at the Memorial University, is applied to demonstrate the proposed method. The RT 580 setup controls the level of process tank (B2) at 40% through a circuit with a collecting tank (B1), pump (P1), and valves. The pneumatic control valve (V7) is used as the actuator. The FT shown in Fig. 6 represents the accident mechanism of “dry-out in tank B2”. Eight basic events (BE1 ~ BE8) are considered, and their prior probabilities can be found in [48]. The flow rate and level in the process tank are monitored during the experiment. The basic event BE1 “Sensor is broken” is introduced into the system. Fig. 7 shows the experimental data collected from the stable operation to the failure of the system. When the level sensor is broken, the controller acquires the default value (high value) of the level in the tank. The controller attempts to close the control valve (V7), causing the flow and level in tank B2 to drop. Since the system operates in a signal condition, we consider the node BE1 as an environmental node. Ac- cording to Ref. [48], a flow rate less than 100 L/h and a level less than 10% will cause a dry-out situation. Therefore, the probability of BE1 is determined by flow rate and level monitoring, and the normal reference value and the threshold value of flow rate and level are set as {400, 40} and {100, 10}, respectively. 4.1.2. PRA results and discussions The figure on the left-hand side of Fig. 8 shows the probability of BE1 signals exceeding alarm thresholds. The real-time probability of BE1 shown in the middle figure of Fig. 8 can be calculated by integrating its prior probability and alarm probability. The first two figures in Fig. 8 present probability profiles with similar growth trends. As defined in Section 3.1, a flow rate less than 100 L/h and a level less than 10% will result in the alarm probability of BE1 higher than 0.5, indicating that the abnormality of its monitored signal keeps growing and may lead to abnormal events. Although the alarm probability by the end of BE1 is as high as a probability close to 0.9, the real-time probability of BE1 is still small due to the low prior probability. In this case, the real-time prob- ability of BE1 is consistent with its prior risk knowledge. The figure on the right-hand side of Fig. 8 shows the real-time probability of dry-out event calculated by the proposed method and dynamic fault tree (DFT) [48]. In DFT, the probability of BE1 (sensor is broken) is equal to 1 when the flow rate is less than 100 L/h or the level is less than 10%, which will result in a real-time probability of 1 for the dry-out event. However, when the process variable exceeds the alarm threshold, the corresponding accident may not always occur. Alarms can also be caused by process disturbances or signal noise. In contrast, the accident probability predicted by the proposed method does not increase immediately when the process variables exceed the alarm threshold. The probability curves of the dry-out accident undertaken with the proposed method are smoother and show significant variations in values compared to the normal case, and thus are more reasonable. To search for critical basic events, the normalized ratios of posterior probabilities are compared. Fig. 9 presents the static posterior estimates of basic events in the RT 580 setup when the dry-out event occurs. Before the level sensor is damaged, static posterior analysis can find some basic events with potential dangers of the dry-out accident, such as BE1, BE7, and BE8. Fig. 10 compares posterior estimates of basic events in the event of a dry-out accident between the proposed method and DFT. Since BE1 has been monitored and the monitoring results show that BE1 is abnormal when the accident occurs, the posterior probability ratio of BE1 increases rapidly and is significantly higher than other basic events. Both of the proposed method and DFT can find the most critical event in real-time. However, on the basis of posterior results by DFT, the pro- posed method can further find that the posterior probability of BE8 is slightly higher than the other six events, indicating that BE8 still needs further attention. 4.2. Case study 2: managed pressure drilling operations 4.2.1. System description Managed pressured drilling (MPD) system is a typical closed-loop kick control system used in deepwater drilling operations. As shown in Fig. 11, MPD utilizes the choke and the pump to adjust annular pressure to meet the requirements of bottomhole pressure (BHP) and observes the BHP condition based on the measurements of flow and hydrodynamic pressure. Fig. 12 shows the FTA of MPD operations. The top event (kick) results from the combined effect of both the abnormal BHP and the failure of the MPD system [50]. Twenty-one basic events (BE1 ~ BE21) are listed with the potential accidents. The prior probability PBEi, i = 1, 2, …, 21 of basic events can be found in [50]. In this case, all basic events are divided into two groups; the first group, GBE = {BE7,BE8,BE12,BE14,BE16,BE18}, (23) represents the events that can be updated in real-time, in which the probability of BE12 and BE14 can be set as 1 when the corresponding signal is lost, whereas the probabilities of the remaining basic events are assumed to be constant values equal to their prior probabilities. In addition, the BE7 (abnormal pressured zone) and BE8 (loss of circulation) are regarded as environmental nodes, and their probabilities are upda- ted based on fluid and pressure monitoring. The BE16 (rig pump failure) and BE18 (backpressure pump failure) are considered process nodes since their reference signal values and alarm thresholds can be affected by changes in pump operating conditions. Data collected from MPD in conjunction with experimental equip- ment data are used in the second case study. The fluid and pressure Fig. 5. The simplified flow diagram of RT 580 experimental setup. R. He et al. Reliability Engineering and System Safety 226 (2022) 1087008 measurement data come from the actual drilling of Well GS19 in the Qixia Formation [51]. The density of injected and returned drilling fluid is measured with a flowmeter to identify overflow or lost circulation, Fig. 6. FT for “Dry-out in tank B2” incident (adapted from the Ref. [48]). Fig. 7. Flow rate and level measurements during the experiment [48]. Fig. 8. Forward PRA updating of BE1 and dry-out event. Fig. 9. Static posterior estimation of RT 580 setup. R. He et al. Reliability Engineering and System Safety 226 (2022) 108700 9 and the automatic choke manifolds are used to regulate wellhead back pressure to maintain the BHP in a safe range. Fig. 13 shows the MPD data, e.g., casing pressure, inlet density, outlet density, and BHP equivalent circulation density, collected from the drilling depth of 4035 m to 4235 m. Two layers with lost circulation were encountered at a depth of 4145.13 m and 4232.22 m, while the lost layers were plugged successfully through the automatic injection of bridging mud at managed pressure. In addition, it can be seen from the change in pumping rate that the operating condition of the rig pump occurs at depths of 4140 m, 4210 m, and 4230 m. Observed from the dataset, higher casing pressure and BHP equivalent circulation density indicate a high risk of BE7, and the outlet density increases and is slighter higher than the inlet density when the loss of circulation occurs. In practice, a basic event can be modeled based on multiple signals using similar approaches. The health status of oil pumps can be represented by the vibration features of the bearing inside the pump, and thus, the vibration data of the bearing provided by Case Western Reserve University (CWRU) Bearing Data Center [52] is used to test the monitoring ability of the proposed method. The database includes normal bearing monitoring data in four conditions, labeled from Normal 1 to Normal 4, corre- sponding to the motor speed from 1797 rpm to 1730 rpm. We assume that 10000 vibration data points can be obtained for every 10 m step of drilling depth to match the drilling process in Fig. 13. To simplify the Fig. 10. Posterior estimation of RT 580 setup at time 180 s. Fig. 11. MPD Process Flow Diagram (adapted from the Ref. [49]). R. He et al. Reliability Engineering and System Safety 226 (2022) 108700 10 model, only the vibration data of the drive end acceleration are used. The rig pump is assumed to be fault-free during the whole drilling process but has multiple working conditions up to the actual case, whereas for the backpressure pump, an abrupt rolling element fault is assumed to occur at a depth of 4200 m. In this case, the rest of the Normal 1 and Normal 4 data are used to train the monitoring model for the rig pump and backpressure pump, respectively. The use of vibration data is shown in Fig. 14. More details of the CWRU database can be found in [53]. 4.2.2. Real-time PRA results In this section, the PRA of MPD operations is performed following the procedure in Fig. 2. According to the actual drilling of Well GS19 [51], the BHP is kept at a slight over-balance during drilling, and the annular back pressure and BHP equivalent circulation density are controlled at 2–5 MPa and 2.46–2.52 g/cm3. Therefore, the normal reference value and the threshold value of casing pressure, BHP equivalent circulation density, and the difference between outlet and inlet densities are set as {3.5, 2.49, -0.03} and {5, 2.52, 0}, respectively. In addition, three sta- tistical time-domain features are extracted from the vibration measurements ⎧ ⎪⎪⎪⎪⎪⎪⎪⎪⎪⎪⎪⎪⎪⎪⎪⎪⎪⎪⎨ ⎪⎪⎪⎪⎪⎪⎪⎪⎪⎪⎪⎪⎪⎪⎪⎪⎪⎪⎩ x1 = ̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅ 1 N ∑N k=1 y2k √ √ √ √ x2 = ̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅ 1 N − 1 ∑N k=1 (yk − y)2 √ √ √ √ x3 = ̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅ 1 N ∑N k=1 y2k √ 1 N ∑N k=1 |yk| , (24) where x1 is the root mean square, x2 is the standard deviation, and x3 is the shape factor [54]. In Eq. (24), y is the vibration signal, and N is the number of vibration points equal to the product of time interval length and sampling frequency. In this case, N is equal to 100. The EM algo- rithm is used to train the GMM based on the historical vibration data, and then we can obtain Fig. 12. FTA for kick control incident (adapted from the Ref. [50]). βBE16 = [ 0.038 0.070 0.264 0.629 ], μBE16 = ⎡ ⎢ ⎣ 0.064 0.087 0.089 0.065 0.057 0.084 0.088 0.063 1.266 1.221 1.179 1.230 ⎤ ⎥ ⎦, Σ(1)BE16 = ⎡ ⎢ ⎣ 5.64e − 5 6.24e − 5 5.33e − 5 6.24e − 5 8.31e − 5 1.50e− 4 5.33e − 5 1.50e− 4 2.30e− 3 ⎤ ⎥ ⎦,Σ(2)BE16 = ⎡ ⎢ ⎣ 1.20e − 4 1.19e − 4 − 5.95e − 6 1.19e − 4 1.27e − 4 2.21e− 5 − 5.95e − 6 2.21e− 5 7.60e − 4 ⎤ ⎥ ⎦ Σ(3)BE16 = ⎡ ⎢ ⎣ 9.28e − 5 9.04e − 5 − 1.27e − 5 9.04e − 5 9.09e − 5 − 1.47e − 5 − 1.27e − 5 − 1.47e − 5 4.49e − 4 ⎤ ⎥ ⎦,Σ(4)BE16 = ⎡ ⎢ ⎣ 1.22e− 4 1.21e − 4 − 7.83e − 5 1.21e − 4 1.22e − 4 − 8.20e − 5 − 7.83e − 5 − 8.20e − 5 1.40e − 3 ⎤ ⎥ ⎦, (25) R. He et al. Reliability Engineering and System Safety 226 (2022) 108700 11 where ΛBE16 = {βBE16, μBE16, ΣBE16} and ΛBE18 = {βBE18, μBE18, ΣBE18} are the trained GMM for BE16 and BE18, respectively. After determining model parameters, the probability PMON7 and PMON8 that online signals exceed alarm thresholds are calculated for BE7 and BE8 based on fluid and pressure monitoring by Section 3.1.1. In addition, with the assumption that the interval length is equal to 1 m, the alarm probabilities PMON16 and PMON18 are computed in real-time for BE16 and BE18 based on the trained GMMs and the dynamic updating method shown in Algorithm 1. Fig. 15 exhibits the results of real-time proba- bility PMON for four basic events. As depicted in Fig. 15 (a), when the lost circulation occurs at a depth of 4145.13 m and 4232.22 m, PMON8 (Loss of circulation) increases significantly. However, since this case only applies a single variable to evaluate the lost circulation, the measure- ment noise will seriously affect the evaluation results, so that sometimes under normal working conditions, BE8 also has a higher alarm proba- bility PMON8 . In contrast, PMON7 (abnormal pressured zone) is assessed by two variables and thus has higher stability. As shown in Fig. 15 (b), PMON16 (rig pump failure) will not change significantly due to the changes in operating conditions. Meanwhile, the proposed method can effectively monitor the fault of BE18 (backpressure pump failure) at a depth of 4200 m. Next, the estimated alarm probabilities PMONi are integrated with prior probabilities to evaluate the risk of dynamic nodes in real-time. According to Eq. (21) in Section 3.2, the conditional probability P (MONi| BEi) at each depth can be computed based on PMONi , and P(MONi| BEi) is incorporated into prior probabilities to calculate the real-time risks of dynamic nodes by Eq. (22). The obtained real-time probability of basic events BE7, BE8, BE16, and BE18 is shown in Fig. 16. In this case, the model parameter Δ is set to 0.2, and μ is set to 3. Detailed parameter settings will be discussed further in Section 4.2.4. It can be observed that if the alarm probability PMON obtain in Fig. 15 is less than 0.5, the risk of basic events in Fig. 16 will not change significantly. Fig. 16 shows that therisk of BE8 increases when lost circulation occurs at depths of Fig. 14. Vibration measurements for the drilling process. Fig. 13. Measurements during the MPD of Well GS19 [51]. The BHP equivalent circulation density is only monitored at depths of 4090 m and 4170 m. βBE18 = [ 0.046 0.074 0.053 0.162 ], μBE18 = ⎡ ⎢ ⎣ 0.062 0.062 0.075 0.074 0.055 0.061 0.073 0.070 1.238 1.231 1.290 1.207 ⎤ ⎥ ⎦, Σ(1)BE18 = ⎡ ⎢ ⎣ 4.93e − 5 3.79e − 5 2.60e − 5 3.79e − 5 3.83e − 5 2.88e− 5 2.60e − 5 2.88e− 5 1.10e− 3 ⎤ ⎥ ⎦,Σ(2)BE18 = ⎡ ⎢ ⎣ 7.97e − 5 7.81e − 5 − 1.89e − 5 7.81e − 5 7.93e − 5 − 1.67e− 5 − 1.89e − 5 − 1.67e− 5 9.94e − 4 ⎤ ⎥ ⎦ Σ(3)BE18 = ⎡ ⎢ ⎣ 1.28e − 4 1.24e − 4 − 1.99e − 5 1.24e − 4 1.25e − 4 − 1.94e − 5 − 1.99e − 5 − 1.94e − 5 1.20e− 3 ⎤ ⎥ ⎦,Σ(4)BE18 = ⎡ ⎢ ⎣ 7.99e− 5 8.22e − 5 − 3.64e − 5 8.22e − 5 9.47e − 5 − 3.23e − 5 − 3.64e − 5 − 3.23e − 5 1.00e − 3 ⎤ ⎥ ⎦, (26) R. He et al. Reliability Engineering and System Safety 226 (2022) 108700 12 4145.13 m and 4232.22 m, and the risk of BE18 rises when pump fault occurs at a depth of 4200 m. Therefore, the proposed method is vali- dated to be effective. When we get the risk of all basic events at a certain drilling depth, the probability of kick can be estimated by Eq. (17). As shown in Fig. 17 (a), a single lost circulation accident will not significantly increase the probability of a kick due to the role of the MPD system as a safety barrier. When the MPD system can work normally, the closed-loop control of MPD can balance the BHP to alleviate the lost circulation and avoid the kick, and also, this phenomenon is entirely consistent with the actual drilling case [51]. In contrast, the MPD pump equipment failed at a depth of 4200m and could not adjust the BHP normally, and thus the probability of kick increased instantly. At a depth of 4232 m, the lost circulation and the backpressure pump fault occur at the same time, and therefore, the probability of kick reaches the highest value, which also indicates that the kick is a result of the combined effect of both the abnormal BHP and the failure of the MPD system. As a comparison, Fig. 17 (b) shows the kick probability calculated without considering the prior probability of basic events, which is a similar idea that has been widely used in many reported works [33–37]. Although the comparative Fig. 16. The real-time risk of basic events. Fig. 15. The probability of online signals exceeding alarm thresholds. Fig. 17. The real-time probability of kick. R. He et al. Reliability Engineering and System Safety 226 (2022) 108700 13 method can successfully identify the rise in risk after the depth of 4220 m, the obtained kick risks have large fluctuations due to measurement noise, even when the MPD system is working normally (from a depth of 4035 m to 4150 m). Beyond the real-time probability analysis of kick, it is also essential to discover the critical events with potential kick risk. The posterior risk estimation can be performed based on obtained real-time probabilities by Eq. (18) and the proposed method. In this case, the kick is adopted as the top event to calculate the posterior probability of basic events. Fig. 18 presents the normalized posterior estimates at some representative drilling depths. The first graph in Fig. 18 shows the result of the static posterior analysis. Before drilling operations, static poste- rior analysis can find some basic events with potential dangers of kick accident, such as BE1, BE2, BE4, BE7, BE15, and BE16. However, since BE7 and BE16 have been monitored and the monitoring results show that they are normal, the posterior probability ratio of the events BE7 and BE16 decreases at a depth of 4060 m. At a depth of 4150 m, the lost circulation occurs, and thus, the BE8 (loss of circulation) increases, indicating that the BE8 is quite critical and should be noted at this depth. From the depth of 4150 m to 4210 m, the lost layers are successfully Fig. 18. Real-time posterior probability of MPD operations using the proposed method. R. He et al. Reliability Engineering and System Safety 226 (2022) 108700 14 plugged, but at a depth of 4210 m, the backpressure pump malfunction occurs and the posterior probability ratio of the events BE18 increases correspondingly. In addition, with the rise of the posterior probability of BE18, the posterior probability of BE19 also increases, and thus BE19 and BE18 may have similar accident propagation paths. A similar analysis can be conducted for the last two graphs. At a depth of 4230 m, the rig pump changes its operating conditions to suppress the lost circulation so that the posterior probability of BE8 and BE16 is slightly increased. At a depth of 4235 m, the rig pump returns to its initial operating condition, and the corresponding posterior probability drops. For comparison, Fig. 19. Real-time posterior probability of MPD operations without considering the prior probability. Fig. 20. The probability of BE16 signals exceeding alarm thresholds. R. He et al. Reliability Engineering and System Safety 226 (2022) 108700 15 when prior risk is not considered, the posterior risks of BE8 (loss of circulation) are still high, as shown in Fig. 19, even if the layers are not lost, or the lost layers are successfully plugged (at depths of 4060 m, 4230 m, and 4235 m). In addition, at a depth of 4150 m, the comparative method incorrectly gets a high posterior risk of BE16 since the prior risk of rig pump failure is not incorporated in the estimation. Overview, the static risk analysis is hard to capture the deviations and changes in MPD operations, and the risk cannot be well estimated if only the monitoring information is considered without considering prior risks. However, once the prior risks are to be considered, CPTs need to be set up to connect prior probabilities to data monitoring, which often requires historical accident data or expert knowledge in reported works. There- fore, the proposed real-time PRA is particularly effective in evaluating the time-dependent behavior of MPD operations in the presence of data scarcity and missing expertise. 4.2.3. The comparison of GMM and GMM-PF MPD is a typical intermittent operation system. The operating con- dition of the rig pump and backpressure pump will change over time, and the transient behavior can occur during phase shifts. GMM can be utilized to construct the monitoring model for process nodes. However, as the condition of mechanical equipment may change in MPD opera- tions, the component parameters of GMM require updates over time. Therefore, PF is developed to update the GMM in Algorithm 1. This part is applied to demonstrate the positive impact of particle filtering-based parameter updating. As shown in Fig. 14 (a), the operating condition of the rig pump will change during the drilling. Since the features in Eq. (24) cannot distin- guish different operating condition, we build a simple neural network model to extract a new set of three featuresfrom vibration signals, which can effectively present the difference between different conditions, and the input of the neural network is 100 vibration points. New features are used to train the GMM for the rig pump (BE16) and the backpressure pump (BE18). The probability of BE16 and BE18 signals exceeding alarm thresholds based on GMM are shown in Fig. 20 (a) and Fig. 21 (a), respectively. When we use PF to update the GMM parameters, the ob- tained alarm probability of BE16 and BE18 are shown in Fig. 20 (b) and Fig. 21 (b). If the alarm probability exceeds 0.5, it means that the cor- responding mechanical equipment may be malfunctioning. As presented in Fig. 20 (a), the GMM-based monitoring model will incorrectly eval- uate other normal operating conditions as faults. Specifically, the motor speed of the rig pump changes from 1797 rpm to 1750 rpm at a depth of 4140 m and increases from 1750 rpm to 1772 rpm at a depth of 4210 m. Fig. 21. The probability of BE18 signals exceeding alarm thresholds. Fig. 22. Sensitivity analysis of μ Fig. 23. Sensitivity analysis of Δ R. He et al. Reliability Engineering and System Safety 226 (2022) 108700 16 Although the rig pump still works normally, the GMM-based monitoring model incorrectly identifies the change in operating conditions as a fault since most of the corresponding probabilities exceed 0.5 after depths of 4140 m and 4210 m. In contrast, as shown in Fig. 20 (b), PF can effec- tively adjust the GMM parameters to a reasonable range so that GMM-PF will not be too sensitive to changes in pump operating conditions. In addition, at a depth of 4200 m, the backpressure pump experienced a sudden rolling element fault, as shown in Fig. 14 (b). As presented in Fig. 21, both GMM and GMM-PF can identify the fault of the back- pressure pump at a depth of 4200 m. Overall, the proposed GMM-PF method is more applicable when the monitoring parameters are affected by varying working conditions. 4.2.4. Model parameter analysis This work contains three models presented in Section 3. The first monitoring model shown in Section 3.1.1 is a deterministic model, which only requires reference values and thresholds for process mea- surements instead of any pre-set parameters. For the GMM-PF model presented in Section 3.1.2, the parameter settings of the model can refer to many existing references, such as Ref. [42] and [43]. As a result, this section focuses on discussing the parameter setting of the integrated PRA model shown in Section 3.2. We need to set only two parameters, μ and Δ, for the proposed PRA model, and other parameters can be derived from these two parameters. This section provides a sensitivity analysis to analyze how to determine model parameters for PRA. First, we set Δ to 0.05 and prior probability to 0.05 to observe the influence of the change of parameter μ on the posterior probability. As shown in Fig. 22, whether the PMON value is 0.4 or 0.6, the posterior probability will remain stable when μ is greater than about 2, and thus we can set the parameter μ to a number greater than 2. Fig. 23 shows the sensitivity analysis result of parameter Δ, in which the μ is set as 3 and the prior probability is set as 0.05. Δ is a small number denoting the discrete interval. When the parameter Δ is less than 0.3, the posterior probabilities are stable, and therefore, we can set Δ to a very small positive number for the PRA model. 4.2.5. Model uncertainty and validation The uncertainty and validation of risk analysis models have been widely studied. The model uncertainty may greatly impact the estima- tion accuracy of the probability estimation of basic events, and then affect the calculated risk. One way to represent the uncertainty in PRA is to conduct a bounding analysis by using probability boxes (P-boxes) [55]. In the bounding analysis, the prior probability of basic events can be assessed as a credible interval rather than a single point. By propa- gating the uncertainty, a credibility interval can also be obtained for the estimated kick risk and posterior risks, which can help decision-makers understand the confidence of risk estimations. Three-axiom-based validation [56] can be performed to give a partial validation for the proposed PRA model. If the proposed model is robust, the estimated probability of basic events is often sensitive but rarely presents abrupt variations due to any slight changes in corresponding alarm probability. As shown in Fig. 20 and Fig. 21, if PMON is less than 0.5, the real-time risk of the corresponding basic event will only change slightly. On the contrary, when PMON is greater than 0.5, the real-time risk of the corresponding basic event will increase significantly. With a minor change in the alarm probability, the real-time risk of basic events is sensitive but rarely exhibits abrupt changes. In addition, once the monitoring results show that the event status is abnormal, the real-time risk of the basic event will rise reasonably. The sensitive analysis results satisfy the axioms and thus partially validate the pro- posed PRA model. Conclusions This paper proposes a novel integrated PRA method for petrochem- ical industries based on data monitoring. The proposed method can fully satisfy the timing requirement of process operations. Additionally, the proposed probability estimation not only considers the update of the failure probability of safety barriers but also assesses the probability of abnormal events. Furthermore, PF-based GMM can estimate the alarm probability of process signals effectively during phase shafts in industrial systems. By comparing different risk analysis methods, the main con- clusions from the case study validate the effectiveness and necessity of the proposed PRA method. The significant contribution of the proposed approach is the inte- gration of data monitoring with DBN for PRA in real-time. Specifically, the probability of the monitored signal exceeding the alarm threshold is estimated first for environmental and process monitoring based on the multivariate method and PF-based GMM, respectively. Data monitoring can be effectively incorporated into PRA in real-time by the proposed DBN with the estimated alarm probability. The proposed DBN model will not require additional expert knowledge and historical accident data to determine the conditional relationship between data monitoring and prior risk so that is more suitable for practical engineering appli- cations. We plan to collect actual process data to test the model and incorporate effective monitoring models into PRA. CRediT authorship contribution statement Rui He: Conceptualization, Methodology, Writing – original draft, Writing – review & editing. Jingyu Zhu: Investigation, Writing – review & editing. Guoming Chen: Writing – review & editing, Funding acquisition. Zhigang Tian: Writing – review & editing. Declaration of Competing Interest No conflict of interest exists in the submission of this manuscript, and all authors approve the manuscript for publication. We would like to declare on behalf of co-authors that the work described was original research that has not been published previously, and not under consideration for publication elsewhere, in whole or in part. We declare that we do not have any commercial or associative interest that repre- sents a conflict of interest in connection with the work submitted. Acknowledgments The authors greatly appreciate the help from the Centre for Offshore Engineering and Safety Technology (COEST) at China University of Petroleum (East China), and gratefully acknowledge the financial sup- port provided by China National Key Research and Development Pro-gram (No: 2016YFC0304005), and National Natural Science Foundation of China (No: 52004142). References [1] Khan F, Abbasi S. Techniques and methodologies for risk analysis in chemical process industries. J Loss Prevent Proc 1998;11(4):261–77. [2] Hu Y, Parhizkar T, Mosleh A. Guided simulation for dynamic probabilistic risk assessment of complex systems: Concept, method, and application. Reliab Eng Syst Safe 2022;217:108047. [3] Khan F, Abbasi SA. Analytical simulation and PROFAT II: a new methodology and a computer automated tool for fault tree analysis in chemical process industries. J Hazard Mater 2000;75(1):1–27. [4] Ferdous R, Khan F, Sadiq R, Amyotte P, Veitch B. Handling data uncertainties in event tree analysis. Process Saf Environ 2009;87(5):283–92. [5] Aqlan F, Mustafa Ali E. Integrating lean principles and fuzzy bow-tie analysis for risk assessment in chemical industry. J Loss Prevent Proc 2014;29:39–48. [6] Khakzad N, Khan F, Amyotte P. Dynamic safety analysis of process systems by mapping bow-tie into Bayesian network. Process Saf Environ 2013;91(1–2):46–53. [7] Moradi R, Groth KM. Modernizing risk assessment: A systematic integration of PRA and PHM techniques. Reliab Eng Syst Safe 2020;204:107194. [8] OREDA-offshore reliability data handbook. 4th ed. Trondheim: SINTEF; 2002. [9] Li X, Zhang Y, Abbassi R, Khan F, Chen G. Probabilistic fatigue failure assessment of free spanning subsea pipeline using dynamic Bayesian network. Ocean Eng 2021;234:109323. [10] He R, Chen G, Wang Y, Jiang S, Zhi C. A quantitative risk analysis model considering uncertain information. Process Saf Environ 2018;118:361–70. R. He et al. http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0001 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0001 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0002 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0002 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0002 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0003 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0003 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0003 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0004 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0004 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0005 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0005 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0006 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0006 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0007 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0007 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0008 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0009 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0009 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0009 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0010 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0010 Reliability Engineering and System Safety 226 (2022) 108700 17 [11] Liu Z, Ma Q, Cai B, Shi X, Zheng C, Liu Y. Risk coupling analysis of subsea blowout accidents based on dynamic Bayesian network and NK model. Reliab Eng Syst Safe 2022;218:108160. [12] Meel A, Seider W D. Plant-specific dynamic failure assessment using Bayesian theory. Chem Eng Sci 2006;61:7036–56. [13] Kalantarnia M, Khan F, Hawboldt K. Dynamic risk assessment using failure assessment and Bayesian theory. J Loss Prevent Proc 2009;22:600–6. [14] Li X, Chen G, Khan F, Xu C. Dynamic risk assessment of subsea pipelines leak using precursor data. Ocean Eng 2019;178:156–69. [15] Yu H, Khan F, Veitch B. A flexible hierarchical Bayesian modeling technique for risk analysis of major accidents. Risk Anal 2017;37(9):1668–82. [16] Meng X, Li X, Wang W, Song G, Chen G, Zhu J. A novel methodology to analyze accident path in deepwater drilling operation considering uncertain information. Reliab Eng Syst Safe 2021;205:107255. [17] Yang X, Haugen S, Paltrinieri N. Clarifying the concept of operational risk assessment in the oil and gas industry. Safety Sci 2018;108:259–68. [18] Khakzad N, Khan F, Amyotte P. Quantitative risk analysis of offshore drilling operations: A Bayesian approach. Safety Sci 2013;57:108–17. [19] Pasman H, Reniers G. Past, present and future of Quantitative Risk Assessment (QRA) and the incentive it obtained from Land-Use Planning (LUP). J Loss Prevent Proc 2014;28:2–9. [20] Shalev DM, Tiran J. Condition-based fault tree analysis (CBFTA): A new method for improved fault tree analysis (FTA), reliability and safety calculations. Reliab Eng Syst Safe 2007;92(9):1231–41. [21] Liu Q, Dong M, Peng Y. A novel method for online health prognosis of equipment based on hidden semi-Markov model using sequential Monte Carlo methods. Mech Syst Signal Pr 2012;32:331–48. [22] Zeng Z, Zio E. Dynamic risk assessment based on statistical failure data and condition-monitoring degradation data. IEEE T Reliab 2018;67(2):609–22. [23] Gomes JPP, Rodrigues LR, Galvao RKH, Yoneyama T. System level RUL estimation for multiple-component systems. In: Proc 2013 Annu Conf Prognostics Health Manage. Soc; 2013. p. 74–82. [24] Xing J, Zeng Z, Zio E. A framework for dynamic risk assessment with condition monitoring data and inspection data. Reliab Eng Syst Safe 2019;191:106552. [25] Wang H, Khan F, Ahmed S, Imtiaz S. Design of Scenario-Based Early Warning System for Process Operations. Ind Eng Chem Res 2015;54(33):8255–65. [26] Adedigba SA, Oloruntobi O, Khan F, Butt S. Data-driven dynamic risk analysis of offshore drilling operations. J Petrol Sci Eng 2018;165:444–52. [27] Mamudu A, Khan F, Zendehboudi S, Adedigba S. Dynamic risk assessment of reservoir production using data-driven probabilistic approach. J Petrol Sci Eng 2020;184:106486. [28] Zhang X, Zhang L, Hu J. Real-time risk assessment of a fracturing manifold system used for shale-gas well hydraulic fracturing activity based on a hybrid Bayesian network. J Nat Gas Sci Eng 2019;62:79–91. [29] Zywiec WJ, Mazzuchi TA, Sarkani S. Analysis of process criticality accident risk using a metamodel-driven Bayesian network. Reliab Eng Syst Safe 2021;207: 107322. [30] Yu Q, Teixeira AP, Liu K, Rong H, Soares CG. An integrated dynamic ship risk model based on Bayesian networks and evidential reasoning. Reliab Eng Syst Safe 2021;216:107993. [31] Islam R, Khan F, Venkatesan R. Real time risk analysis of kick detection: Testing and validation. Reliab Eng Syst Safe 2017;161:25–37. [32] He R, Li X, Chen G, Chen G, Liu Y. Generative adversarial network-based semi- supervised learning for real-time risk warning of process industries. Expert Syst Appl 2020;150:113244. [33] Wang H, Khan F, Ahmed S, Imtiaz S. Dynamic quantitative operational risk assessment of chemical processes. Chem Eng Sci 2016;142:62–78. [34] Hashemi SJ, Ahmed S, Khan F. Loss functions and their applications in process safety assessment. Process Saf Prog 2014;33(3):285–91. [35] Sule I, Imtiaz S, Khan F, Butt S. Risk analysis of well blowout scenarios during managed pressure drilling operation. J Petrol Sci Eng 2019;182:106296. [36] Wang Q, Zhang L, Hu J. Real-time risk assessment of casing-failure incidents in a whole fracturing process. Process Saf Environ 2018;120:206–14. [37] Dimaio F, Scapinello O, Zio E, Ciarapica C, Cincotta S, Crivellari A, Decarli L, Larosa L. Accounting for safety barriers degradation in the risk assessment of oil and gas systems by multistate Bayesian networks. Reliab Eng Syst Safe 2021;216: 107943. [38] Wu S, Zhang L, Fan J, Zheng W, Zhou Y. Real-time risk analysis method for diagnosis and warning of offshore downhole drilling incident. J Loss Prevent Proc 2019;62:103933.[39] Parhizkar T, Hogenboom S, Vinnem JE, Utne IB. Data driven approach to risk management and decision support for dynamic positioning systems. Reliab Eng Syst Safe 2020;201:106964. [40] Ahn J, Chang D. Fuzzy-based HAZOP study for process industry. J Hazard Mater 2016;317:303–11. [41] Zadakbar O, Imtiaz S, Khan F. Dynamic risk assessment and fault detection using a multivariate technique. Process Saf Prog 2013;32(4):365–75. [42] Yu J. A particle filter driven dynamic Gaussian mixture model approach for complex process monitoring and fault diagnosis. J Process Contr 2012;22:778–88. [43] Figueiredo MAF, Jain AK. Unsupervised learning of finite mixture models. IEEE Trans Pattern Anal Mach Intell 2002;24:381–96. [44] Arulampalam MS, Maskell S, Gordon N, Clapp T. A tutorial on particle filters for online nonlinear/non-Gaussian Bayesian tracking. IEEE Trans Signal Proces 2002; 50(2):175–88. [45] Kotecha JH, Djuric PM. Gaussian sum particle filtering. IEEE Trans Signal Proces 2003;51(10):2602–12. [46] Yu J, Qin SJ. Multimode process monitoring with Bayesian inference-based finite Gaussian Mixture models. AIChE J 2008;54(7):1811–29. [47] Li C, Mahadevan S, Ling Y, Choze S, Wang L. Dynamic Bayesian network for aircraft wing health monitoring digital twin. AIAA J 2017;55(3). [48] Ghosh A, Ahmed S, Khan F. Modeling and testing of temporal dependency in the failure of a process system. Ind Eng Chem Res 2019;58:8162–71. [49] Sule I, Khan F, Butt S, Yang M. Kick control reliability analysis of managed pressure drilling operation. J Loss Prevent Proc 2018;52:7–20. [50] Abimbola M, Khan F, Khakzad N, Butt S. Safety and risk analysis of managed pressure drilling operation using Bayesian network. Safety Sci 2015;76:133–44. [51] Yan L, Wu H, Yan Y. Application of fine managed pressure drilling technique in complex wells with both blowout and lost circulation risks. Natural Gas Indus B 2015;2:192–7. [52] “Case western reserve university bearing data center website”, https://csegroups. case.edu/bearingdatacenter. [53] Smith WA, Randall RB. Rolling element bearing diagnostics using the Case Western Reserve University data: A bench mark study. Mech Syst Signal Pr 2015;64-65: 100–31. [54] Goyal D, Vanraj Pabla BS, Dhami SS. Condition monitoring parameters for fault diagnosis of fixed axis gearbox: A review. Arch Computat Methods Eng 2017;24: 543–56. [55] Karanki D R, Kushwaha H S, Verma A K, Ajit S. Uncertainty analysis based on probability bounds (P-box) approach in probabilistic safety assessment. Risk Anal 2009;29(5):662–75. [56] Jones B, Jenkinson I, Yang Z, Wang J. The use of Bayesian network modelling for maintenance planning in a manufacturing industry. Reliab Eng Syst Safe 2010;95: 267–77. R. He et al. http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0011 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0011 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0011 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0012 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0012 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0013 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0013 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0014 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0014 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0015 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0015 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0016 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0016 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0016 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0017 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0017 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0018 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0018 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0019 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0019 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0019 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0020 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0020 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0020 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0021 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0021 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0021 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0022 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0022 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0023 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0023 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0023 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0024 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0024 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0025 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0025 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0026 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0026 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0027 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0027 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0027 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0028 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0028 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0028 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0029 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0029 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0029 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0030 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0030 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0030 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0031 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0031 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0032 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0032 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0032 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0033 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0033 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0034 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0034 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0035 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0035 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0036 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0036 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0037 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0037 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0037 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0037 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0038 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0038 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0038 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0039 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0039 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0039 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0040 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0040 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0041 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0041 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0042 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0042 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0043 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0043 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0044 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0044 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0044 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0045 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0045http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0046 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0046 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0047 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0047 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0048 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0048 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0049 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0049 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0050 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0050 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0051 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0051 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0051 https://csegroups.case.edu/bearingdatacenter https://csegroups.case.edu/bearingdatacenter http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0053 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0053 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0053 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0054 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0054 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0054 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0055 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0055 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0055 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0056 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0056 http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0056 A real-time probabilistic risk assessment method for the petrochemical industry based on data monitoring 1 Introduction 2 Preliminaries and problem formulation 3 Proposed methodology 3.1 Probability calculation for online signal exceeding alarm threshold 3.1.1 Probability estimations for environmental nodes 3.1.2 Probability estimations for process nodes 3.2 Integrated real-time risk analysis 4 Implementation and model discussion 4.1 Case study 1: RT 580 experimental setup 4.1.1 System description 4.1.2 PRA results and discussions 4.2 Case study 2: managed pressure drilling operations 4.2.1 System description 4.2.2 Real-time PRA results 4.2.3 The comparison of GMM and GMM-PF 4.2.4 Model parameter analysis 4.2.5 Model uncertainty and validation Conclusions CRediT authorship contribution statement Declaration of Competing Interest Acknowledgments References