Prévia do material em texto
Reliability Engineering and System Safety 226 (2022) 108700
Available online 2 July 2022
0951-8320/© 2022 Elsevier Ltd. All rights reserved.
A real-time probabilistic risk assessment method for the petrochemical
industry based on data monitoring
Rui He a,*, Jingyu Zhu b, Guoming Chen b, Zhigang Tian a
a Department of Mechanical Engineering, University of Alberta, Edmonton, AB, Canada
b Centre for Offshore Engineering and Safety Technology (COEST), China University of Petroleum (East China), No.66, Changjiang West Road, Qingdao, China
A R T I C L E I N F O
Keywords:
Probabilistic risk assessment (PRA)
Dynamic Bayesian network (DBN)
Data monitoring
Petrochemical industry
A B S T R A C T
Safety is of high societal concern in the petrochemical industry. With advancing digitalization, the techniques of
probabilistic risk assessment (PRA), which provide a system-level perspective for industrial risk analysis, have
become increasingly dynamic. This paper provides a further advancement in this direction by proposing a risk
updating method based on the dynamic Bayesian network (DBN) to incorporate data monitoring into PRA in real-
time. We update the probabilistic risk for basic events in Bayesian networks based on their prior probability and
the probability of their online monitoring signals exceeding the alarm threshold. The key idea behind this
approach is that if the signal of a basic event exceeds the alarm threshold, the corresponding risk should increase.
Otherwise, the basic event should have a low risk. The contribution in this paper is twofold. First, residual
methods are developed to estimate signal probabilities exceeding abnormal thresholds. Second, a DBN model is
proposed to integrate prior risk with data monitoring for risk analysis. The proposed DBN model does not require
additional expert knowledge and historical accident data to define the conditional relationship between data
monitoring and prior risk. The proposed approach is validated using the RT 580 experimental setup and managed
pressure drilling operations.
1. Introduction
Modern petrochemical processes are complex socio-technical sys-
tems that are susceptible to catastrophic accidents due to an abnormal
process, instrumentation, or human error [1]. Probabilistic risk assess-
ment (PRA) provides a well-established family of techniques to address
safety, risk, and reliability of these complex petrochemical systems [2].
To equip PRA with the capability of investigating accident propagation
mechanisms, various methodologies have been proposed over the past
decades, including fault tree analysis (FTA) [3], event tree analysis
(ETA) [4], bow-tie (BT) [5], and Bayesian network (BN) [6]. These
traditional PRA techniques consider possible accident propagation with
chain rules and use pre-set deterministic failure probabilities to assess
the likelihood of accidents. However, traditional PRA methods are
limited by the use of static failure probabilities and thus is hard to
capture deviations and changes in system status. As the modern industry
moves towards a data-driven “Industry 4.0”, advances in sensor tech-
nology have made PRA techniques more and more dynamic, but effec-
tively utilizing data information in PRA remains a challenge [7].
Most petrochemical processes are intermittent systems with multiple
working phases. During system operations, the risk of accidents may
change over time, and the accident propagation mechanism during
phase shifts may be different. To account for the time-dependent
behavior of accidents, engineers started to consider additional time
slices in PRA methods. OREDA [8] provides a reference for exponential
failure rates of most petrochemical equipment, and thus it is possible to
develop static PRA into dynamic forms based on failure rates. Typically,
constant failure rates have been widely used for extending BN into a
dynamic Bayesian network (DBN) to consider time-dependent opera-
tions and equipment degradations [9–11]. When statistical accident
data or precursor data is available, the risk can also be updated in PRA
by Bayesian inference. Exponential distribution and Beta distribution
can be assumed as the basic probability distribution of possible accidents
in ETA or BN, and the failure probabilities of safety barriers are updated
by counting the number of accidents [12,13]. Since the available acci-
dent data is often limited, hierarchical Bayesian analysis (HBA) is pro-
posed to use scarce statistical data to update the failure probability
distribution of safety barriers [14,15]. Other graph-based methods, such
as directed acyclic graphs (DAG) that assign a deterministic value to
each path for the risk propagation analysis also use statistical accident
* Corresponding author.
E-mail address: rhe5@ualberta.ca (R. He).
Contents lists available at ScienceDirect
Reliability Engineering and System Safety
journal homepage: www.elsevier.com/locate/ress
https://doi.org/10.1016/j.ress.2022.108700
Received 20 August 2021; Received in revised form 18 May 2022; Accepted 26 June 2022
mailto:rhe5@ualberta.ca
www.sciencedirect.com/science/journal/09518320
https://www.elsevier.com/locate/ress
https://doi.org/10.1016/j.ress.2022.108700
https://doi.org/10.1016/j.ress.2022.108700
https://doi.org/10.1016/j.ress.2022.108700
http://crossmark.crossref.org/dialog/?doi=10.1016/j.ress.2022.108700&domain=pdf
Reliability Engineering and System Safety 226 (2022) 108700
2
data to update risk by assuming distributions of risk factors [16]. One
issue is that statistical data are often collected from a population of
similar systems and cannot fully interpret the specific features of target
systems. In addition, the accident data is commonly hard to collect in the
short term, and thus these PRA techniques can only be updated regularly
over long periods, such as yearly or weekly [17].
Due to the real-time characteristics of operating parameters in
petrochemical industries, the credibility of statistical data is always in
question [18]. Advances in sensor technology provide online data sup-
port for PRA techniques, and therefore, PRA methods have begun to
update estimated risk in real-time using online measurements. Unlike
process monitoring, PRA uses risk to measure the lack of safety from
system-level perspectives and thus can identify possible problems and
make corrective decisions before accidents occur [19]. Commonly, an
accident is caused by a combination of several basic events, including
abnormal operating conditions and safety barrier failures. Condition
data can be incorporated in degradation models based on reliability
assessment methods when considering the dynamic mechanism of safety
barriers. Initially, a condition fault tree is proposed to update the failure
rates of components according to equipment condition data [20].
Model-based methods, like Kalman filter and particle filtering (PF), are
developed to update the degradation state of safety barriers based on
real-time measurements [21–23]. In addition, Gaussian mixture model
(GMM) is developed in [24] to estimate the degradation of safety bar-
riers and identify health status in real-time. Furthermore, in [7], a sys-
tematical framework is presented to incorporate the degradation model
into PRA techniques. Even though the degradation characteristics of
safety barriers are effectively modeled, the reliability-based methods
cannot fully meet the requirement of petrochemical industries because
the instantaneous changes in operations cannot be considered. In addi-
tion, the process equipment will not be significantly degraded in a short
operating period.
To equip PRA with data monitoring, some works construct data-
driven risk models based on large amounts of historical or experi-
mental accident data. Commonly, as in [25–27], BN structural and
parameter learning are performed to learn probabilistic relationships
between operating parameters andaccidents using historical event data,
and the learned BN can be used to predict the probability of accidents
when the online data is available. In [28–30], data monitoring is
incorporated into BN models for risk analysis, where BN parameters are
determined using statistical methods such as maximum likelihood esti-
mation (MLE) based on sufficient accident-related data. Also in [31,32],
a risk-based warning model is constructed for real-time risk analysis
using historical data and expert knowledge. Oftentimes,
accident-related data are challenging to collect, and thus statistical and
data-driven approaches may lead to biased and inconsistent estimates in
PRA models. Other related works determine different failure probabili-
ties for basic events under varying operations. In [33,34], different
risk-based loss functions are defined to use monitoring data to assess
accident probabilities in real-time. In addition, in [35–37], the online
monitoring signal is divided into multiple discrete levels (e.g., high,
medium, low), and corresponding failure probabilities are set for basic
events under different groups according to expert knowledge. However,
despite no use of historical data, these PRA techniques determine the
risks of basic events based only on the online information while ignoring
their prior probabilities in real-time risk estimations. If the prior prob-
ability is not considered, the risk of an event will depend entirely on the
data monitoring and will be unrelated to its own accident mechanism.
Only a few works, for example [38,39], consider both data monitoring
and prior probabilities in PRA without using historical accident data, but
expert knowledge is still required to determine the effect of different
monitoring parameters on the probability of basic events or accidents.
In this work, we provide a further advancement in this direction by
proposing a real-time PRA approach that embeds both data monitoring
and prior knowledge in risk analysis. We start with the estimates of the
probability that online signals exceed the alarm threshold. Thereafter,
these estimated probabilities are incorporated into PRA to update the
risk of basic events based on their prior knowledge. The significant
contribution of this work is twofold. First, residual-based methods are
developed for environmental and process monitoring to estimate the
probability of their online signals exceeding abnormal thresholds. Sec-
ond, a DBN model is proposed to integrate prior risk with data moni-
toring for risk analysis. The proposed DBN model does not require
additional expert knowledge and historical accident data for each basic
event to determine the conditional relationship (conditional probability
table) between data monitoring and prior risk so that the proposed PRA
is practical, especially when historical data is scarce.
The rest of the paper is organized as follows. In Section 2, the pre-
liminaries and engineering motivations are briefly introduced, and the
critical problems are formulated. In Section 3, the proposed real-time
PRA method is described. Section 4 uses two cases, including the RT
580 experimental setup and managed pressure drilling (MPD) operation,
to demonstrate and validate the proposed method. Finally, conclusions
are made in Section 5.
2. Preliminaries and problem formulation
This work aims to predict the probability of accidents and to search
for critical events with potential accident risks. Let TE represents the top
event (main accident) of the Fault Tree (FT) and suppose that there are N
basic events to determine the accident, denoted by BEi, i = 1, 2, …, N.
Based on the logical relationship of FT gates, the probability of an ac-
cident is a function of the prior probability of basic events
PTE = fFT(PBE1 ,PBE2 ,…,PBEN ), (1)
wherefFT(⋅) denotes the FT model and PBEi is the prior probability of the
i-th basic event obtained by FTA.
The mapping algorithm in [6] can be used to set conditional prob-
ability tables (CPTs) to map FT to BN. In order to discover the potential
basic events that may lead to accidents, the risk index is defined as the
conditional probability of BEi given that TE has already occurred. The
conditional probabilities can be estimated based on the rules of BN
P(BEi|TE) = P{BEioccurs|TEhasoccured}
=
P(TE|BEi)PBEi∑
BEi
P(TE,BEi)
, i = 1, 2,…N. (2)
To meet the timing requirement of petrochemical industries, data
monitoring is considered in PRA. Fig. 1 shows a simple process (feeding
tank) and its FT with high-level incident [40]. The top event, i.e., high
level, results from the combined effect of both the abnormal operating
flow and the failure of safety barriers. In this case, the condition of input
and output flow and the level control logic can be respectively observed
from FI01, FI02, and LIC01, and the pump status can be assessed based
on the accelerometers. The online-monitoring data can reflect the sys-
tem operating status. For example, if FI01 is high, the risk of increased
feed should also rise. However, since different events have different
prior probabilities, the updated risk should not only depend on data
monitoring but also on the prior risk of the event.
When a BN model is constructed, the nodes of BN are categorized
into static nodes and dynamic nodes. The real-time probability of a static
node is always equal to its prior probability, while the probability of
dynamic nodes will be continuously updated based on data monitoring.
Dynamic nodes can be further classified into environmental nodes and
process nodes according to signal characteristics. Environmental nodes
refer to various environmental variables that are not affected by changes
in operating conditions, such as wind speed and formation pressure.
Process nodes, such as input flow, tank level, and pump vibration, are
associated with process variables affected by varying working condi-
tions. We assume that the risk of m environmental nodes can be updated
based on environmental monitoring, and the risk of n process nodes can
R. He et al.
Reliability Engineering and System Safety 226 (2022) 108700
3
be updated based on process monitoring. In this work, the probability of
online signals beyond the alarm threshold is first estimated for each
dynamic node, which can be regarded as the probability that the event
deviates from the normal status, denoted byPMONi , i = 1, 2,…, m+n.
Afterward, the risks of dynamic nodes are estimated in real-time by
integrating PMON and their prior probability PBE. The PRA tasks are
formally defined as:
(1) forward risk updating: update the probability of basic events at
the current time t = tk, k = 1, 2, …,T, based on data monitoring
available up to tk, to predict the probability of top event at time tk;
(2) posterior risk estimation: given that the top event has occurred,
estimate the risk index (posterior probability) of basic events at
time tk to discover the critical event with potential accident risks.
Fig. 1. Process diagram and Fault Tree model for a feeding tank.
Fig. 2. Procedures for real-time PRA based on data monitoring
R. He et al.
Reliability Engineering and System Safety 226 (2022) 108700
4
3. Proposed methodology
Fig. 2 summarizes the major steps of the proposed real-time PRA
method for petrochemical industries. Before conducting a risk analysis,
the event type of each basic event is first determined. After that, the
probability of the online signal exceeding the alarm thresholdPMON is
estimated for each dynamic node. We show how to evaluatePMONi , i = 1,
2,…, m, for environmental nodes in Section 3.1.1, and present how to
estimate PMONi , i = m+1, m+2,…, m+n, for process nodes in Section
3.1.2. Finally, in Section 3.2, a novel PRA method is presented to inte-
grate PMON with prior probability PBE based on the idea of DBN for risk
updating and posterior estimations in real-time. At time t = tk, the for-
ward risk updating is performed first, then the posterior probability of
basic events can be estimated based on the updated probability.
3.1. Probability calculation for online signal exceeding alarm threshold
3.1.1. Probability estimations for environmental nodes
As mentioned before, the environmental nodes in this section denote
the dynamic nodes that cannot be affected by changes in system oper-
ating conditions, such as increased wind speed and deduced ambient
temperature. This section aims to estimate the probability that the on-
line signal of the environmental node exceeds the alarm threshold. In
general, the historical data of environment-related events is scarce for a
new project, but the normal reference value and the alarm threshold of
monitored parameters are often known in advance. Let AEi represent the
i-th environmental node, i = 1, 2,…, m, and assume that there are M
monitoring signals to determine the status of AEi denoted by xi,j, j = 1, 2,
…, M. The probability estimation problems for an environmental node
can be formulated as: given the reference signal value and the alarm
threshold of xi,j, and the process operating measurements xi(t) = [xi,1(t),
…, xi,M(t)]T, to estimate the probability of its monitoring parameters
exceeding the alarm threshold PMONi (t)at time t.
In order to combine multiple signals, the signal residual of the i-th
environmental node is calculated by a multivariate logic solver
ri,j(t) =
⃒
⃒xi,j(t) − x0,i,j
⃒
⃒
xa,i,j
, ri(t) =
1
M
∑M
j=1
ri,j(t), (3)
where x0,i,j is the reference signal value of xi,j, xa,i,j is a positive value
equal to the modulus of the difference between the alarm threshold and
reference value, and therefore, ri(t) > 1 means that the residual value of
AEi is over a threshold value, that is, xi(t) < xi,0-xi,a or xi(t) > xi,0 + xi,a.
Let xU,i,j and xL,i,j represent the upper limit and lower limit of xi,j,
respectively. Oftentimes, only one threshold (upper or lower limit) exists
for a basic event. Therefore, when only xU,i,j exists and xi,j(t) > x0,i,j, or
only xL,i,j exists and xi,j(t) < x0,i,j, the value of ri,j(t) is calculated by Eq.
(3); otherwise, ri,j(t) is equal to 0.
The probability PMON increases when the monitored residual moves
further away from the normal zone. This behavior can be captured using
a cumulative normal distribution [41]. Therefore, the probabilityPMONof
AEi at time t is defined as
PMONi (t) = φ
(
ri(t) − 1
σ
)
=
∫ ri(t)
− ∞
1̅̅̅
̅̅
2π
√
σ
e
(r− 1)2
2σ2 dr, (4)
where σ = 1/3 up to the three-sigma rule.
Fig. 3 shows a visual depiction of the probability estimation. The
probability PMON for a point on the line ri = 0 is 0, and for a given point
on the threshold ri = 1 is equal to 0.5. When the online residual ri(t) is at
the threshold (ri =1), the probability of monitoring data exceeding the
alarm threshold is 0.5 as it can either go back to the normal zone or keep
growing and lead to the abnormal event with the same probability.
In this way, it is possible to estimate the probability that the moni-
toring parameters of an environmental event exceed the alarm threshold
based on data monitoring in real-time. However, PMON is not equal to the
probability of the occurrence of environmental events. In Section 3.2, we
combine PMON and prior knowledge to estimate the real-time risk for all
dynamic events.
3.1.2. Probability estimations for process nodes
Unlike in environmental nodes, the monitoring parameters in pro-
cess nodes will be affected by varying working conditions. As operating
conditions vary over time, the reference signal values and alarm
thresholds of a process node may also change accordingly. Often, many
fault-free monitoring data can be collected from systems under normal
operating conditions. Therefore, let yi,j with j = 1, 2, …, M denotes the j-
th process monitoring signal corresponding to the i-th process node. The
probability estimation problems for process nodes can be formulated as:
given historical fault-free data and online signalszi,t = [zi,1,t ,zi,2,t ,...,zi,M,t],
i = m+1, m+2,…, m+n, to construct a nonlinear residual model for the i-
th process node to estimate the probability of its residual signal
exceeding the alarm threshold PMONi (t) at time t.
Gaussian mixture model (GMM) can represent different operation
scenarios in model building [42]. Therefore, GMM is applied to
construct the residual model in this work. The probability density of a
given set of training sample yi can then be approximated by GMM as
P(yi|Λi) =
∑K
k=1
βi,kg
(
yi|λi,k
)
,
∑K
k=1
βi,k = 1, (5)
where yi = [yi,1, …, yi,M]T is the given M-dimensional measurement
sample, K is the number of Gaussian components, λi,k is a parameter
vector containing mean value vector μi,k and covariance matrix Σi,k, and
βi,k denotes the prior probability of the k-th Gaussian component.
Conceptually, we denote the GMM compactly as Λi = {βi, μi, Σi}, where
βi = [βi,1, …, βi,K], μi = [μi,1, …, μi,K], and Σi = [Σi,1, …, Σi,K].
In the offline stage, GMM parameters Λi = {βi, μi, Σi} are learned
based on the Expectation Maximization (EM) algorithm by maximizing
the log-likelihood of observing historical data. The minimum message
length (MML) criterion is used to search the optimal number of Gaussian
components. More details to train GMM can be found in [43].
When new signals zi,t are measured at time t in the online stage, the
GMM model can be transformed into a dynamic form, and the proba-
bility distribution p(yi,t+1| zi,t) at the time step (t+1) can be predicted via
the Chapman-Kolmogorov equation
P
(
yi,t+1
⃒
⃒zi,t
)
=
∫
P
(
yi,t+1
⃒
⃒yi,t
)
P
(
yi,t
⃒
⃒zi,t
)
dyi,t
≈
∑K
k=1
βi,t,k
∫
g
(
yi,t|μi,t,k ,Σi,t,k
)
P
(
yi,t+1
⃒
⃒yi,t
)
dyi,t,
(6)
where μtn, Σtn, and βtn denotes the updated mean value vector,
Fig. 3. Changes in the probability of signals exceeding alarm threshold with
monitored residual.
R. He et al.
Reliability Engineering and System Safety 226 (2022) 108700
5
covariance matrix, and prior probability of the n-th Gaussian component
at the time step t, respectively.
According to the idea of particle filter (PF) [44], the posterior
probability distribution P(yi,t| zi,t) can be represented by a set of random
particles y(p)i,t , p = 1, 2, …, L, with corresponding weight ω
(p)
i,t , and thus
Eq. (6) can be represented as
P
(
yi,t+1
⃒
⃒zi,t
)
≈
∑L
p=1
ω(p)i,t δ
(
yi,t − y(p)i,t
)
P
(
yi,t+1
⃒
⃒yi,t
)
=
∑L
p=1
ω(p)i,t P
(
yi,t+1
⃒
⃒
⃒y(p)i,t
)
,
(7)
whereδ(⋅)stands for the Dirac delta function.
Initially, the weight of each particle is set as 1/L, and the weight is
recursively updated based on the likelihood of the observation zi,t. To
address the degeneracy problem, resampling described in [44] is applied
in each step to eliminate the particles with small weights and duplicate
the particles with high weights. After that, we can obtain a new set of
particles y(p)i,t+1, p = 1, 2, …, L, with the same weight. The parameters of
GMM can then be updated by a one-step EM algorithm [45]
γ(p)i,t+1,k =
βi,t,kg
(
y(p)i,t+1|μi,t,k ,Σi,t,k
)
∑K
k=1βi,t,kg
(
y(p)i,t+1|μi,t,k ,Σi,t,k
), (8)
μi,t+1,k =
(
∑L
p=1
γ(p)i,t+1,k
)− 1
∑L
p=1
y(p)i,t+1γ
(p)
i,t+1,k , (9)
Σi,t+1,k =
(
∑L
p=1
γ(p)i,t+1,k
)− 1
∑L
p=1γ(p)i,t+1,k
(
y(p)i,t+1 − μi,t+1,k
)(
y(p)i,t+1 − μi,t+1,k
)T
,
(10)
βi,t+1,k =
1
L
∑L
p=1
γ(p)i,t+1,k. (11)
When the parameters of GMM at the time step (t+1) are available,
the residual signal at the time step (t+1) can be defined with the updated
parameters of Gaussian components as [46]
ri(t+ 1) =
∑K
k=1
P
(
Ci,k
⃒
⃒zi,t+1
)
PMn
(
zi,t+1
)
, 0 < ri(t+ 1) < 1, (12)
with
P
(
Ci,k
⃒
⃒zi,t+1
)
=
βi,t+1,kg
(
zi,t+1
⃒
⃒λi,t+1,k
)
∑K
k=1βi,t+1,kg
(
zi,t+1
⃒
⃒λi,t+1,k
), (13)
PMn
(
zi,t+1
)
= P
{
Dr
(
yi,t+1,Ci,k
)
≤ Dr
(
zi,t+1,Ci,k
)}
, (14)
where P(Ci,k|zi,t+1) denotes the posterior probability of the monitored
zi,t+1 belonging to the k-th Gaussian component, and PMn(zi,t+1) is the
Mahalanobis distance of zi,t+1 from the center μi,t+1,k, which can be
estimated as
Dr
(
zi,t+1,Ci,k
)
=
(
zi,t+1 − μi,t+1,k
)T ( Σi,t+1,k + εI
)− 1( zi,t+1 − μi,t+1,k
)
, (15)
where ε is a small positive number to avoid an ill-conditioned covariance
matrix.
Since Dr follows χ2 distribution with an appropriate degree of
freedom, the threshold of the monitored residual signal should be set as
the value of the confidence level (1-α), then the probability of monitored
signals exceeding the alarm threshold for the i-th process node at time
step (t+1) can be defined as
PMONi (t+ 1) = φ
(
ri(t + 1) − 1 + α
σm
)
=
∫ ri(t+1)
− ∞
1
̅̅̅̅̅
2π
√
σm
e
(r− 1+α)2
2σ2m dr, (16)
where σm = (1-α)/3 according to the three-sigma rule. It is worth noting
that the monitored residuals larger than the threshold (1-α) should be
scaled to ensure that when ri(t+1) is equal to 1, PMONi (t + 1)is also
approximately equal to 1. The whole procedure for estimating the
probability of process data exceeding the alarm threshold is summarized
in Algorithm 1 below.
3.2. Integrated real-time risk analysis
When the temporal dependencies are introduced, the probability
function of the top event (accident) can be transformed to the dynamic
form from Eq. (1), and the accident probability at time tk is predicted as
PTE(tk) = fFT(PBE1 (tk),PBE2 (tk),…,PBEN (tk)). (17)
As shown in Fig. 4, DBN is used to present the dynamic behavior of
risk models. Each BN node has two functional states, i.e., Occurrence
(Occ) and Non-occurrence (Non). Since the state of basic events is
observed firmly based on data monitoring at each time, the conditional
probability distribution of P(BEi(tk)|BEi(tk − 1)) is neglected [47].
Therefore, the risk index (posterior probability) of each basic event can
be estimated based on the Bayes rule
P(BEi(tk) = Occ|TE = Occ)
=
P(TE = Occ|BEi(tk) = Occ)PBEi (tk)∑
BEi(tk)=Occ,Non
P(BEi(tk))P(TE = Occ|BEi(tk))
, i = 1, 2,…N. (18)
From the above equations, we can see that the key point of PRA is to
estimate the probability of BEi at time tk. In this paper, a BN (as shown in
the red frame in Fig. 4) is developed to describe the dependencies be-
tween prior probability PBEi and the corresponding alarm probability
PMONi for the i-th event, and then we can haveP(BEi, MONi) =
P(MONi|BEi)PBEi . Since PMONi obtained in Section 3.1 is a continuous
value between 0 and 1, the continuous probability needs to be dis-
cretized to construct the CPT of node MONi. Assume that the state of the
node MONi is divided into Q discretization states S = {S1, S2, …, SQ},
where Q should be an even number and the state SQ corresponds to the
probability from (Q − 1)/Q to 1. According to the definition in Section
3.1, the probability PMON is 0.5 for a given point on the alarm threshold.
As shown in Fig. 4, we assume that the monitored signal of an event
obeys a normal distribution, in which its probability density function
(PDF) is denoted byf(⋅); the upper and lower limits of thef(⋅) are denoted
by TU and TL, respectively; and τ is the alarm threshold set for the
Algorithm 1
Probability estimation for process nodes based on GMM and PF
Offline
Inputs: historical fault-free data yi
Outputs: Λi = {βi, μi, Σi}
Step: Train GMM using the EM algorithm.
Online (at time t+1)
Inputs: Λi,t = {βi,t, μi,t, Σi,t}, zi,t, zi,t+1
Outputs: Λi,t+1 = {βi,t+1, μi,t+1, Σi,t+1}, PMONi(t+1)
Step 1: Generate a set of random particlesy(p)i,t , p = 1, 2, …, L, according to the Λi,t =
{βi,t, μi,t, Σi,t}.
Step 2: Calculate the weight of particles based on zi,t.
Step 3: Resample to eliminate particles with small weights and obtain a new set of
particlesy(p)i,t+1, p = 1, 2, …, L.
Step 4: Update the GMM parameters by Eq. (8)-(11) and get Λi,t+1 = {βi,t+1, μi,t+1,
Σi,t+1}.
Step 5: Calculate the monitored residual ri(t+1) based on Λi,t+1 and zi,t+1 by Eq. (12)-
(15).
Step 6: Calculate the probability PMONi(t+1) by Eq. (16)
R. He et al.
Reliability Engineering and System Safety 226 (2022) 108700
6
monitored signal. If the state of basic event is Non, the monitored signal
will be in the range from TL to τ (corresponding to PMON from 0 to 0.5)
with the probability of
∫ τ
TL
f(⋅) and will be in the range from τ to TU
(corresponding to PMON from 0.5 to 1) with the probability of
∫ TU
τ f(⋅),
and vice versa. Using this idea, the distribution of the monitored signal
can be scaled into CPT of BN. The conditional probability P(Sj |BE) can
be estimated with a pre-set threshold τ and PDFf(⋅)as
P
(
Sj
⃒
⃒BE
)
=
⎧
⎪⎪⎪⎪⎪⎪⎪⎨
⎪⎪⎪⎪⎪⎪⎪⎩
∫ 2τj/Q
2τ(j− 1)/Q
f (x)dx
∫ TU
TL
f (x)dx
, 0 <j ≤
Q
2
∫ τ+(2j− Q)(TU − τ)/Q
τ+(TU − τ)(2j− Q− 2)/Q
f (x)dx
∫ TU
TL
f (x)dx
,
Q
2
< j ≤ Q
. (19)
When the historical data is available, for example, when the histor-
ical fault-free data is useful, the kernel density estimation (KDE) can be
used to estimate the PDF of the residual signal under normal conditions,
and the estimated PDF can be applied as the f(⋅) for P(Sj
⃒
⃒BE = Non). The
∑j=Q
j=Q/2+1P(Sj
⃒
⃒BE= Non) can denote the false alarm rate (FAR), and the
∑j=Q
j=Q/2+1P(Sj
⃒
⃒BE= Occ) can represent the fault detection rate (FDR).
When no historical data is available, thef(⋅)can be defined as the PDF of
normal distribution N(μ,σ), and the TU and TL can be set as (μ+3σ) and
(μ-3σ), separately. The value of μ and σ can be adjusted arbitrarily to
ensure TL = 0. A high value of FAR indicates many false alarms will be
triggered when the process is normal, while a low FDR implies few
alarms can be raised when the process is abnormal. The threshold τ is set
to determine the value of FAR and FDR, which can represent the un-
certainty in the risk monitoring model. For the case of BE = Non, the
threshold τ can be set as (μ+2.5σ) to guarantee nearly 95% alarms are
correct for the fault-free case; for the case of BE = Occ, the threshold τ
can be set as (μ-2.5σ) to ensure only 5% alarms can not be triggered for
the faulty case. When PMONi is observed, the corresponding state of the
node MONi, denoted by S∗i , can be obtained, and PBEi (tk) can then be
computed as follows
PBEi (tk) =
P
(
S∗i
⃒
⃒BEi
)
PBEi
∑
BEi=Occ,NonP(BEi)P
(
S∗i
⃒
⃒BEi
). (20)
The CPT in Fig. 4 shows an example where Q and μ are 10 and 3,
respectively. As an example, for a basic event, if the alarm probability
PMON is 0.25 and prior probability PBE is 0.05, the S∗ will be S3, and the
updated probability will be 9.14× 10− 4 × 0.05/(9.14 × 10− 4 × 0.05 +
0.4072 × 0.95) = 1.18× 10− 4. If PMON is less than 0.5, the updated
probability will be less than the prior probability.
Furthermore, based on the idea of discretization, the P(MONi(tk)|
BEi) at time tk can be computed directly by
P(MONi(tk)|BEi) =
⎧
⎪⎪⎪⎪⎪⎪⎪⎨
⎪⎪⎪⎪⎪⎪⎪⎩
∫ 2τPMONi (tk)+Δ
2τPMONi (tk)− Δ
f (x)dx
∫ TU
TL
f (x)dx
, 0 ≤ PMONi (tk) ≤ 0.5
∫ 2τ− TU+2PMONi (tk)(TU − τ)+Δ
2τ− TU+2PMONi (tk)(TU − τ)− Δ
f (x)dx
∫ TU
TL
f (x)dx
, 0.5 < PMONi (tk) ≤ 1
,
(21)
where Δ is a small positivenumber representing the discrete interval. By
Eq. (21), we can directly input the value of PMONi into the model without
identifying its corresponding state S∗i , so that we can get continuously
updated probability based on different PMONi . It is worth mentioning that
both the first and the second equations of P(MONi(tk)|BEi) will be equal
to zero when PMONi (tk) is equal to zero or one. Therefore, in this work, we
compress the boundary of PMONi (tk) to a new boundary from 0.2 to 0.8 by
PMONi (tk) = 0.6× PMONi (tk)+ 0.2. The figure on the left side of CPT in
Fig. 4 (as shown in the blue frame) shows the value of P(MON(tk)| BE)
with Δ equal to 0.2. Finally, thePBEi (tk)can be computed based on the
value ofP(MONi(tk)|BEi = Occ)andP(MONi(tk)|BEi = Non) as follows
Fig. 4. Diagram of the proposed PRA method.
R. He et al.
Reliability Engineering and System Safety 226 (2022) 108700
7
PBEi (tk) =
P(MONi(t)|BEi = Occ)PBEi∑
BEi=Occ,NonP(BEi)P(MONi(t)|BEi)
. (22)
4. Implementation and model discussion
In this section, two case studies are used to demonstrate the benefits
of the proposed method brought to PRA. In Section 4.1, an experimental
study with comparative analysis is presented. Then in Section 4.2, data
monitoring-based PRA is performed for MPD operations to evaluate the
proposed model.
4.1. Case study 1: RT 580 experimental setup
4.1.1. System description
In the first case, the RT 580 experimental setup (shown in Fig. 5)
[48], situated at the Memorial University, is applied to demonstrate the
proposed method. The RT 580 setup controls the level of process tank
(B2) at 40% through a circuit with a collecting tank (B1), pump (P1),
and valves. The pneumatic control valve (V7) is used as the actuator.
The FT shown in Fig. 6 represents the accident mechanism of “dry-out in
tank B2”. Eight basic events (BE1 ~ BE8) are considered, and their prior
probabilities can be found in [48].
The flow rate and level in the process tank are monitored during the
experiment. The basic event BE1 “Sensor is broken” is introduced into
the system. Fig. 7 shows the experimental data collected from the stable
operation to the failure of the system. When the level sensor is broken,
the controller acquires the default value (high value) of the level in the
tank. The controller attempts to close the control valve (V7), causing the
flow and level in tank B2 to drop. Since the system operates in a signal
condition, we consider the node BE1 as an environmental node. Ac-
cording to Ref. [48], a flow rate less than 100 L/h and a level less than
10% will cause a dry-out situation.
Therefore, the probability of BE1 is determined by flow rate and level
monitoring, and the normal reference value and the threshold value of
flow rate and level are set as {400, 40} and {100, 10}, respectively.
4.1.2. PRA results and discussions
The figure on the left-hand side of Fig. 8 shows the probability of BE1
signals exceeding alarm thresholds. The real-time probability of BE1
shown in the middle figure of Fig. 8 can be calculated by integrating its
prior probability and alarm probability. The first two figures in Fig. 8
present probability profiles with similar growth trends. As defined in
Section 3.1, a flow rate less than 100 L/h and a level less than 10% will
result in the alarm probability of BE1 higher than 0.5, indicating that the
abnormality of its monitored signal keeps growing and may lead to
abnormal events. Although the alarm probability by the end of BE1 is as
high as a probability close to 0.9, the real-time probability of BE1 is still
small due to the low prior probability. In this case, the real-time prob-
ability of BE1 is consistent with its prior risk knowledge. The figure on
the right-hand side of Fig. 8 shows the real-time probability of dry-out
event calculated by the proposed method and dynamic fault tree
(DFT) [48]. In DFT, the probability of BE1 (sensor is broken) is equal to 1
when the flow rate is less than 100 L/h or the level is less than 10%,
which will result in a real-time probability of 1 for the dry-out event.
However, when the process variable exceeds the alarm threshold, the
corresponding accident may not always occur. Alarms can also be
caused by process disturbances or signal noise. In contrast, the accident
probability predicted by the proposed method does not increase
immediately when the process variables exceed the alarm threshold. The
probability curves of the dry-out accident undertaken with the proposed
method are smoother and show significant variations in values
compared to the normal case, and thus are more reasonable.
To search for critical basic events, the normalized ratios of posterior
probabilities are compared. Fig. 9 presents the static posterior estimates
of basic events in the RT 580 setup when the dry-out event occurs.
Before the level sensor is damaged, static posterior analysis can find
some basic events with potential dangers of the dry-out accident, such as
BE1, BE7, and BE8. Fig. 10 compares posterior estimates of basic events in
the event of a dry-out accident between the proposed method and DFT.
Since BE1 has been monitored and the monitoring results show that BE1
is abnormal when the accident occurs, the posterior probability ratio of
BE1 increases rapidly and is significantly higher than other basic events.
Both of the proposed method and DFT can find the most critical event in
real-time. However, on the basis of posterior results by DFT, the pro-
posed method can further find that the posterior probability of BE8 is
slightly higher than the other six events, indicating that BE8 still needs
further attention.
4.2. Case study 2: managed pressure drilling operations
4.2.1. System description
Managed pressured drilling (MPD) system is a typical closed-loop
kick control system used in deepwater drilling operations. As shown in
Fig. 11, MPD utilizes the choke and the pump to adjust annular pressure
to meet the requirements of bottomhole pressure (BHP) and observes the
BHP condition based on the measurements of flow and hydrodynamic
pressure. Fig. 12 shows the FTA of MPD operations. The top event (kick)
results from the combined effect of both the abnormal BHP and the
failure of the MPD system [50]. Twenty-one basic events (BE1 ~ BE21)
are listed with the potential accidents. The prior probability PBEi, i = 1, 2,
…, 21 of basic events can be found in [50].
In this case, all basic events are divided into two groups; the first
group,
GBE = {BE7,BE8,BE12,BE14,BE16,BE18}, (23)
represents the events that can be updated in real-time, in which the
probability of BE12 and BE14 can be set as 1 when the corresponding
signal is lost, whereas the probabilities of the remaining basic events are
assumed to be constant values equal to their prior probabilities. In
addition, the BE7 (abnormal pressured zone) and BE8 (loss of circulation)
are regarded as environmental nodes, and their probabilities are upda-
ted based on fluid and pressure monitoring. The BE16 (rig pump failure)
and BE18 (backpressure pump failure) are considered process nodes
since their reference signal values and alarm thresholds can be affected
by changes in pump operating conditions.
Data collected from MPD in conjunction with experimental equip-
ment data are used in the second case study. The fluid and pressure Fig. 5. The simplified flow diagram of RT 580 experimental setup.
R. He et al.
Reliability Engineering and System Safety 226 (2022) 1087008
measurement data come from the actual drilling of Well GS19 in the
Qixia Formation [51]. The density of injected and returned drilling fluid
is measured with a flowmeter to identify overflow or lost circulation,
Fig. 6. FT for “Dry-out in tank B2” incident (adapted from the Ref. [48]).
Fig. 7. Flow rate and level measurements during the experiment [48].
Fig. 8. Forward PRA updating of BE1 and dry-out event.
Fig. 9. Static posterior estimation of RT 580 setup.
R. He et al.
Reliability Engineering and System Safety 226 (2022) 108700
9
and the automatic choke manifolds are used to regulate wellhead back
pressure to maintain the BHP in a safe range. Fig. 13 shows the MPD
data, e.g., casing pressure, inlet density, outlet density, and BHP
equivalent circulation density, collected from the drilling depth of 4035
m to 4235 m. Two layers with lost circulation were encountered at a
depth of 4145.13 m and 4232.22 m, while the lost layers were plugged
successfully through the automatic injection of bridging mud at
managed pressure. In addition, it can be seen from the change in
pumping rate that the operating condition of the rig pump occurs at
depths of 4140 m, 4210 m, and 4230 m. Observed from the dataset,
higher casing pressure and BHP equivalent circulation density indicate a
high risk of BE7, and the outlet density increases and is slighter higher
than the inlet density when the loss of circulation occurs. In practice, a
basic event can be modeled based on multiple signals using similar
approaches.
The health status of oil pumps can be represented by the vibration
features of the bearing inside the pump, and thus, the vibration data of
the bearing provided by Case Western Reserve University (CWRU)
Bearing Data Center [52] is used to test the monitoring ability of the
proposed method. The database includes normal bearing monitoring
data in four conditions, labeled from Normal 1 to Normal 4, corre-
sponding to the motor speed from 1797 rpm to 1730 rpm. We assume
that 10000 vibration data points can be obtained for every 10 m step of
drilling depth to match the drilling process in Fig. 13. To simplify the
Fig. 10. Posterior estimation of RT 580 setup at time 180 s.
Fig. 11. MPD Process Flow Diagram (adapted from the Ref. [49]).
R. He et al.
Reliability Engineering and System Safety 226 (2022) 108700
10
model, only the vibration data of the drive end acceleration are used.
The rig pump is assumed to be fault-free during the whole drilling
process but has multiple working conditions up to the actual case,
whereas for the backpressure pump, an abrupt rolling element fault is
assumed to occur at a depth of 4200 m. In this case, the rest of the
Normal 1 and Normal 4 data are used to train the monitoring model for
the rig pump and backpressure pump, respectively. The use of vibration
data is shown in Fig. 14. More details of the CWRU database can be
found in [53].
4.2.2. Real-time PRA results
In this section, the PRA of MPD operations is performed following the
procedure in Fig. 2. According to the actual drilling of Well GS19 [51],
the BHP is kept at a slight over-balance during drilling, and the annular
back pressure and BHP equivalent circulation density are controlled at
2–5 MPa and 2.46–2.52 g/cm3. Therefore, the normal reference value
and the threshold value of casing pressure, BHP equivalent circulation
density, and the difference between outlet and inlet densities are set as
{3.5, 2.49, -0.03} and {5, 2.52, 0}, respectively. In addition, three sta-
tistical time-domain features are extracted from the vibration
measurements
⎧
⎪⎪⎪⎪⎪⎪⎪⎪⎪⎪⎪⎪⎪⎪⎪⎪⎪⎪⎨
⎪⎪⎪⎪⎪⎪⎪⎪⎪⎪⎪⎪⎪⎪⎪⎪⎪⎪⎩
x1 =
̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅
1
N
∑N
k=1
y2k
√
√
√
√
x2 =
̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅
1
N − 1
∑N
k=1
(yk − y)2
√
√
√
√
x3 =
̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅
1
N
∑N
k=1
y2k
√
1
N
∑N
k=1
|yk|
, (24)
where x1 is the root mean square, x2 is the standard deviation, and x3 is
the shape factor [54]. In Eq. (24), y is the vibration signal, and N is the
number of vibration points equal to the product of time interval length
and sampling frequency. In this case, N is equal to 100. The EM algo-
rithm is used to train the GMM based on the historical vibration data,
and then we can obtain
Fig. 12. FTA for kick control incident (adapted from the Ref. [50]).
βBE16 = [ 0.038 0.070 0.264 0.629 ],
μBE16 =
⎡
⎢
⎣
0.064 0.087 0.089 0.065
0.057 0.084 0.088 0.063
1.266 1.221 1.179 1.230
⎤
⎥
⎦,
Σ(1)BE16 =
⎡
⎢
⎣
5.64e − 5 6.24e − 5 5.33e − 5
6.24e − 5 8.31e − 5 1.50e− 4
5.33e − 5 1.50e− 4 2.30e− 3
⎤
⎥
⎦,Σ(2)BE16 =
⎡
⎢
⎣
1.20e − 4 1.19e − 4 − 5.95e − 6
1.19e − 4 1.27e − 4 2.21e− 5
− 5.95e − 6 2.21e− 5 7.60e − 4
⎤
⎥
⎦
Σ(3)BE16 =
⎡
⎢
⎣
9.28e − 5 9.04e − 5 − 1.27e − 5
9.04e − 5 9.09e − 5 − 1.47e − 5
− 1.27e − 5 − 1.47e − 5 4.49e − 4
⎤
⎥
⎦,Σ(4)BE16 =
⎡
⎢
⎣
1.22e− 4 1.21e − 4 − 7.83e − 5
1.21e − 4 1.22e − 4 − 8.20e − 5
− 7.83e − 5 − 8.20e − 5 1.40e − 3
⎤
⎥
⎦,
(25)
R. He et al.
Reliability Engineering and System Safety 226 (2022) 108700
11
where ΛBE16 = {βBE16, μBE16, ΣBE16} and ΛBE18 = {βBE18, μBE18, ΣBE18} are
the trained GMM for BE16 and BE18, respectively.
After determining model parameters, the probability PMON7 and
PMON8 that online signals exceed alarm thresholds are calculated for BE7
and BE8 based on fluid and pressure monitoring by Section 3.1.1. In
addition, with the assumption that the interval length is equal to 1 m, the
alarm probabilities PMON16 and PMON18 are computed in real-time for BE16
and BE18 based on the trained GMMs and the dynamic updating method
shown in Algorithm 1. Fig. 15 exhibits the results of real-time proba-
bility PMON for four basic events. As depicted in Fig. 15 (a), when the lost
circulation occurs at a depth of 4145.13 m and 4232.22 m, PMON8 (Loss
of circulation) increases significantly. However, since this case only
applies a single variable to evaluate the lost circulation, the measure-
ment noise will seriously affect the evaluation results, so that sometimes
under normal working conditions, BE8 also has a higher alarm proba-
bility PMON8 . In contrast, PMON7 (abnormal pressured zone) is assessed by
two variables and thus has higher stability. As shown in Fig. 15 (b),
PMON16 (rig pump failure) will not change significantly due to the
changes in operating conditions. Meanwhile, the proposed method can
effectively monitor the fault of BE18 (backpressure pump failure) at a
depth of 4200 m.
Next, the estimated alarm probabilities PMONi are integrated with
prior probabilities to evaluate the risk of dynamic nodes in real-time.
According to Eq. (21) in Section 3.2, the conditional probability P
(MONi| BEi) at each depth can be computed based on PMONi , and P(MONi|
BEi) is incorporated into prior probabilities to calculate the real-time
risks of dynamic nodes by Eq. (22). The obtained real-time probability
of basic events BE7, BE8, BE16, and BE18 is shown in Fig. 16. In this case,
the model parameter Δ is set to 0.2, and μ is set to 3. Detailed parameter
settings will be discussed further in Section 4.2.4. It can be observed that
if the alarm probability PMON obtain in Fig. 15 is less than 0.5, the risk of
basic events in Fig. 16 will not change significantly. Fig. 16 shows that
therisk of BE8 increases when lost circulation occurs at depths of
Fig. 14. Vibration measurements for the drilling process.
Fig. 13. Measurements during the MPD of Well GS19 [51]. The BHP equivalent
circulation density is only monitored at depths of 4090 m and 4170 m.
βBE18 = [ 0.046 0.074 0.053 0.162 ],
μBE18 =
⎡
⎢
⎣
0.062 0.062 0.075 0.074
0.055 0.061 0.073 0.070
1.238 1.231 1.290 1.207
⎤
⎥
⎦,
Σ(1)BE18 =
⎡
⎢
⎣
4.93e − 5 3.79e − 5 2.60e − 5
3.79e − 5 3.83e − 5 2.88e− 5
2.60e − 5 2.88e− 5 1.10e− 3
⎤
⎥
⎦,Σ(2)BE18 =
⎡
⎢
⎣
7.97e − 5 7.81e − 5 − 1.89e − 5
7.81e − 5 7.93e − 5 − 1.67e− 5
− 1.89e − 5 − 1.67e− 5 9.94e − 4
⎤
⎥
⎦
Σ(3)BE18 =
⎡
⎢
⎣
1.28e − 4 1.24e − 4 − 1.99e − 5
1.24e − 4 1.25e − 4 − 1.94e − 5
− 1.99e − 5 − 1.94e − 5 1.20e− 3
⎤
⎥
⎦,Σ(4)BE18 =
⎡
⎢
⎣
7.99e− 5 8.22e − 5 − 3.64e − 5
8.22e − 5 9.47e − 5 − 3.23e − 5
− 3.64e − 5 − 3.23e − 5 1.00e − 3
⎤
⎥
⎦,
(26)
R. He et al.
Reliability Engineering and System Safety 226 (2022) 108700
12
4145.13 m and 4232.22 m, and the risk of BE18 rises when pump fault
occurs at a depth of 4200 m. Therefore, the proposed method is vali-
dated to be effective.
When we get the risk of all basic events at a certain drilling depth, the
probability of kick can be estimated by Eq. (17). As shown in Fig. 17 (a),
a single lost circulation accident will not significantly increase the
probability of a kick due to the role of the MPD system as a safety barrier.
When the MPD system can work normally, the closed-loop control of
MPD can balance the BHP to alleviate the lost circulation and avoid the
kick, and also, this phenomenon is entirely consistent with the actual
drilling case [51]. In contrast, the MPD pump equipment failed at a
depth of 4200m and could not adjust the BHP normally, and thus the
probability of kick increased instantly. At a depth of 4232 m, the lost
circulation and the backpressure pump fault occur at the same time, and
therefore, the probability of kick reaches the highest value, which also
indicates that the kick is a result of the combined effect of both the
abnormal BHP and the failure of the MPD system. As a comparison,
Fig. 17 (b) shows the kick probability calculated without considering the
prior probability of basic events, which is a similar idea that has been
widely used in many reported works [33–37]. Although the comparative
Fig. 16. The real-time risk of basic events.
Fig. 15. The probability of online signals exceeding alarm thresholds.
Fig. 17. The real-time probability of kick.
R. He et al.
Reliability Engineering and System Safety 226 (2022) 108700
13
method can successfully identify the rise in risk after the depth of 4220
m, the obtained kick risks have large fluctuations due to measurement
noise, even when the MPD system is working normally (from a depth of
4035 m to 4150 m).
Beyond the real-time probability analysis of kick, it is also essential
to discover the critical events with potential kick risk. The posterior risk
estimation can be performed based on obtained real-time probabilities
by Eq. (18) and the proposed method. In this case, the kick is adopted as
the top event to calculate the posterior probability of basic events.
Fig. 18 presents the normalized posterior estimates at some
representative drilling depths. The first graph in Fig. 18 shows the result
of the static posterior analysis. Before drilling operations, static poste-
rior analysis can find some basic events with potential dangers of kick
accident, such as BE1, BE2, BE4, BE7, BE15, and BE16. However, since BE7
and BE16 have been monitored and the monitoring results show that
they are normal, the posterior probability ratio of the events BE7 and
BE16 decreases at a depth of 4060 m. At a depth of 4150 m, the lost
circulation occurs, and thus, the BE8 (loss of circulation) increases,
indicating that the BE8 is quite critical and should be noted at this depth.
From the depth of 4150 m to 4210 m, the lost layers are successfully
Fig. 18. Real-time posterior probability of MPD operations using the proposed method.
R. He et al.
Reliability Engineering and System Safety 226 (2022) 108700
14
plugged, but at a depth of 4210 m, the backpressure pump malfunction
occurs and the posterior probability ratio of the events BE18 increases
correspondingly. In addition, with the rise of the posterior probability of
BE18, the posterior probability of BE19 also increases, and thus BE19 and
BE18 may have similar accident propagation paths. A similar analysis
can be conducted for the last two graphs. At a depth of 4230 m, the rig
pump changes its operating conditions to suppress the lost circulation so
that the posterior probability of BE8 and BE16 is slightly increased. At a
depth of 4235 m, the rig pump returns to its initial operating condition,
and the corresponding posterior probability drops. For comparison,
Fig. 19. Real-time posterior probability of MPD operations without considering the prior probability.
Fig. 20. The probability of BE16 signals exceeding alarm thresholds.
R. He et al.
Reliability Engineering and System Safety 226 (2022) 108700
15
when prior risk is not considered, the posterior risks of BE8 (loss of
circulation) are still high, as shown in Fig. 19, even if the layers are not
lost, or the lost layers are successfully plugged (at depths of 4060 m,
4230 m, and 4235 m). In addition, at a depth of 4150 m, the comparative
method incorrectly gets a high posterior risk of BE16 since the prior risk
of rig pump failure is not incorporated in the estimation. Overview, the
static risk analysis is hard to capture the deviations and changes in MPD
operations, and the risk cannot be well estimated if only the monitoring
information is considered without considering prior risks. However,
once the prior risks are to be considered, CPTs need to be set up to
connect prior probabilities to data monitoring, which often requires
historical accident data or expert knowledge in reported works. There-
fore, the proposed real-time PRA is particularly effective in evaluating
the time-dependent behavior of MPD operations in the presence of data
scarcity and missing expertise.
4.2.3. The comparison of GMM and GMM-PF
MPD is a typical intermittent operation system. The operating con-
dition of the rig pump and backpressure pump will change over time,
and the transient behavior can occur during phase shifts. GMM can be
utilized to construct the monitoring model for process nodes. However,
as the condition of mechanical equipment may change in MPD opera-
tions, the component parameters of GMM require updates over time.
Therefore, PF is developed to update the GMM in Algorithm 1. This part
is applied to demonstrate the positive impact of particle filtering-based
parameter updating.
As shown in Fig. 14 (a), the operating condition of the rig pump will
change during the drilling. Since the features in Eq. (24) cannot distin-
guish different operating condition, we build a simple neural network
model to extract a new set of three featuresfrom vibration signals, which
can effectively present the difference between different conditions, and
the input of the neural network is 100 vibration points. New features are
used to train the GMM for the rig pump (BE16) and the backpressure
pump (BE18). The probability of BE16 and BE18 signals exceeding alarm
thresholds based on GMM are shown in Fig. 20 (a) and Fig. 21 (a),
respectively. When we use PF to update the GMM parameters, the ob-
tained alarm probability of BE16 and BE18 are shown in Fig. 20 (b) and
Fig. 21 (b). If the alarm probability exceeds 0.5, it means that the cor-
responding mechanical equipment may be malfunctioning. As presented
in Fig. 20 (a), the GMM-based monitoring model will incorrectly eval-
uate other normal operating conditions as faults. Specifically, the motor
speed of the rig pump changes from 1797 rpm to 1750 rpm at a depth of
4140 m and increases from 1750 rpm to 1772 rpm at a depth of 4210 m.
Fig. 21. The probability of BE18 signals exceeding alarm thresholds.
Fig. 22. Sensitivity analysis of μ
Fig. 23. Sensitivity analysis of Δ
R. He et al.
Reliability Engineering and System Safety 226 (2022) 108700
16
Although the rig pump still works normally, the GMM-based monitoring
model incorrectly identifies the change in operating conditions as a fault
since most of the corresponding probabilities exceed 0.5 after depths of
4140 m and 4210 m. In contrast, as shown in Fig. 20 (b), PF can effec-
tively adjust the GMM parameters to a reasonable range so that GMM-PF
will not be too sensitive to changes in pump operating conditions. In
addition, at a depth of 4200 m, the backpressure pump experienced a
sudden rolling element fault, as shown in Fig. 14 (b). As presented in
Fig. 21, both GMM and GMM-PF can identify the fault of the back-
pressure pump at a depth of 4200 m. Overall, the proposed GMM-PF
method is more applicable when the monitoring parameters are
affected by varying working conditions.
4.2.4. Model parameter analysis
This work contains three models presented in Section 3. The first
monitoring model shown in Section 3.1.1 is a deterministic model,
which only requires reference values and thresholds for process mea-
surements instead of any pre-set parameters. For the GMM-PF model
presented in Section 3.1.2, the parameter settings of the model can refer
to many existing references, such as Ref. [42] and [43]. As a result, this
section focuses on discussing the parameter setting of the integrated PRA
model shown in Section 3.2.
We need to set only two parameters, μ and Δ, for the proposed PRA
model, and other parameters can be derived from these two parameters.
This section provides a sensitivity analysis to analyze how to determine
model parameters for PRA. First, we set Δ to 0.05 and prior probability
to 0.05 to observe the influence of the change of parameter μ on the
posterior probability. As shown in Fig. 22, whether the PMON value is 0.4
or 0.6, the posterior probability will remain stable when μ is greater than
about 2, and thus we can set the parameter μ to a number greater than 2.
Fig. 23 shows the sensitivity analysis result of parameter Δ, in which the
μ is set as 3 and the prior probability is set as 0.05. Δ is a small number
denoting the discrete interval. When the parameter Δ is less than 0.3, the
posterior probabilities are stable, and therefore, we can set Δ to a very
small positive number for the PRA model.
4.2.5. Model uncertainty and validation
The uncertainty and validation of risk analysis models have been
widely studied. The model uncertainty may greatly impact the estima-
tion accuracy of the probability estimation of basic events, and then
affect the calculated risk. One way to represent the uncertainty in PRA is
to conduct a bounding analysis by using probability boxes (P-boxes)
[55]. In the bounding analysis, the prior probability of basic events can
be assessed as a credible interval rather than a single point. By propa-
gating the uncertainty, a credibility interval can also be obtained for the
estimated kick risk and posterior risks, which can help decision-makers
understand the confidence of risk estimations.
Three-axiom-based validation [56] can be performed to give a partial
validation for the proposed PRA model. If the proposed model is robust,
the estimated probability of basic events is often sensitive but rarely
presents abrupt variations due to any slight changes in corresponding
alarm probability. As shown in Fig. 20 and Fig. 21, if PMON is less than
0.5, the real-time risk of the corresponding basic event will only change
slightly. On the contrary, when PMON is greater than 0.5, the real-time
risk of the corresponding basic event will increase significantly. With
a minor change in the alarm probability, the real-time risk of basic
events is sensitive but rarely exhibits abrupt changes. In addition, once
the monitoring results show that the event status is abnormal, the
real-time risk of the basic event will rise reasonably. The sensitive
analysis results satisfy the axioms and thus partially validate the pro-
posed PRA model.
Conclusions
This paper proposes a novel integrated PRA method for petrochem-
ical industries based on data monitoring. The proposed method can fully
satisfy the timing requirement of process operations. Additionally, the
proposed probability estimation not only considers the update of the
failure probability of safety barriers but also assesses the probability of
abnormal events. Furthermore, PF-based GMM can estimate the alarm
probability of process signals effectively during phase shafts in industrial
systems. By comparing different risk analysis methods, the main con-
clusions from the case study validate the effectiveness and necessity of
the proposed PRA method.
The significant contribution of the proposed approach is the inte-
gration of data monitoring with DBN for PRA in real-time. Specifically,
the probability of the monitored signal exceeding the alarm threshold is
estimated first for environmental and process monitoring based on the
multivariate method and PF-based GMM, respectively. Data monitoring
can be effectively incorporated into PRA in real-time by the proposed
DBN with the estimated alarm probability. The proposed DBN model
will not require additional expert knowledge and historical accident
data to determine the conditional relationship between data monitoring
and prior risk so that is more suitable for practical engineering appli-
cations. We plan to collect actual process data to test the model and
incorporate effective monitoring models into PRA.
CRediT authorship contribution statement
Rui He: Conceptualization, Methodology, Writing – original draft,
Writing – review & editing. Jingyu Zhu: Investigation, Writing – review
& editing. Guoming Chen: Writing – review & editing, Funding
acquisition. Zhigang Tian: Writing – review & editing.
Declaration of Competing Interest
No conflict of interest exists in the submission of this manuscript, and
all authors approve the manuscript for publication. We would like to
declare on behalf of co-authors that the work described was original
research that has not been published previously, and not under
consideration for publication elsewhere, in whole or in part. We declare
that we do not have any commercial or associative interest that repre-
sents a conflict of interest in connection with the work submitted.
Acknowledgments
The authors greatly appreciate the help from the Centre for Offshore
Engineering and Safety Technology (COEST) at China University of
Petroleum (East China), and gratefully acknowledge the financial sup-
port provided by China National Key Research and Development Pro-gram (No: 2016YFC0304005), and National Natural Science Foundation
of China (No: 52004142).
References
[1] Khan F, Abbasi S. Techniques and methodologies for risk analysis in chemical
process industries. J Loss Prevent Proc 1998;11(4):261–77.
[2] Hu Y, Parhizkar T, Mosleh A. Guided simulation for dynamic probabilistic risk
assessment of complex systems: Concept, method, and application. Reliab Eng Syst
Safe 2022;217:108047.
[3] Khan F, Abbasi SA. Analytical simulation and PROFAT II: a new methodology and a
computer automated tool for fault tree analysis in chemical process industries.
J Hazard Mater 2000;75(1):1–27.
[4] Ferdous R, Khan F, Sadiq R, Amyotte P, Veitch B. Handling data uncertainties in
event tree analysis. Process Saf Environ 2009;87(5):283–92.
[5] Aqlan F, Mustafa Ali E. Integrating lean principles and fuzzy bow-tie analysis for
risk assessment in chemical industry. J Loss Prevent Proc 2014;29:39–48.
[6] Khakzad N, Khan F, Amyotte P. Dynamic safety analysis of process systems by
mapping bow-tie into Bayesian network. Process Saf Environ 2013;91(1–2):46–53.
[7] Moradi R, Groth KM. Modernizing risk assessment: A systematic integration of PRA
and PHM techniques. Reliab Eng Syst Safe 2020;204:107194.
[8] OREDA-offshore reliability data handbook. 4th ed. Trondheim: SINTEF; 2002.
[9] Li X, Zhang Y, Abbassi R, Khan F, Chen G. Probabilistic fatigue failure assessment
of free spanning subsea pipeline using dynamic Bayesian network. Ocean Eng
2021;234:109323.
[10] He R, Chen G, Wang Y, Jiang S, Zhi C. A quantitative risk analysis model
considering uncertain information. Process Saf Environ 2018;118:361–70.
R. He et al.
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0001
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0001
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0002
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0002
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0002
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0003
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0003
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0003
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0004
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0004
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0005
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0005
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0006
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0006
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0007
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0007
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0008
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0009
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0009
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0009
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0010
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0010
Reliability Engineering and System Safety 226 (2022) 108700
17
[11] Liu Z, Ma Q, Cai B, Shi X, Zheng C, Liu Y. Risk coupling analysis of subsea blowout
accidents based on dynamic Bayesian network and NK model. Reliab Eng Syst Safe
2022;218:108160.
[12] Meel A, Seider W D. Plant-specific dynamic failure assessment using Bayesian
theory. Chem Eng Sci 2006;61:7036–56.
[13] Kalantarnia M, Khan F, Hawboldt K. Dynamic risk assessment using failure
assessment and Bayesian theory. J Loss Prevent Proc 2009;22:600–6.
[14] Li X, Chen G, Khan F, Xu C. Dynamic risk assessment of subsea pipelines leak using
precursor data. Ocean Eng 2019;178:156–69.
[15] Yu H, Khan F, Veitch B. A flexible hierarchical Bayesian modeling technique for
risk analysis of major accidents. Risk Anal 2017;37(9):1668–82.
[16] Meng X, Li X, Wang W, Song G, Chen G, Zhu J. A novel methodology to analyze
accident path in deepwater drilling operation considering uncertain information.
Reliab Eng Syst Safe 2021;205:107255.
[17] Yang X, Haugen S, Paltrinieri N. Clarifying the concept of operational risk
assessment in the oil and gas industry. Safety Sci 2018;108:259–68.
[18] Khakzad N, Khan F, Amyotte P. Quantitative risk analysis of offshore drilling
operations: A Bayesian approach. Safety Sci 2013;57:108–17.
[19] Pasman H, Reniers G. Past, present and future of Quantitative Risk Assessment
(QRA) and the incentive it obtained from Land-Use Planning (LUP). J Loss Prevent
Proc 2014;28:2–9.
[20] Shalev DM, Tiran J. Condition-based fault tree analysis (CBFTA): A new method for
improved fault tree analysis (FTA), reliability and safety calculations. Reliab Eng
Syst Safe 2007;92(9):1231–41.
[21] Liu Q, Dong M, Peng Y. A novel method for online health prognosis of equipment
based on hidden semi-Markov model using sequential Monte Carlo methods. Mech
Syst Signal Pr 2012;32:331–48.
[22] Zeng Z, Zio E. Dynamic risk assessment based on statistical failure data and
condition-monitoring degradation data. IEEE T Reliab 2018;67(2):609–22.
[23] Gomes JPP, Rodrigues LR, Galvao RKH, Yoneyama T. System level RUL estimation
for multiple-component systems. In: Proc 2013 Annu Conf Prognostics Health
Manage. Soc; 2013. p. 74–82.
[24] Xing J, Zeng Z, Zio E. A framework for dynamic risk assessment with condition
monitoring data and inspection data. Reliab Eng Syst Safe 2019;191:106552.
[25] Wang H, Khan F, Ahmed S, Imtiaz S. Design of Scenario-Based Early Warning
System for Process Operations. Ind Eng Chem Res 2015;54(33):8255–65.
[26] Adedigba SA, Oloruntobi O, Khan F, Butt S. Data-driven dynamic risk analysis of
offshore drilling operations. J Petrol Sci Eng 2018;165:444–52.
[27] Mamudu A, Khan F, Zendehboudi S, Adedigba S. Dynamic risk assessment of
reservoir production using data-driven probabilistic approach. J Petrol Sci Eng
2020;184:106486.
[28] Zhang X, Zhang L, Hu J. Real-time risk assessment of a fracturing manifold system
used for shale-gas well hydraulic fracturing activity based on a hybrid Bayesian
network. J Nat Gas Sci Eng 2019;62:79–91.
[29] Zywiec WJ, Mazzuchi TA, Sarkani S. Analysis of process criticality accident risk
using a metamodel-driven Bayesian network. Reliab Eng Syst Safe 2021;207:
107322.
[30] Yu Q, Teixeira AP, Liu K, Rong H, Soares CG. An integrated dynamic ship risk
model based on Bayesian networks and evidential reasoning. Reliab Eng Syst Safe
2021;216:107993.
[31] Islam R, Khan F, Venkatesan R. Real time risk analysis of kick detection: Testing
and validation. Reliab Eng Syst Safe 2017;161:25–37.
[32] He R, Li X, Chen G, Chen G, Liu Y. Generative adversarial network-based semi-
supervised learning for real-time risk warning of process industries. Expert Syst
Appl 2020;150:113244.
[33] Wang H, Khan F, Ahmed S, Imtiaz S. Dynamic quantitative operational risk
assessment of chemical processes. Chem Eng Sci 2016;142:62–78.
[34] Hashemi SJ, Ahmed S, Khan F. Loss functions and their applications in process
safety assessment. Process Saf Prog 2014;33(3):285–91.
[35] Sule I, Imtiaz S, Khan F, Butt S. Risk analysis of well blowout scenarios during
managed pressure drilling operation. J Petrol Sci Eng 2019;182:106296.
[36] Wang Q, Zhang L, Hu J. Real-time risk assessment of casing-failure incidents in a
whole fracturing process. Process Saf Environ 2018;120:206–14.
[37] Dimaio F, Scapinello O, Zio E, Ciarapica C, Cincotta S, Crivellari A, Decarli L,
Larosa L. Accounting for safety barriers degradation in the risk assessment of oil
and gas systems by multistate Bayesian networks. Reliab Eng Syst Safe 2021;216:
107943.
[38] Wu S, Zhang L, Fan J, Zheng W, Zhou Y. Real-time risk analysis method for
diagnosis and warning of offshore downhole drilling incident. J Loss Prevent Proc
2019;62:103933.[39] Parhizkar T, Hogenboom S, Vinnem JE, Utne IB. Data driven approach to risk
management and decision support for dynamic positioning systems. Reliab Eng
Syst Safe 2020;201:106964.
[40] Ahn J, Chang D. Fuzzy-based HAZOP study for process industry. J Hazard Mater
2016;317:303–11.
[41] Zadakbar O, Imtiaz S, Khan F. Dynamic risk assessment and fault detection using a
multivariate technique. Process Saf Prog 2013;32(4):365–75.
[42] Yu J. A particle filter driven dynamic Gaussian mixture model approach for
complex process monitoring and fault diagnosis. J Process Contr 2012;22:778–88.
[43] Figueiredo MAF, Jain AK. Unsupervised learning of finite mixture models. IEEE
Trans Pattern Anal Mach Intell 2002;24:381–96.
[44] Arulampalam MS, Maskell S, Gordon N, Clapp T. A tutorial on particle filters for
online nonlinear/non-Gaussian Bayesian tracking. IEEE Trans Signal Proces 2002;
50(2):175–88.
[45] Kotecha JH, Djuric PM. Gaussian sum particle filtering. IEEE Trans Signal Proces
2003;51(10):2602–12.
[46] Yu J, Qin SJ. Multimode process monitoring with Bayesian inference-based finite
Gaussian Mixture models. AIChE J 2008;54(7):1811–29.
[47] Li C, Mahadevan S, Ling Y, Choze S, Wang L. Dynamic Bayesian network for
aircraft wing health monitoring digital twin. AIAA J 2017;55(3).
[48] Ghosh A, Ahmed S, Khan F. Modeling and testing of temporal dependency in the
failure of a process system. Ind Eng Chem Res 2019;58:8162–71.
[49] Sule I, Khan F, Butt S, Yang M. Kick control reliability analysis of managed pressure
drilling operation. J Loss Prevent Proc 2018;52:7–20.
[50] Abimbola M, Khan F, Khakzad N, Butt S. Safety and risk analysis of managed
pressure drilling operation using Bayesian network. Safety Sci 2015;76:133–44.
[51] Yan L, Wu H, Yan Y. Application of fine managed pressure drilling technique in
complex wells with both blowout and lost circulation risks. Natural Gas Indus B
2015;2:192–7.
[52] “Case western reserve university bearing data center website”, https://csegroups.
case.edu/bearingdatacenter.
[53] Smith WA, Randall RB. Rolling element bearing diagnostics using the Case Western
Reserve University data: A bench mark study. Mech Syst Signal Pr 2015;64-65:
100–31.
[54] Goyal D, Vanraj Pabla BS, Dhami SS. Condition monitoring parameters for fault
diagnosis of fixed axis gearbox: A review. Arch Computat Methods Eng 2017;24:
543–56.
[55] Karanki D R, Kushwaha H S, Verma A K, Ajit S. Uncertainty analysis based on
probability bounds (P-box) approach in probabilistic safety assessment. Risk Anal
2009;29(5):662–75.
[56] Jones B, Jenkinson I, Yang Z, Wang J. The use of Bayesian network modelling for
maintenance planning in a manufacturing industry. Reliab Eng Syst Safe 2010;95:
267–77.
R. He et al.
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0011
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0011
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0011
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0012
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0012
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0013
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0013
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0014
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0014
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0015
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0015
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0016
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0016
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0016
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0017
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0017
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0018
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0018
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0019
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0019
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0019
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0020
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0020
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0020
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0021
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0021
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0021
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0022
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0022
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0023
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0023
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0023
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0024
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0024
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0025
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0025
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0026
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0026
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0027
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0027
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0027
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0028
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0028
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0028
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0029
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0029
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0029
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0030
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0030
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0030
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0031
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0031
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0032
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0032
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0032
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0033
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0033
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0034
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0034
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0035
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0035
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0036
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0036
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0037
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0037
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0037
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0037
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0038
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0038
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0038
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0039
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0039
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0039
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0040
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0040
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0041
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0041
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0042
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0042
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0043
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0043
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0044
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0044
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0044
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0045
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0045http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0046
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0046
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0047
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0047
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0048
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0048
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0049
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0049
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0050
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0050
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0051
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0051
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0051
https://csegroups.case.edu/bearingdatacenter
https://csegroups.case.edu/bearingdatacenter
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0053
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0053
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0053
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0054
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0054
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0054
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0055
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0055
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0055
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0056
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0056
http://refhub.elsevier.com/S0951-8320(22)00323-4/sbref0056
A real-time probabilistic risk assessment method for the petrochemical industry based on data monitoring
1 Introduction
2 Preliminaries and problem formulation
3 Proposed methodology
3.1 Probability calculation for online signal exceeding alarm threshold
3.1.1 Probability estimations for environmental nodes
3.1.2 Probability estimations for process nodes
3.2 Integrated real-time risk analysis
4 Implementation and model discussion
4.1 Case study 1: RT 580 experimental setup
4.1.1 System description
4.1.2 PRA results and discussions
4.2 Case study 2: managed pressure drilling operations
4.2.1 System description
4.2.2 Real-time PRA results
4.2.3 The comparison of GMM and GMM-PF
4.2.4 Model parameter analysis
4.2.5 Model uncertainty and validation
Conclusions
CRediT authorship contribution statement
Declaration of Competing Interest
Acknowledgments
References