Fault diagnosis of rolling element bearing based on wavelet kernel principle component analysis-coupled hidden Markov model

Wang, Hongchao; Hao, Fang

doi:10.21595/jve.2017.18666

Journal of Vibroengineering

Browse Journal

Submit article

Published: 31 December 2017

Check for updates

Fault diagnosis of rolling element bearing based on wavelet kernel principle component analysis-coupled hidden Markov model

Hongchao Wang¹

Fang Hao²

¹Mechanical and Electrical Engineering Institute, Zhengzhou University of Light Industry, 5 Dongfeng Road, Zhengzhou, 450002, China

¹Henan Key Laboratory of Mechanical Equipment Intelligent Manufacturing, Zhengzhou University of Light Industry, 5 Dongfeng Road, Zhengzhou, 450002, China

²Institute for Nationalities, Huanghe Science and Technology College, 666 Zijingshan Road, Zhengzhou, 450063, China

Corresponding Author:

Hongchao Wang

Cite the article Download PDF

Downloads 1624

WoS Core Citations 8

CrossRef Citations 8

Abstract

Different description results will be obtained when apply hidden Markov model (HMM) to the two different channel signals from the same data collection point respectively. Besides, wrong fault diagnosis result might be obtained because fault feature information would not be described comprehensively by using only one single channel signal. In theory, two channel signals collected form the same data collection point will contain much more fault information than the single channel signal contain, but the coupled phenomenon might occur between the two channel signals. Coupled hidden Markov model (CHMM) is the improved method of HMM and it can fuse the information of two channel signals from the same data collection point efficiently, so much more reliable diagnosis result could be obtained by using CHMM than by using HMM. Stated thus, the fault diagnosis method of rolling element bearing based on wavelet kernel component analysis (WKPCA)-CHMM is proposed: Firstly, use WKPCA as fault feature vectors extraction method to increase the efficiency of the proposed method. Then apply CHMM to the extracted fault feature vectors and satisfactory fault diagnosis result is obtained at last. The feasibility and advantages of the proposed method are verified through experiment.

1. Introduction

HMM has been used in fault diagnosis of rotating machinery widely [1-5]. However, it could not solve the multi-channel data fusion problem. Many machine condition monitoring techniques have been proposed based on multi-channel data acquisition system [6]. The current data fusion techniques are mainly classified into three categories: data-level fusion, feature-level fusion and decision-level fusion. Vibration and current signals were fused basing on Dempster-Shafer (DS) to improve the diagnostic accuracy [7]. Some vibration parameters such as RMS, peak and peak to peak were used in the detection defects in the bearing [8]. In order to obtain better diagnostic result, the waterfall fusion model was adopted by fusing information from two different kinds of sensors: the accelerometer and load cell [9]. CHMH [10] was first proposed as a novel sensory fusion architecture to solve the data fusion problem in audio-visual speech recognition (AVSR). Xie [11] proposed a coupled hidden Markov model approach to video-realistic speech animation and realistic facial animations driven by speaker independent continuous speech was realized. In paper [12] the dependent faults occurring over time were diagnosed successfully by the proposed coupled factorial hidden Markov model method. In paper [13] the spatial and temporal dynamics in multi-channel electrocorticographic (ECoG) time series was investigated using CHMM. Though CHMM has been used widely in the above stated aspects, very few papers presented its using in fault diagnosis of rolling element bearing. The CHMM was used in rolling element bearing fault diagnosis and performance degradation assessment respectively in paper [14] and paper [15], and satisfactory experiment analysis results were obtained. So the using of CHMM in fault diagnosis of rolling element bearing is studied and the WKPCA is used as feature extraction method in the paper.

2. Wavelet kernel principle component analysis

Various feature parameters are expected to be obtained so as to reflect the running state of the machinery comprehensively. However, the efficiency of the subsequent intelligent diagnosis will be decreased greatly when too many feature vectors are used as the input vectors. Besides, some of the feature parameters are redundant and useless which will decrease the accuracy of intelligent diagnosis to some extent. Principle components analysis (PCA) and the improved PCA method-kernel principle component analysis (KPCA) [16, 17] are the common used linear and non-linear feature dimensionality reduction methods to solve the above contradiction. The schematic diagrams of PCA and KPCA can be referred to Fig. 1 and Fig. 2. KPCA not only owns the virtues of PCA, but also can analyze the non-linear problems which PCA could not. Besides, the KPCA has other advantages which can be referred to the paper [18]. Though KPCA improve the PCA greatly, there are still some defects in the traditional KPCA: firstly, the selection of kernel function in the traditional KPCA is based on experience. Secondly, there is not criterion for selection of the relative parameters of kernel function. Any functions can be fitted by the wavelet function [19] in theory, so in the paper a novel features reduction method named WKPCA is proposed: the wavelet function is used as the kernel function instead of the common used radial basis function (RBF) in KPCA, and the wavelet function can increase the non-linear mapping ability of KPCA greatly. The relative definitions and theory of WKPCA are given as the following.

Fig. 1The schematic diagram of PCA

Fig. 2The schematic diagram of KPCA

Definition 1 [20]: Kernel is a function $K$ which satisfies the following equation for any $x (x \in R^{n})$ :

1

K (x, x^{'}) = ⟨ϕ (x), ϕ (x^{'})⟩,

where $x^{'}$ represents the transpose and $ϕ (.)$ represents a mapping from the data space $R^{n}$ to the feature space $F$ , and the relationship of them can be shown as following:

2

ϕ : x \mapsto ϕ (x) \in F .

It not only can calculate the inner product more efficiently but also need not calculate mapping $ϕ$ process explicitly. The kernel function must satisfy the requirement of Mercer [21].

Theorem 1 [22]: Supposing $K$ is a continuous symmetric function $K \in L_{\infty} (R^{n} \times R^{n})$ which makes the integral operator $T_{K} : L_{2} (R^{n}) \to L_{2} (R^{n})$ :

3

(T_{K} f) (\cdot) = \int_{R^{n}} K (\cdot, x) f (x) d x,

to be positive. That is to say the following relationship can be obtained:

4

\int_{L_{2} \otimes L_{2}} K (x, x^{'}) f (x) f (x^{'}) d x d x^{'} \geq 0 .

In Eq. (4), the $\otimes$ symbol represents convolution algorithm. $K (x, x^{'})$ could be used as the representation of dot product in the feature space if the above conditions can be satisfied.

The kernel function $K (x, x^{'}) = K (x - x^{'})$ satisfies the requirement of Mercer which is given in theorem 2.

Theorem 2 [23]: If the translation invariant kernel function $K (x, x^{'}) = K (x - x^{'})$ is an allowable kernel whose fourier transform (FT) must satisfy the following condition:

5

F [K] (ω) = (2 π)^{- \frac{n}{2}} \int_{R^{n}} \exp (- j (ω \cdot x)) K (x) d x \geq 0 .

Wavelet function has the peculiar characteristics of multi-resolution analysis compared with the common used kernel functions such as RBF used in the traditional KPCA. The wavelet function can fit any function much more precisely, so the wavelet function is combined with PCA instead of the common used kernel function such as RBF, so much stronger non-linear mapping capability can be obtained. The combination of wavelet function with PCA is named wavelet kernel principal component analysis also called WKPCA for short.

Supposing $ψ (x) \in L_{2} (R)$ is a mother wavelet function, $x$ , $x \in R^{n}$ , and a translation invariant wavelet kernel function satisfying the requirement of Mercer can be constructed as following [22]:

6

K (x, x^{'}) = \prod_{i = 1}^{n} ψ (\frac{x_{i} - x_{i}^{'}}{a_{i}}),

where $a_{i}$ is the scale factor.

The requirement of Mercer is not only satisfied but also the properties of the wavelet function are considered when the wavelet kernel function is being constructed. The wavelet construction kernel function meeting the wavelet framework conditions has obvious advantage because it takes into account the sparseness of the training data and the complexity of the constructed kernel functions. Mexican hat wavelet function is a kernel function meeting the wavelet framework conditions [24]. The Mexican hat wavelet function shown in Eq. (7) is used to construct the translation invariant kernel function:

7

ψ (x) = (1 - x^{2}) e x p (- \frac{x^{2}}{2}) .

The constructed translation invariant wavelet kernel function is shown in Eq. (8):

8

K (x, x^{'}) = \prod_{i = 1}^{n} [(1 - {(\frac{x_{i} - x_{i}^{'}}{α_{i}})}^{2}) e x p (- \frac{{‖x_{i} - x_{i}^{'}‖}^{2}}{2 α_{i}^{2}})], (α_{i} \in [a, b]) .

The proof of Mexican hat wavelet satisfying Theorem 2 is given as following.

With regard to the Mexican hat wavelet shown in Eq. (9):

9

K (x) = \prod_{i = 1}^{n} ψ (\frac{x_{i}}{γ}) = \prod_{i = 1}^{n} [(1 - {(\frac{x_{i}}{γ})}^{2}) e x p (- \frac{{‖x_{i}‖}^{2}}{2 γ^{2}})] .

In Eq. (9), $γ$ is the scale factor same as the meaning of $α_{i}$ shown in Eqs. (6) and (8). The Eq. (10) can be obtained:

10

F [K] (ω) = (2 π)^{- n / 2} \int_{R^{n}} e x p (- j (ω x)) K (x) d x

= (2 π)^{- n / 2} \int_{R^{n}} e x p (- j (ω x)) \prod_{i = 1}^{n} [(1 - {(\frac{x_{i}}{γ})}^{2}) e x p (- \frac{{‖x_{i}‖}^{2}}{2 γ^{2}})] d x

= (2 π)^{- \frac{n}{2}} \prod_{i = 1}^{n} \int_{- \infty}^{\infty} (1 - {(\frac{x_{i}}{γ})}^{2}) e x p (- \frac{{‖x_{i}‖}^{2}}{2 γ^{2}} - j (ω_{i} x_{i})) d x_{i}

= \prod_{i = 1}^{n} ω_{i}^{2} {|γ|}^{3} e x p (- \frac{ω_{i}^{2} γ^{2}}{2}) \geq 0 .

From the above, the proof of Mexican hat wavelet satisfying Theorem 2 is obtained which can be used to construct the allowable kernel function.

3. CHMM

CHMM is constituted by multi-HMM chains which couple through cross-time and cross-chain conditional probabilities as illustrated in Fig. 3 and Fig. 4, and the CHMM can be regarded as a special case of dynamic Bayesian network. The observations of each chain in CHMM are decided by the corresponding state in the same chain. Besides, the unobservable state sequence can be only estimated by the observation sequence. The above two characteristics of CHMM are similar to HMM. Different from HMM, all the state variables in different chains may be contained at certain time slice in the states of the CHMM system. The states of all chains in the previous time slice decide the state in each chain. So much comprehensive fault diagnosis result of bearing can be obtained using CHMM because it has a potential to fuse data from multi-channel. The following is the basic theory introduction of a two-chain CHMM.

Fig. 3The schematic of HMM

Fig. 4The schematic diagram of CHMM

3.1. Elements of CHMM

The chain index is represented by $c$ , i.e., $c = \{1,2\}$ . The total set of hidden states of each chain is represented as $S^{c} = \{S_{1}^{c}, S_{2}^{c}, \dots, S_{N_{c}}^{c}\}$ . Let $o_{t} = \{o_{t}^{1}, o_{t}^{2}\}$ represent the observation vector and the hidden state at time $t$ is expressed as $q_{t} = \{q_{t}^{1}, q_{t}^{2}\}$ . The following expression describes the elements of CHMM: $λ = (A, B, π)$ .

(1) $A = \{a_{i, j}\}$ represents the state transition probability matrix. The system transfers from the state $S_{i} = {S_{i_{1}}^{1}, S_{i_{2}}^{2}}$ to the state $S_{j} = {S_{j_{1}}^{1}, S_{j_{2}}^{2}}$ with probability $a_{i, j}$ which could be represented by the following equation:

11

a_{i, j} = P (q_{t + 1} = S_{j}| q_{t} = S_{i}) = \prod_{c = 1}^{2} P (q_{t + 1}^{c} = S_{j_{c}}^{c}| q_{t} = S_{i}) .

(2) The observation probability matrix is expressed as $B = \{b_{j} (o_{t})\}$ . The output $o_{t}$ generated by each state $S_{i} = {S_{i_{1}}^{1}, S_{i_{2}}^{2}}$ with a probability distribution function can used the following equation:

12

b_{j} (o_{t}) = P (o_{t}| q_{t} = S_{j}) = \prod_{c = 1}^{2} P (o_{t}^{c}| q_{t}^{c} = S_{j_{c}}^{c}) .

(3) The initial state distribution is $π = \{π_{i}\}$ , and the calculated probability value of the system’ initial state in $S_{i} = {S_{i_{1}}^{1}, S_{i_{2}}^{2}}$ is $π_{i}$ :

13

π_{i} = P (q_{1} = S_{i}) = \prod_{c = 1}^{2} P (q_{1}^{c} = S_{i_{c}}^{c}) .

The probability distribution of continuous observation can use the Gaussian mixed model (GMM) as follows:

14

b_{j}^{c} (o_{t}^{c}) = \sum_{m = 1}^{M_{j}^{c}} w_{j, m}^{c} N (o_{t}^{c}, μ_{j, m}^{c}, \sum_{j, m}^{c}),

where $M_{j}^{c}$ is the number of Gaussian mixtures of chain $c$ in state $S_{j}^{c}$ , $w_{j, m}^{c}$ is the weight for each Gaussian mixture, and $N (o_{t}^{c}, μ_{j, m}^{c}, \sum_{j, m}^{c})$ is a Gaussian density with mean vector $μ_{j, m}^{c}$ and covariance matrix $\sum_{j, m}^{c}$ .

3.2. CHMM' basic problems

There are three basic problems existing for CHMM in real application:

(1) Evaluation. How the observation sequence $O = \{o_{1} o_{2} \dots o_{T}\}$ with a given CHMM $λ$ is computed, i.e., $P (O | λ)$ ?

(2) Decoding. Given the observation sequence $O = \{o_{1} o_{2} \dots o_{T}\}$ and a CHMM $λ$ , how do we select a hidden state sequence $S = \{S_{1} S_{2} \dots S_{T}\}$ to explain the process, i.e., $m a x_{S} P (S | O, λ)$ ?

(3) Learning. Given the observation sequence $O = \{o_{1} o_{2} \dots o_{T}\}$ , how do we adjust the model parameters $λ$ to maximize the probability $P (O | λ)$ ?

Many algorithms such as Viterbi algorithm, forward-backward procedure and Baum-Welch method were proposed to solve the above problems. The reference [21] gives more details about the above algorithms.

4. Experiment

The flow chart of the proposed method based on WKPCA-CHMM is shown in Fig. 5 and the specific details of each step are given as following:

Step 1: Data collection: collect the signals of the four states (normal state, outer race fault state, rolling element fault state and inner race fault state) of rolling element bearing using double channel accelerator sensors.

Table 1Time-domain statistics indexes

Calculation formulas
1	Peak	$x = m a x {\|x (1)\|, \|x (2)\|, \dots, \|x (N)\|}$
2	Ppvalue	$x_{p - p} = x (n)_{m a x} - x (n)_{m i n}$
3	Meanamp	${\bar{x}}_{p} = \frac{1}{N} \sum_{i = 1}^{N} x_{i}$
4	Rootamp	$x_{r} = {(\frac{1}{N} \sum_{n = 1}^{N} \sqrt{\|x (n)\|})}^{2}$
5	Root mean square	$x_{R M S} = \sqrt{\frac{1}{N} \sum_{n = 1}^{N} x^{2} (n)}$
6	Waveind	$S_{f} = \frac{x_{R M S}}{{\bar{x}}_{p}}$
7	Pluseind	$I_{f} = \frac{x}{{\bar{x}}_{p}}$
8	Peakind	$C_{f} = \frac{x}{x_{R M S}}$
9	Marginind	$C L_{f} = \frac{x_{R M S}}{x_{r}}$
10	Skewness	$S_{k} = \frac{1}{N} {\sum_{n = 1}^{N} (\frac{x (n) - \bar{x}}{σ})}^{3}$
11	Kurtosis	$K_{u} = \frac{1}{N} {\sum_{n = 1}^{N} (\frac{x (n) - \bar{x}}{σ})}^{4}$
Remark: $x (n)$ is time domain discrete signal $\bar{x} = \frac{1}{N} \sum_{n = 1}^{N} x (n)$ $σ = \sqrt{\frac{1}{N - 1} \sum_{n = 1}^{N} (x (n) - μ)^{2}}$

Step 2: Data separation and feature extraction: separate the data of the eight channel signals into 50 groups (The 1-40th groups are used as CHMM training data and 41th-50th groups are used as testing data) respectively and 400 groups are obtained in all. There are 1024 points in each group. Apply the eleven time-domain statistical indexes (The 11 indexes and their corresponding calculation formulas are shown in Table 1) and one time-frequency domain index (the wavelet packet energy entropy (WE) which will be stated in the following chapter) to each group data.

Noting: The 11 indexes are the traditional common used time-domain statistical feature vectors and they can reflect the running state correctly when the fault signal is linear. The signals usually take on non-linear characteristic when fault occurs in machinery, so the time-frequency index is also need so as to capture the characteristic of the fault signal. In the paper, the wavelet packet energy entropy (WE) is used as time-frequency index which will be discussed in the Subsequent chapters.

Step 3: Dimensionality reduction: apply WKPCA to the feature vectors obtained in step 2 in order to obtain dimensionality reduction feature vectors.

Step 4: CHMM models training: use the dimensionality reduction training feature vectors to train four CHMM models (normal state CHMM, inner race fault state CHMM, outer race fault state CHMM and rolling element fault state CHMM).

Step 5: Diagnosis: input the dimensionality reduction testing feature vectors into the trained four state CHMMs in step 4 and fault diagnosis results are obtained.

Fig. 5The framework of the proposed method

Fig. 6The test rig

The test rig is shown in Fig. 6. The two ends of the shaft are supported by two rolling element bearings, and the right end is detachable which is convenient for replacement of the test rolling element bearings. The shaft is driven by AC motor and connected by coupling. The rated power of the AC is 1.1 kW. The test rig is equipped with hydraulic position and clamping device which are used in fixing the outer race of rolling bearing. The inner race, rolling element and outer race of the test rolling bearings are eroded with very tiny point corrosions respectively using Electrical Discharge Machining (EDM) technology to simulate the three kinds of faults of the rolling bearing. The type of the test rolling bearing is GB203. The outer race is fixed on the bench and the inner race rotates synchronously with the shaft in the test process. The rotation frequency of the shaft is $f_{r} =$ 12 Hz. The parameters and the rotation frequency of the test rolling element bearings are shown in Table 2.

Table 2Rolling bearing’s parameters and the rotating frequency

Type	Pitch diameter $D$ (mm)	Ball diameter $d$ (mm)	Ball number $Z$ (N)	Contact angle $α$ (angle)
GB203	28.5	6.747	7	0
Feature frequency Shaft frequency		Calculation formulas $f_{r} = \frac{n}{60}$		Calculated result (Hz) 12
Remark: $n$ represent the shaft rotation speed

One sensor is installed in the traditional vibration data collection method. In the paper the two accelerometers are installed in the same one bearing case synchronously, and the two installed directions are shown in Fig. 7.

Fig. 7The installation direction of the two sensors

The four states of the test rolling bearings are carried on respectively and the corresponding time-domain waveforms of the two channel signals from the same data collection point of the four states are shown in Fig. 8. The sampling frequency is $f_{s} =$ 25.6 kHz.

It is usually taking on non-gaussian and non-linear characteristic whatever the condition of the rolling bearings is (normal or fault). The time-frequency analysis method is a very effective non-linear and non-gaussian signal handling tool to extract the non-linear features buried in the original signal. In the paper, the wavelet packet energy entropy is used as the time-frequency indicator whose calculation process is shown as following:

Apply the wavelet packet transform (WPT) to the original signal and the energies $E_{i} (i = 2^{N}$ , $N$ is the decomposition level) named wavelet packet energy on each node is obtained which is the division of the original signal in the time-frequency domain. In theory, much better frequency-domain performance could be obtained with the bigger value of $N$ . However, the amount of calculation will also be increased with the bigger value of $N$ . So, the value of $N$ is selected 3 as compromising here, and the satisfactory frequency-domain performance could be obtained with the following verification of experimental results. The wavelet packet energy entropy (WE) is defined as in Eq. (15):

15

\{\begin{array}{l} W E = - \sum_{i = 1}^{N} p_{i} \cdot l o g (p_{i}), \\ p_{i} = \frac{E_{i}}{\sum_{j = 1}^{N} E_{j}} . \end{array}

In Eq. (15), $p_{i}$ represents probability distribution. The Normalized wavelet packet energy and wavelet packet energy entropy results of the signals shown in Fig. 8 are summarized in Fig. 9.

Fig. 8The time-domain waveforms of the two channels signals from the same data collection point of the four states

a) The time-domain waveform of channel 1 of normal state

b) The time-domain waveform of channel 2 of normal state

c) The time-domain waveform of channel 1 of outer race fault state

d) The time-domain waveform of channel 2 of outer race fault state

e) The time-domain waveform of channel 1 of rolling element fault state

f) The time-domain waveform of channel 2 of rolling element fault state

g) The time-domain waveform of channel 1 of inner race fault state

h) The time-domain waveform of channel 2 of inner race fault state

So, the dimensionality of training feature vectors of every channel is 4×50×12. Apply WKPCA to the 4×40×12 feature vectors of channel 1 and 2 respectively and the analysis results are shown in Fig. 10 and Fig. 11. In Fig. 12, the curves of classification result with the number of kernel principle components (PC) is given, and it is evident that the correction ratio would be almost unchanged when the number of PC varies from 3-12. Though in theory the correction ration will obtain the biggest value when the number of PC is selected 12 as shown in Fig. 12, the classification speed will be decreased too much. So, the number is selected 3 as compromising to ensure the classification speed, and the classification correction is guaranteed at the same time. From the dimensionality reduction result it is evident that the dimensionality of the feature vectors is reduced to 4×40×3 respectively.

Fig. 9Normalized wavelet packet energy and wavelet packet energy entropy

a) The normalized energy of channel 1

b) The normalized energy of channel 2

c) The energy entropy of channel 1

d) The energy entropy of channel 2

Fig. 10The three principle components of the twelve features of the channel 1 of the four states analyzed by WKPCA method

From Fig. 10, it can be seen that, almost all of the four states’ sample vectors (* represents the sample vectors of normal state, + represents the samples vectors of outer race fault state, blue o represents the sample vectors of rolling element fault state and yellow o represents the sample vectors of inner race fault state) are classified correctly. Though small amount of sample vectors of outer race fault state and inner race fault state are misclassified in Fig. 11, most of the other sample vectors of the four states are classified correctly. In order to verify the advantage of WKPCA over KPCA, the analysis result of the signals shown in Fig. 8 using KPCA are given in Fig. 13 and Fig. 14. The advantages of WKPCA over KPCA is obvious compared the Fig. 10 and Fig. 11 with Fig. 13 and Fig.14: Much more amount of sample vectors of the four states are misclassified compared Fig. 13 and Fig. 14 with Fig. 10 and Fig. 11. Besides, the better clustering result which has bigger classes distance and smaller intra-class distance is evident in Fig. 10 and Fig. 11 compared with the results obtained in Fig. 13 and Fig. 14.

Fig. 11The three principle components of the twelve features of the channel 2 of the four states analyzed by WKPCA method

Fig. 12The curves of classification result with the number of kernel principle components

Fig. 13The three principle components of the twelve features of the channel 1 of the four states analyzed by KPCA method

Use the 4×40×3×2 feature vectors as training feature vectors respectively and the normal state CHMM, outer race fault state CMHH, rolling element fault state CHMM and inner race fault state CHMM diagnosis trained models are erected respectively. Then input the 4×40×3×2 feature vectors as testing feature vectors into the above four trained diagnosis models, and the diagnosis results are obtained and shown in Fig. 15(c) at last. From the Fig. 15(c) the ten groups of the testing feature vectors of the four states are classified corrected completely.

Fig. 14The three principle components of the twelve features of the channel 2 of the four states analyzed by KPCA method

In order to verify the advantage of CHMM over HMM, the diagnosis result based on WKPCA-HMM of the two channel signals of the four states are also carried out respectively. Same as the above CHMM models training and testing process: firstly, use the 4×40×3×2 as training feature vectors respectively and the normal state HMM, outer race fault state MHH, rolling element fault state HMM and inner race fault state HMM diagnosis trained models of the two channel signals are erected respectively, then input the 4×10×3×2 feature vectors as testing feature vectors into the above eight trained diagnosis models, and the diagnosis results are obtained and shown in Fig. 15(a) and Fig. 15(b) respectively. Compared Fig. 15(a) and Fig. 15(b) with Fig. 15(c), the advantages of the proposed method are obvious: in Fig. 15(a) the are three groups of rolling element fault state testing feature vectors are misclassified as normal state. In Fig. 15(b) there is not only one group of rolling element fault state testing feature vector is misclassified as normal state but also there are three groups of outer race fault state testing feature vectors are misclassified as inner race fault state. Based on the above shown results, the advantage of CHMM over HMM in fault diagnosis of rolling element bearing is verified: the CHMM can fuse the information of two channels signals from the same data collection point efficiently and might also resolve the possible coupling phenomenon occurring between the two channel signals synchronously so much more reliable diagnosis result could be obtained compared with the HMM method. Besides, the dimension redundancy and dimension insufficient contraction is not only resolved but also the diagnosis efficiency and correction ratio are also increased because the WKPCA method is used as feature dimensionality method.

Besides, the computation time and correction ratio of the other three relative methods (WKPCA-HMM, RKPCA-CHMM and CHMM without dimensionality reduced) and the proposed method (WKPCA-CHMM) are shown in Table 3. Based on Table 3 the advantages of the proposed are further verified.

Table 3The computation time and correction of the relative methods and the proposed method

The name of methods	Computation time (second)	Correction ratio
WKPCA-HMM	34.5	70 %
RKPCA-CHMM	40.4	65 %
CHMM	65.3	60 %
WKPCA-CHMM	38.6	100 %

Fig. 15The diagnosis results based on WKPCA-HMM and WPCA-CHMM

a) The diagnosis results of channel 1 signals of the four states based on WKPCA-HMM

b) The diagnosis results of channel 2 signals of the four states based on WKPCA-HMM

c) The diagnosis results of the two channels signals of the four states based on WKPCA-CHMM

5. Conclusions

The paper presents an integrated WKPCA-CHMM method to realize the intelligent fault diagnosis of rolling element bearing. The advantage of CHMM over HMM is following: the CHMM can fuse the information of two channel signals from the same data collection point efficiently and might also solve the possible coupling phenomenon occurring between the two channel signals synchronously. The WKPCA is used as feature dimensionality reduction method for it is not only can solve the dimension redundancy and dimension insufficient contraction but also is much more flexible than RKPCA. The feasibility and validity of the proposed method is verified through experiment. Besides, the advantages of the proposed method over other relative methods are also verified and presented.

References

Jiang R., Yu J., Makis V. Optimal Bayesiam estimation and control scheme for gear shaft fault detection. Computers and Industrial Engineering, Vol. 63, Issue 4, 2012, p. 754-762.

Publisher
Boutros T., Liang M. Detection and diagnosis of bearing and cutting tool faults using hidden Markov models. Mechanical Systems and Signal Processing, Vol. 25, Issue 6, 2011, p. 2102-2124.

Publisher
Geramifard O., Xu J. X., Panda S. K. Fault detection and diagnosis in synchronous motors using hidden Markov model-based semi-nonparametric approach. Engineering Applications of Artificial Intelligence, Vol. 26, Issue 8, 2013, p. 1919-1929.

Publisher
Georgoulas G., Mustafa M. O., Tsoumas I. P. Principal component analysis of the start-up transient and hidden Markov modeling for broken rotor bar fault diagnosis in asynchronous machines. Expert Systems with Application, Vol. 40, Issue 17, 2013, p. 7024-7033.

Publisher
Purushotham V., Narayanan S., Prasad S. A. N. Multi-fault diagnosis of rolling bearing elements using wavelet analysis and hidden Markov model based fault recognition. NDT&E International, Vol. 38, Issue 8, 2005, p. 654-664.

Publisher
Jardine A. K. S., Lin D., Banjevic D. A review on machinery diagnostics and prognostics implementing condition-based maintenance. Mechanical Systems and Signal Processing, Vol. 20, Issue 7, 2006, p. 1483-1510.

Publisher
Yang B. S., Kim K. J. Application of Dempster-Shafer theory in fault diagnosis of induction motors using vibration and current signals. Mechanical Systems and Signal Processing, Vol. 20, Issue 2, 2006, p. 403-420.

Publisher
Kulkarni S., Bewoor A. Vibration based condition assessment of ball bearing with distributed defect. Journal of Measurements in Engineering, Vol. 4, Issue 8, 2016, p. 87-94.

Search CrossRef
Safizadeh M. S., Latifi S. K. Using multi-sensor data fusion for vibration fault diagnosis of rolling element bearings by accelerometer and load cell. Information Fusion, Vol. 18, Issue 4, 2014, p. 1-8.

Publisher
Brand M., Oliver N., Pentland A. Coupled hidden Markov models for complex action recognition. Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, San Juan, USA, 1997, p. 994-999.

Publisher
Xie L., Liu Z. Q. A coupled HMM approach to video-realistic speech animation. Pattern Recognition, Vol. 40, Issue 8, 2007, p. 2325-2340.

Publisher
Kodali A., Pattipati K. Coupled factorial hidden Markov models (CFHMM) for diagnosing multiple and coupled faults. IEEE Transactions on Systems, Man, and Cybernetics: Systems, Vol. 43, Issue 3, 2013, p. 522-534.

Publisher
Zhao R., Schalk G., Ji Q. Coupled hidden Markov model for electrocorticographic signal classification. 22nd International Conference on Pattern Recognition, 2014, p. 1858-1862.

Publisher
Xiao W. B., Chen J., Dong G. M. A multichannel fusion approach based on coupled hidden Markov models for rolling element bearing fault diagnosis. Proceedings of the Institution of Mechanical Engineers, Part C: Journal of Mechanical Engineering Science, Vol. 226, Issue 1, 2012, p. 202-216.

Publisher
Liu T., Chen J., Zhou X. N., Xiao W. B. Bearing performance degradation assessment using linear discriminant analysis and coupled HMM. 25th International Congress on Condition Monitoring and Diagnostic Engineering, Journal of Physic: Conference Series, Vol. 364, 2012.

Publisher
Bellino A., Fasana A., Garibaldi L. PCA-based detection of damage in time-varying systems. Mechanical Systems and Signal Processing, Vol. 24, Issue 4, 2010, p. 2250-2260.

Publisher
Xiao Y. Q., Feng L. G. A novel linear ridgelet network approach for analog fault diagnosis using wavelet-based fractal analysis and kernel PCA as preprocessors. Measurement, Vol. 45, Issue 3, 2010, p. 297-310.

Publisher
Cao M. S., Ding Y. J., Ren W. X., Wang Q., Ragulskis M. Hierarchical wavelet-aided neural intelligent identification of structural damage in noisy conditions. Applied Science, Vol. 7, Issue 391, 2017, https://doi.org/10.3390/app7040391

Publisher
Ganey J. L., Block W. M., Jenness J. S. Mexican spotted owl home range and habitat use in pine-oak forest: implications for forest management. Forest Science, Vol. 45, Issue 1, 1999, p. 127-135.

Search CrossRef
Taylor J. S., Cristianini N. Kernel Methods for Pattern Analysis. Cambridge University Press, Cambridge, 2004.

Publisher
Nefian A. V., Liang L. H., Pi X. P. Dynamic Bayesian networks for audio-visual speech recognition. Eurasip Journal on Applied Signal Processing, 2002, https://doi.org/10.1155/S1110865702206083.

Publisher
Zhang L., Zhou W. D., Jiao L. C. Wavelet support vector machine. IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics, Vol. 34, Issue 1, 2004, p. 34-39.

Publisher
Smola A. J., Scholkopf B., Muller K. R. The connection between regularization operators and support vector kernels. Neural Networks, Vol. 11, Issue 4, 1998, p. 637-649.

Publisher
Wen X. J., Xu X. M., Cai Y. Z. Least-squares wavelet kernel method for regression estimation. International Conference on Natural Computation 2005, Changsha, 2005, p. 582-591.

Publisher

Cited by

Research on Multi-Fault Identification of Marine Vertical Centrifugal Pump Based on Multi-Domain Characteristic Parameters

(2023)

2022 Workshop on Microwave Theory and Techniques in Wireless Communications (MTTW)

Pascal Dore | Saad Chakkor | Ahmed El Oualkadi

(2022)

Coupled Hidden Markov Fusion of Multichannel Fast Spectral Coherence Features for Intelligent Fault Diagnosis of Rolling Element Bearings

Hongchao Wang | Chuan Li | Wenliao Du

(2021)

An adaptive multi band-pass filter algorithm and its application in fault diagnosis of rolling bearing

Hongchao Wang | Hongwei Li | Wenliao Du

(2021)

Intelligent diagnosis of rolling bearing compound faults based on device state dictionary set sparse decomposition feature extraction–hidden Markov model

HongChao Wang | WenLiao Du

(2020)

Precision degradation prediction of inertial test turntable based on Hidden Markov Model and optimized particle filtering

Liming Li | Xunyi Zhou | Xingqi Zhang | Zhenghu Zhong

(2020)

Gyro motor fault classification model based on a coupled hidden Markov model with a minimum intra-class distance algorithm

Lei Dong | Wei-min Li | Ching-Hsin Wang | Kuo-Ping Lin

(2020)

Research on Fault Feature Extraction Method of Rolling Bearing Based on NMD and Wavelet Threshold Denoising

(2018)

About this article

Received

24 May 2017

Accepted

23 July 2017

Published

31 December 2017

SUBJECTS

Fault diagnosis based on vibration signal analysis

DOI

https://doi.org/10.21595/jve.2017.18666

Keywords

WKPCA

CHMM

rolling element bearing

fault diagnosis

Acknowledgements

The authors disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: the research is supported by the National Natural Science Foundation (China) (approved Rant: 51405453 and 51205371).

This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Previous article in issue Previous Next article in issue Next

Research article

2024 02 29

Dynamic modeling and analysis of rolling bearing faults under time-varying excitations considering defect deformation

Chao Zhang, Yangbiao Wu, Shuai Xu, Feifan Qin, Le Wu, Bing Ouyang

Research article

2021 09 21

A study on the feature separation and extraction of compound faults of bearings based on casing vibration signals

Qizhi Fang, Baodong Qiao, Mingyue Yu

Research article

2021 01 04

An adaptive multi band-pass filter algorithm and its application in fault diagnosis of rolling bearing

Hongchao Wang, Hongwei Li, Wenliao Du

Research article

2018 12 31

Fault diagnosis of rolling element bearing based on a new noise-resistant time-frequency analysis method

Hongchao Wang, Fang Hao

H. Wang and F. Hao, “Fault diagnosis of rolling element bearing based on wavelet kernel principle component analysis-coupled hidden Markov model,” Journal of Vibroengineering, Vol. 19, No. 8, pp. 5992–6006, Dec. 2017, https://doi.org/10.21595/jve.2017.18666

Copy Extrica

Copied to clipboard!

TY  - JOUR
DO  - 10.21595/jve.2017.18666
UR  - https://doi.org/10.21595/jve.2017.18666
TI  - Fault diagnosis of rolling element bearing based on wavelet kernel principle component analysis-coupled hidden Markov model
T2  - Journal of Vibroengineering
AU  - Wang, Hongchao
AU  - Hao, Fang
PY  - 2017
DA  - 2017/12/31
PB  - JVE International Ltd.
SP  - 5992-6006
IS  - 8
VL  - 19
SN  - 1392-8716
ER  - 

Copy Ris

Copied to clipboard!

@article{Wang_2017,
	doi = {10.21595/jve.2017.18666},
	url = {https://doi.org/10.21595/jve.2017.18666},
	year = 2017,
	month = {dec},
	publisher = {{JVE} International Ltd.},
	volume = {19},
	number = {8},
	pages = {5992--6006},
	author = {Hongchao Wang and Fang Hao},
	title = {Fault diagnosis of rolling element bearing based on wavelet kernel principle component analysis-coupled hidden Markov model},
	journal = {Journal of Vibroengineering}
}

Copy Bibtex

Copied to clipboard!

[1]H. Wang and F. Hao, “Fault diagnosis of rolling element bearing based on wavelet kernel principle component analysis-coupled hidden Markov model,” Journal of Vibroengineering, vol. 19, no. 8, pp. 5992–6006, Dec. 2017, doi: 10.21595/jve.2017.18666.

Copy IEEE

Copied to clipboard!

Wang, Hongchao, and Fang Hao. “Fault Diagnosis of Rolling Element Bearing Based on Wavelet Kernel Principle Component Analysis-Coupled Hidden Markov Model.” Journal of Vibroengineering 19, no. 8 (December 31, 2017): 5992–6006. https://doi.org/10.21595/jve.2017.18666.

Copy Chicago

Copied to clipboard!

Fault diagnosis of rolling element bearing based on wavelet kernel principle component analysis-coupled hidden Markov model

Abstract

1. Introduction

2. Wavelet kernel principle component analysis

3. CHMM

3.1. Elements of CHMM

3.2. CHMM' basic problems

4. Experiment

5. Conclusions

References

Cited by

About this article

Related Articles