A rolling bearing fault diagnosis method based on VMD – multiscale fractal dimension/energy and optimized support vector machine
Fei Chen^{1} , Xiaojuan Chen^{2} , Zhaojun Yang^{3} , Binbin Xu^{4} , Qunya Xie^{5} , Heng Zhang^{6} , Yifeng Ye^{7}
^{1, 2, 3, 4, 5, 6, 7}School of Mechanical Science and Engineering, Jilin University, Jilin, China
^{4}Corresponding author
Journal of Vibroengineering, Vol. 18, Issue 6, 2016, p. 35813595.
https://doi.org/10.21595/jve.2016.16847
Received 18 January 2016; received in revised form 29 June 2016; accepted 8 August 2016; published 30 September 2016
JVE Conferences
To achieve the goal of automated rolling bearing fault diagnosis, a variational mode decomposition (VMD) based diagnosis scheme was proposed. VMD was firstly used to decompose the vibration signals into a series of bandlimited intrinsic mode functions (BLIMFs). Subsequently, the multiscale fractal dimension (MSFD) and multiscale energy (MSEN) of each BLIMF were calculated and combined together as features of the original vibration signals. In an attempt to accelerate the classification speed, oneway analysis of variance (ANOVA) test was adopted to extract significant features from the redundant features. Finally, those significant features were fed into the optimized support vector machine (SVM), which was optimized by the genetic algorithm (GA), for classification. Experimental results on the international public Case Western Reserve University bearing data indicate the effectiveness of the proposed method with a classification accuracy of 99.75 % for seven classes. Moreover, our approach also shows good antinoise performance in different signaltonoise ratios (SNRs).
Keywords: rolling bearing fault diagnosis, variational mode decomposition (VMD), multiscale fractal dimension (MSFD), multiscale energy (MSEN), support vector machine (SVM).
1. Introduction
Rolling bearing is one of the most widely applied objects and the quickwear parts in rotating machinery. The operation state of the rolling bearing directly affects the performance of the whole rotating machinery system. According to the statistics, 30 % of rotating machinery malfunction was caused by the faults of rolling bearing [1]. When the rolling bearing fails, it may lead to the crash of the entire rotating machinery system and directly influence the work efficiency and lower the reliability of the system. The fault diagnosis of rolling bearing can be a useful tool to avoid the halt of rotating machinery, because it can warn the fault information via the vibration signals, electric current and so forth which comes from the sensors seated in the component position. The useful information can be obtained through analyzing the vibration signals. For that, the fault can be found easier and prevented, then directly reduce the failures and improve the reliability of the rotating machinery.
Vibration signal analysis methods have been the most popular technique in rolling bearing fault diagnosis. Nonetheless, since the rolling bearing is affected by load, fraction, damping, propagation path, noise and other kinds of factors, the actual collected vibration signals characterized with nonlinear and nonstationary resulting the conventional linear system analysis method in timedomain, frequencydomain and waveletdomain can’t accurately represent the vibration signal which comes from the rolling bearing. To analysis the nonlinear and nonstationary signals, scholars put forward a series of nonlinear analysis method, such as fractal dimension [2, 3], approximate entropy [4], sample entropy [5], multiscale entropy [6], fuzzy entropy [7, 8] and so forth. These feature extraction method always combined with signals decomposing method, for instance, wavelet analysis, empirical mode decomposition (EMD), local mean decomposition (LMD) and intrinsic timescale decomposition (ITD) to diagnose the rolling bearing fault. Tse et al. [9] adopted the wavelet analysis and envelop detection to handle the vibration signals and this method were proved to be efficient in detecting the rolling bearing fault modes. Kankar et al. [10] employed waveletbased feature extraction to analyze the localized defects on the ball bearing. For the decomposition method EMD, Zhong et al. [11] integrated the sample entropy and 1.5dimension spectrum with EMD to classify the faults of the rolling bearing. Zhou et al. [12] pointed out that spalling and pitting were the primary failure forms in the rotary rolling bearing, in addition, they united the approximate entropy and EMD to estimate the size of the spalllike fault. Zhu et al. [13] proposed the method combined the EMD with the correlation coefficient to analysis the fault rolling bearing, the experimental results verified the effectiveness of the method. LMD, a newlydeveloped timefrequency decomposition technique was put forward by Smith in 2005 [14]. Cheng et al. [15] brought up the idea that the LMD can be employed to diagnose gear and rolling bearing fault, they compared the LMD with the EMD, and a conclusion was drawn that the LMD had superior performance than EMD in fault diagnosis. Furthermore, Liu et al. [6] introduced a novel fusion method of multiscale entropy and LMD to detect the fault of roller bearing system. In 2007, a new adaptive timefrequency analysis method, named intrinsic timescale decomposition (ITD) [16] was proposed and tested its feasibility of characterizing nonstationary signals. Luo et al. [17] combined the ITD with the fractal dimension and fuzzy entropy to separate the four operating conditions of the rolling bearing. As an extension of ITD, Zheng et al. [18] introduced a new nonstationary signals analysis method, called local characteristicscale decomposition (LCD) in 2013. Moreover, he presented a new rolling bearing fault diagnosis method based on LCD and fuzzy entropy, and the result showed this method had excellent performance in fault diagnosis. However, the above literatures mostly just considered the fault characteristic frequencies about the rolling bearing and applied the vibration signal of one motor revolving speed to analyze the rolling bearing fault. In order to do further research, in this paper, we considered two defects sizes with 0.1778 mm and 0.3556 mm in diameter, in addition, we adopted the vibration signal data of four motor revolving speeds to recognize the fault mode.
Variational mode decomposition (VMD) was recently proposed by Konstantin D. and Dominique Z. [19]. It is a new adaptive signal decomposition method which can adaptively determine the relevant frequency bands and the corresponding mode simultaneously. In other words, VMD has the capability of decomposing an arbitrary signal into a series of bandlimited intrinsic mode functions (BLIMFs). The VMD algorithm has solid theoretical foundation, its substance is several adaptive wiener filtering and it also shows better noise robustness. Wang et al. [20] had already adopted the VMD to analyze the rubbing signals and concluded that the multiple features can be better extracted using VMD than EMD and EEMD. We thus employed VMD to decompose the vibration signal in this paper, and then adopted the multiscale fractal dimension (MSFD) and multiscale energy (MSEN) to extract features. In an attempt to improve the classification speed, the oneway analysis of variance (ANOVA) test was adopted to remove the redundant features and the remaining significant features were bound together to constitute the feature vectors of vibration signals. After the procedure of feature extraction for the vibration signals, a multifault classifier should be applied to recognize the fault mode. As one of the most stateoftheart and popular classifiers, support vector machine (SVM) [21] demonstrates many specific advantages in solving small samples, nonlinear and highdimension pattern recognition problems. Whereas, the choice of appropriate parameters of SVM was difficult, therefore SVM optimized by the genetic algorithm (GA), i.e., GASVM was exploited as the multifault classifier in the current study. The block diagram of the proposed method is presented in Fig. 1.
The rest of this paper is organized as follows. Section 2 gives the detailed description about the vibration signal data we used. Section 3 introduces the principle of the new timefrequency analysis methodVMD, the algorithm of MSFD and MSEN as well as the mechanism of GASVM. Experimental results are presented in Section 4. Finally, conclusions are drawn in Section 5.
Fig. 1. Concrete process of the proposed method
2. Rolling bearing database
To verify the effectiveness of the proposed method in this study, a public vibration signal dataset, obtained from the Case Western Reserve University bearing data center website, was employed [22]. The test data collection device and description were given in detail in [22]. The 62052RS JEM SKF deep groove ball bearings were used in the test as the testing bearings. The test bearings support the motor shaft, single point faults were introduced to test bearings using electrodischarge machining with fault diameters of 0.1778 mm and 0.3556 mm, both with the same fault depth. Vibration data was collected by accelerometers, which were attached to the housing with magnetic bases. And the data we used in this work was collected through accelerometers which were placed at the 12 o’clock position at drive end of the motor housing. The sampling frequency of the vibration data was 12000 Hz.
In our study, the database was divided into seven subsets. With motor load changed from 0 to 3 horsepower (motor speeds of 1797 RPM to 1720 RPM ), the subset 1 was the collection of vibration signal in normal condition; the subsets 2 and 3 were taken from the outer race fault, whose fault diameter were 0.1778 mm and 0.3556 mm, respectively; the subsets 4 and 5 were measured from the ball element fault, whose diameter fault were 0.1778 mm and 0.3556 mm, respectively; the remaining subsets 6 and 7 were derived from the inner race fault, whose diameter fault were 0.1778 mm and 0.3556 mm, respectively. Our objective was to identify the seven statuses, namely normal, outer race fault 1, outer race fault 2, ball element fault 1, ball element fault 2, inner race fault 1 and inner race fault 2 automatically and accurately. To achieve this aim, the continuous sequence of each subset was segmented into isometric samples with a length of 4096. As a consequence, for each kind of the subset, the obtained number of samples was 116. Among them, 58 samples were randomly selected as training set and the remaining samples were used as testing set. A detailed description of the segmented vibration signals is shown in Table 1 and the time domain waveforms of these seven classes of rolling bearing vibration signals are shown in Fig. 2.
Table 1. A detailed description about the vibration signal
Database

Rolling bearing fault

Defect size (mm)

Class

Data distribution


Training samples

Testing samples

Total samples


Subset 1

Normal

0

1

58

58

116

Subset 2

Outer race 1

0.1778

2

58

58

116

Subset 3

Outer race 2

0.3556

3

58

58

116

Subset 4

Ball element 1

0.1778

4

58

58

116

Subset 5

Ball element 2

0.3556

5

58

58

116

Subset 6

Inner race 1

0.1778

6

58

58

116

Subset 7

Inner race 2

0.3556

7

58

58

116

3. Methodology
This section is composed of three subsections: the principle of VMD is presented in the first subsection; subsequently, the MSFD and MSEN are proposed in the second subsection; finally, the optimized SVM, i.e., GASVM is introduced in the last subsection.
Fig. 2. Typical waveform of seven classes of rolling bearing vibration signals
3.1. Variational mode decomposition
VMD is a newlydeveloped method for adaptive and quasiorthogonal signal decomposition, and it is very robust against noise and sampling [19]. VMD is able to decompose a real valued signal $f$ into a discrete number of subsignals ${u}_{k}$. Each subsignal, namely BLIMF, has specific sparsity properties while reproducing the input [19]. Here, each mode ${u}_{k}$ should be mostly compacted around a center pulsation ${w}_{k}$. The proposition of VMD can be summarized as a constrained variational problem [19]:
To solve Eq. (1), the quadratic penalty term and Lagrangian multipliers $\lambda $ are introduced. Accordingly, the augmented Lagrangian is given as following [19]:
$\mathrm{}\mathrm{}\mathrm{}\mathrm{}\mathrm{}\mathrm{}+{\Vert f\left(t\right)\sum _{k}{u}_{k}\left(t\right)\Vert}^{2}+\u2329\lambda \left(t\right),f\left(t\right)\sum _{k}{u}_{k}\left(t\right)\u232a,$
where $\alpha $ represents the balance parameter of the data fidelity.
The alternate direction method of multipliers (ADMM) [23] is introduced to solve the augmented Lagrangian problem. By alternating renewal ${u}_{k}^{n+1}$, ${w}_{k}^{n+1}$, ${\lambda}^{n+1}$ to search the saddle point of the extend Lagrangian expression. In this case, the value of ${u}_{k}^{n+1}$ can be described as:
According to Parseval/Plancherel Fourier isometric transformation, the frequencydomain expression of Eq. (3) can be expressed as [19]:
$\mathrm{}\mathrm{}\mathrm{}\mathrm{}\mathrm{}\mathrm{}+{\Vert \widehat{f}\left(w\right)\sum _{i}{\widehat{u}}_{i}\left(w\right)+\frac{\widehat{\lambda}\left(w\right)}{2}\Vert}_{2}^{2}.$
By means of a series of computation, solution of the quadratic optimization issue is given by [19]:
According to the same process, the recursion formula of the center frequency is obtained as [19]:
By implementing VMD, the raw vibration signal is decomposed into a fixed number of subcomponents (BLIMFs). Under the circumstances, the hidden characteristics of vibration signals are revealed and feature extraction procedure is subsequently carried out.
3.2. Multiscale fractal dimension and energy
Fractal dimensions (FD) can quantitatively characterize the nonlinear behavior of the vibration signals. There are many computing methods of FD, such as boxcounting dimension, information dimension, correlation dimension, similar dimension, spectral dimension and so forth. Among them, the boxcounting dimension [24] is widely used because of its easier calculation method compared with other methods. Therefore, the boxcounting dimension is applied to quantify the nonlinear characteristic of BLIMFs in this paper.
For a given discrete sequence $\left\{x\right(j),j=\mathrm{1,2},...,{N}_{0}\}$ and $x$ is the closed set of $n$ dimension geometry space ${R}^{n}$. ${R}^{n}$ is divided into as many grids as possible. The width of the grid is $\mathrm{\Delta}$, $\mathrm{\Delta}$ is defined as the smallest grid and being multiplied by positive integer $m$ defined as $m\mathrm{\Delta}$, ${N}_{m\mathrm{\Delta}}$ is the box counting number in the discrete signal $X$ with the grid which the width is $m\mathrm{\Delta}$. Suppose $P\left(m\mathrm{\Delta}\right)$ is the scope of longitudinal coordinate and ${N}_{m\mathrm{\Delta}}$ is the corresponding box counting number, and they can be calculated as follows [24]:
The scaleless region is defined as the scope within the line with good linearity in the coordinate system of $\mathrm{l}\mathrm{g}m\mathrm{\Delta}~\mathrm{l}\mathrm{g}{N}_{m\mathrm{\Delta}}$. The start point and end point of the scaleless region are defined as ${m}_{1}$, ${m}_{2}$, then least square is adopted to fit the linear variation trend. The estimated value of the slope of the fitting line is regarded as the value of boxcounting dimension [24]:
Energy can reflect the energy distribution property of each BLIMF and it is beneficial for recognizing different rolling bearing fault categories. Given this, the energy of each BLIMF is calculated. In our study, energy is defined as the sum of the absolute value of signal and it is given by:
where $n$ is the number of sampling points.
By extracting the boxcounting dimension and energy, the redundant timedomain signal has been condensed into a twodimensional feature vector. However, a single scale fractal dimension and energy only provide the global information. To reveal more comprehensive characteristics of a signal, multiscale [25] is introduced. MSFD and MSEN are the extensional versions of conventional fractal dimension and energy, which describe the characteristics of the experimental objects under different scale factors.
The procedure of MSFD and MSEN consist of two steps: (1) coarsegrained operation and (2) calculating FD/Energy using coarsegrained vector. Concretely, for a onedimensional data series $x$ of $N$ sampling points, the new coarsegrained vector can be constructed as:
Fig. 3. The coarse grained vector under different scale factors
In this case, different value of $\tau $ can produce different coarsegrained vector, and then its corresponding fractal dimension and energy can be calculated. When $\tau $ equals to one, the coarse grained vector becomes the original time series $x$; on the other hand, if $\tau $ is greater than one, the original data is divided into $N/\tau $ coarse grained vector $\left\{{y}^{\tau}\right\}$ (shown in Fig. 3). Based on above presented principle of multiscale analysis, the MSFD and MSEN of each BLIMF are calculated as features of the raw vibration signals.
3.3. Support vector machine
SVM, which is structured based on VapnikChervonenkis (VC) dimension theory and structural risk minimization (SRM) rule, was firstly proposed by Vapnik et al. [26]. As one of the stateoftheart machine learning algorithms, SVM shows many specific advantages in solving small sample, nonlinear and highdimension pattern recognition problems [27].
For $N$ training samples$\left\{\right({x}_{1},{y}_{1}),({x}_{2},{y}_{2}),\cdots ({x}_{N},{y}_{N}\left)\right\}$, where the $i$th sample ${x}_{i}\in {R}^{n}$ and it belongs to one label ${y}_{i}=$+1 or ${y}_{i}=$ –1. Where +1 and –1 represent the positive and negative classes respectively. By mapping the input vector $x$ into the higherdimensional feature space, the linearly separable hyperplane is defined as [26]:
where $\omega $ is the hyperplane’s normal vector, and $b$ is a constant.
When the inputs are linearly inseparable, slack variable ${\xi}_{i}$ is introduced. Accordingly, the issue of constructing optimal hyperplane is translated into a constraint equation [26]:
where $C$ represents the penalty parameter and $\varphi $ is a mapping function which responsible for mapping the inputs into a higherdimensional feature space.
To solve the abovementioned optimization issue, Lagrange multipliers are introduced and Eq. (13) is rewritten as [26]:
where $K({x}_{i},{x}_{j})=\varphi \left({x}_{i}\right)\cdot \varphi \left({x}_{j}\right)$ is the kernel function of SVM, ${\alpha}_{i}$ is the Lagrange multiplier.
Eventually, the optimal classification function can be obtained as [26]:
There are multiple kernel functions, the linear, polynomial and radial basis function (RBF) kernel functions are the popular ones [27]. Among them, RBF kernel is the most widely used kernel function and it is adopted in this work. The expression of RBF is given by:
It should be emphasized that SVM is substantially a binary classifier, while in this paper our objective is to classify seven kinds of rolling bearing vibration signals. Up until now, various algorithms have been presented to structure the multiclass SVM. Among them, the oneversusrest (OVR) [26], oneversusone (OVO) [28] and directed acyclic graphs (DAG) [29] are the most common used approaches. In the presented study, the OVO technique is selected to construct the multiclass SVM for differentiating seven types of vibration signals. In OVO, a child classification model is built on any two groups of training samples. Therefore, for a $k$classifying case, $k\times (k1)/2$ binary child classifiers are required, that is, 21 SVMs are utilized to establish the sevenclassifying SVM classifier. When a testing sample is fed into the established classifier for classification, its final class is the label which gains the most votes.
Additionally, penalty parameter $C$ and kernel parameter $\sigma $ play a crucial role in determining the classification performance of SVM [30]. Given this, GA [30] is applied to search for the best combination of parameters $C$ and $\sigma $. The main parameters of GA are the generations, population size, crossover probability and mutation probability. In our study, the generations, population size, crossover probability and mutation probability are set to 200, 20, 0.9 and 0.05, respectively. Furthermore, the value ranges of $C$ and $\sigma $ are both set to the range of 0.1 to 200. A brief flowchart of optimizing SVM using GA is illustrated in Fig. 4.
Fig. 4. Flowchart of optimizing SVM using GA
4. Results
In current paper, all experiments were implemented in MATLAB 2013a environment and a 2.53 GHz Core^{TM} processor. In this section, the experimental results of extracting BLIMFs using VMD are presented in the first part, and then the calculated MSFD and MSEN of each BLIMF are presented in the second part. Subsequently, oneway ANOVA test was implemented and the features were ranked according their $F$value, then those significant features were selected and fed into a GASVM for classification. Finally, the classification performance under different signaltonoise ratios (SNRs) was discussed.
4.1. Waveform extraction based on VMD
The VMD algorithm is able to decompose an arbitrary signal into a series of BLIMFs. Before implementing VMD, we should give the mode number $k$, different center frequencies could be useful in distinguishing kinds of patterns. After many experiments, we found that when $k=\mathrm{}$4, the center frequencies were diverse from each other. As a consequence, the number of BLIMFs was set to 4. Each rolling bearing class had 58 training samples, we selected one sample as the experimental subject to show the results of VMD. The four corresponding BLIMFs of these seven kinds of vibration signals are shown in Fig. 5. As illustrated in Fig. 5, each BLIMF of these seven classes of vibration signals somewhat varies from each other. Concretely speaking, the most obvious characteristic is each BLIMF of Class 2, 6 and BLIMF_{2} to BLIMF_{4} of Class 7 contains distinct impulse component but others not or not so obvious. Moreover, the main energy concentrations of Class 1, 2, 3, 4, 5, 6, 7 are in BLIMF_{2}, BLIMF_{3}/BLIMF_{4}, BLIMF_{2}, BLIMF_{4}, BLIMF_{3}/BLIMF_{4}, BLIMF_{3}/BLIMF_{4}, BLIMF_{2}/BLIMF_{3}/ BLIMF_{4}, respectively. There is no doubt that these waveforms are different from each other, however, it is still a very hard work to identify the fault mode from the extracted waveforms.
Fig. 5. Plots of BLIMFS after decomposing the seven classes of vibration signal based on the VMD
4.2. Feature extraction based on MSFD and MSEN
Each kind of vibration signals had been decomposed into four BLIMFs after applying VMD. In an attempt to extract features of BLIMFs of rolling bearing segments, we computed the MSFD and MSEN of four generated BLIMFs as the condensed features, and this procedure was known as feature extraction. In this section, we gave the mean value and standard deviation of MSFD and MSEN of each BLIMF of the 58 samples of each class in Fig. 6. For each kind of vibration signals, which had four decomposed BLIMFs, the 5scale MSFD and MSEN of each BLIMF were computed, and then they were arranged into a 40dimension feature vectors. The horizontal axis represents the features, among them, the first five features represent the 5scale features of BLIMF_{1}, the 610 features mean the 5scale features of BLIMF_{2}, the 1115 and 1620 features represent the 5scale features of BLIMF_{3} and BLIMF_{4}, respectively. The blue and green lines stand for the statistics trend of MSFD and MSEN of these four BLIMFs. The error bar in the plot shows the magnitude of the standard deviation, and the middle of the error bar is the mean value of MSFD and MSEN. The first row of this figure shows results of outer race1 (left subgraph) and outer race 2 (right subgraph); the second row exhibits the results of ball element 1 (left subgraph) and ball element 2 (right subgraph); the third row displays the results of inner race 1 (left subgraph) and inner race 2 (right subgraph) and the fourth row, namely, the last subgraph, is the results of the normal condition of the rolling bearing.
From Fig. 6, it can be observed that the MSFD and MSFN of the normal condition of the rolling bearing change a little and keep smoothly. Furthermore, their error bars are smaller than others’, and the mean MSEN fluctuates near zero. In addition, the 5scale MSFD of BLIMF_{1} are almost the same, which indicates that the vibration signals of the normal condition are steady. Considering the maximum peaks of MSEN for other fault conditions, the values of outer race 1, ball element 1 and inner race 1 are close to 1000, 300 and 600, respectively. The variation range of MSFD of outer race fault is the biggest one among all the faults; on the contrary, the range of MSFD of inner race fault is the smallest. For the outer race fault, the error bars of outer race 1 are longer than that of outer race 2, and it indicates that the greater the damage, the shorter error bars obtained. What’s more, the maximum peak of the MSEN of outer race 1 is bigger than that of outer race 2. For the ball element and inner race fault, the greater the damage, the longer the error bars shown, as well as the smaller maximum peak of the MSEN measured.
Fig. 6. The mean value and standard deviation of MSFD and MSEN
4.3. Classification using GASVM
We had extracted 20dimensional MSFD and 20dimensional MSEN features after applied multiscale analysis. Among the 40dimensional features, some features were redundant and they contributed few for classification accuracy. In order to obtain the significant features and reduce the computational complexity, we employed the oneway ANOVA test to analyze each feature, and then we could get the $F$statistic values corresponding to each feature. The $F$value represents the significance level, the larger the value of $F$, the more significance level is. Hence we could conclude that a large $F$ leaded to a high classification accuracy in some degree. Fig. 7 shows the result of carrying out oneway ANOVA for each feature, and the first 20dimensional features represent the $F$ value of MSFN, and the last 20dimensional features signify the $F$ value of MSEN. Through the presented histogram of $F$, we could easily distinguish which feature made more contribution to the classification performance. For example, the second dimensional feature has the greatest $F$value and it indicates that the second dimensional feature possesses the greatest impact on classification accuracy. On the contrary, the $F$value of 24th dimensional feature is smallest, which imply that the 24th dimensional feature is likely to be redundant and has tiny influence on classification performance.
Rearranged the order of the 40dimensional features according to the $F$value from largest to smallest. In this case, the first ranked feature possesses the largest $F$ and the last ranged feature owns the smallest $F$. Subsequently, 40 times independent experiments were implemented for various cases. For the $m$th ($m=\mathrm{}$1, 2,..., 40) experiment, the first mdimensional features were fed into the GASVM for classification, respectively. In addition, in order to guarantee the stability and veracity of classification accuracy, 10fold cross validation was also adopted in the classification step and final classification accuracy was obtained and shown in Fig. 8. From Fig. 8, we know that along with the increase of the number of employed features, the classification accuracy is maintaining growth. When the number of feature dimensions increased to 30, the classification accuracy reaches a maximum value, and the classification accuracy no longer or barely changed with the increase of the feature dimensions. Therefore we could concluded that the first ranked 30dimensional features could best represent the difference of the seven classes rolling bearing statuses under the premise of ensuring high accuracy, and the last 10dimension features were redundant and they contributed a little for classification performance, thus we eliminated the last 10dimension features and just employed the first 30dimension features as the significant features to distinguish the seven classes rolling bearing statuses.
By applying oneway ANOVA test, we had extracted the significant features from the redundant features. Subsequently, the training samples were used to build the GASVM classifier. Once again, the 10fold cross validation was utilized to prevent overfitting and obtain the appropriate parameters of SVM in training procedure. Finally, the remaining testing samples were fed into the trained GASVM to verify the rationality of the established classifier, and the predicted categories of testing samples are shown in Fig. 9. As displayed in Fig. 9, only one sample is misclassified among the 406 testing samples, and the classification accuracy is as high as 99.75 %.
Fig. 7. Results of oneway ANOVA for each feature
Fig. 8. Classification accuracy of the ranked feature in 40 cases
In an attempt to prove the necessity of decomposing the original signals, in addition to above experiment, we also calculated the MSFD and MSEN of the original signals directly, and then the extracted feature vectors were used to train and test the newlybuilt GASVM classifier. The rolling bearing data used in the experiment was the same as abovementioned data and the final classification result without decomposing the vibration signals is shown in Fig. 10. As illustrated in Fig. 10, there exists 9 misclassified samples among the 406 testing samples, and the classification accuracy was 97.78 %. Compared with Fig. 9, this result indicates that it is necessary to decompose the original vibration signal using VMD before extracting features from the original vibration signals.
4.4. Influence of noise on classification performance
In practical situations, the collected vibration signals are inevitably contaminated by noise. It is necessary to investigate the antinoise property of the presented diagnosis method in different signaltonoise ratio (SNR). Besides, with the decline of SNR, a 5scale feature extraction operation might result in mediocre classification accuracy and a higherscale operation should be taken into account. Hence, to simulate actual situations, the Gaussian white noise at diverse levels was added in the original signals and three different scales, namely 5, 8 and 12scale feature extraction operations were studied. Fig. 11 plots the error bars of the classification accuracy under different scales and SNRs. As shown in Fig. 11, a phenomenon can be observed that the larger the scale is, the higher classification accuracy can be achieved. Given a specified multiscale value, with the increase of SNR, the classification accuracy becomes higher and higher. When SNR is less than 30 dB, a higherscales operation should be chosen to acquire the better classification accuracy. While SNR is larger than 30 dB, the lowerscales can lead to an approving classification accuracy with a faster computing speed.
Fig. 9. Classification results of the earlier 30dimension features
Fig. 10. Classification results without decomposing the original vibration signals
Fig. 11. Classification performances of GASVM with the variation of SNRs and scales
5. Conclusions
In this paper, we proposed a novel rolling bearing fault diagnosis method based on VMDmultiscale fractal dimension/energy and GASVM. Seven classes of rolling bearing vibration signals were used to verify the effectiveness of the proposed method and the classification accuracy is as high as 99.75 %. In addition, our approach shows good antinoise performance in different signaltonoise ratios. The method we proposed can be a useful tool to monitor the rolling bearing operating state automatically and accurately. It can assist the rotating machinery users detect the fault degree and fault location timely and avoid the unnecessary downtime of the rotating machinery, the reliability of the rotating machinery can thus be improved. Future directions of our research may include application of the proposed method for diagnosis of other mechanical faults.
Acknowledgements
This work is supported by the Specialized Research Fund of the One Thousand sets of Domestic CNC Lathe Reliability Engineering (Grant No. 2013ZX04011011).
References
 Li M., Wang M., Wang C. G. Research on SVM classification performance in rolling bearing diagnosis. International Conference on Intelligent Computation Technology and Automation, 2010, p. 132135. [Search CrossRef]
 Logan D., Mathew J. Using the correlation dimension for vibration fault diagnosis of rolling element bearings. Part 1: basic concepts. Mechanical Systems and Signal Processing, Vol. 10, Issue 3, 1996, p. 241250. [Publisher]
 Zhang P. L., Li B., Mi S. S. Bearing fault detection using multiscale fractal dimensions based on morphological covers. Shock and Vibration, Vol. 19, Issue 6, 2012, p. 13731383. [Publisher]
 Yan R., Gao R. X. Approximate Entropy as a diagnostic tool for machine health monitoring. Mechanical Systems and Signal Processing, Vol. 21, Issue 2, 2007, p. 824839. [Publisher]
 Wang F., Zhang Y., Zhang B. Application of wavelet packet sample entropy in the forecast of rolling element bearing fault trend. International Conference on Multimedia and Signal Processing, IEEE Computer Society, 2011, p. 1216. [Publisher]
 Liu H., Han M. A fault diagnosis method based on local mean decomposition and multiscale entropy for roller bearings. Mechanism and Machine Theory, Vol. 75, Issue 5, 2014, p. 6778. [Publisher]
 Chen W., Wang Z., Xie H. Characterization of surface EMG signal based on fuzzy entropy. IEEE Transactions on Neural Systems and Rehabilitation Engineering, Vol. 15, Issue 2, 2007, p. 266272. [Publisher]
 Zheng J., Cheng J., Yang Y. A rolling bearing fault diagnosis method based on multiscale fuzzy entropy and variable predictive modelbased class discrimination. Mechanism and Machine Theory, Vol. 78, Issue 16, 2014, p. 187200. [Publisher]
 Tse P. W., Peng Y. H., Yam R. Wavelet analysis and envelope detection for rolling element bearing fault diagnosis – their effectiveness and flexibilities. Journal of Vibration and Acoustics, Vol. 123, Issue 3, 2001, p. 303310. [Publisher]
 Kankar P. K., Sharma S. C., Harsha S. P. Rolling element bearing fault diagnosis using wavelet transform. Neurocomputing, Vol. 74, Issue 10, 2011, p. 16381645. [Publisher]
 Zhong X. Y., Zhao C. H., Dong H. J., Liu X. M., Zeng L. C. Rolling bearing fault diagnosis using sample entropy and 1.5dimension spectrum based on EMD. Applied Mechanics and Materials, 2013, p. 10271031. [Search CrossRef]
 Zhao S. F., Liang L., Xu G. H. Quantitative diagnosis of a spalllike fault of a rolling element bearing by empirical mode decomposition and the approximate entropy method. Mechanical Systems and Signal Processing, Vol. 40, Issue 1, 2013, p. 154177. [Publisher]
 Zhu K., Song X., Xue D. Incipient fault diagnosis of roller bearings using empirical mode decomposition and correlation coefficient. Journal of Vibroengineering, Vol. 15, Issue 2, 2009, p. 597603. [Search CrossRef]
 Smith J. S. The local mean decomposition and its application to EEG perception data. Journal of the Royal Society Interface, Vol. 2, Issue 5, 2005, p. 443454. [Publisher]
 Cheng J., Yang Y., Yang Y. A rotating machinery fault diagnosis method based on local mean decomposition. Digital Signal Processing, Vol. 22, Issue 2, 2012, p. 356366. [Publisher]
 Mark G. Frei Intrinsic timescale decomposition: timefrequencyenergy analysis and realtime filtering of nonstationary signals. Royal Society of London Proceedings, Vol. 463, Issue 2078, 2007, p. 321342. [Search CrossRef]
 Luo S. R., Cheng J. S., Zheng J. D. Early fault diagnosis of rolling bearing based on ITD fractal fuzzy entropy. Journal of Vibration, Measurement and Diagnosis, Vol. 33, Issue 4, 2013, p. 706711. [Search CrossRef]
 Zheng J., Cheng J., Yang Y. A rolling bearing fault diagnosis approach based on LCD and fuzzy entropy. Mechanism and Machine Theory, Vol. 70, Issue 6, 2013, p. 441453. [Publisher]
 Konstantin D., Dominique Z. Variational mode decomposition. IEEE Transactions on Signal Processing, Vol. 3, Issue 62, 2014, p. 531544. [Search CrossRef]
 Wang Y., Markert R., Xiang J. Research on variational mode decomposition and its application in detecting rubimpact fault of the rotor system. Mechanical Systems and Signal Processing, Vol. 60, 2015, p. 243251. [Publisher]
 Li X., Zheng A., Zhang X. Rolling element bearing fault detection using support vector machine with improved ant colony optimization. Measurement, Vol. 46, Issue 8, 2013, p. 27262734. [Publisher]
 Loparo K. A. Bearings Data Center. Case Western Reserve University, http://csegroups.case.edu/bearingdatacenter/home. [Search CrossRef]
 Tosserams S., Etman L. F. P., Papalambros P. Y. An augmented Lagrangian relaxation for analytical target cascading using the alternating direction method of multipliers. Structural and Multidisciplinary Optimization, Vol. 31, Issue 3, 2006, p. 176189. [Publisher]
 Yang G., Liu Y., Zhao L. Typical power quality disturbance identification based on fractal box dimension. International Workshop on ChaosFractals Theories and Applications, 2009, p. 412416. [Publisher]
 Costa M. Multiscale entropy analysis of complex physiologic time series. Physical Review Letters, Vol. 89, Issue 3, 2002, p. 705708. [Publisher]
 Vapnik V. N. The Nature of Statistical Learning Theory. SpringerVerlag, New York, 1995. [Publisher]
 Yang J., Zhang Y., Zhu Y. Intelligent fault diagnosis of rolling element bearing based on SVMs and fractal dimension. Mechanical Systems and Signal Processing, Vol. 21, Issue 5, 2007, p. 20122014. [Publisher]
 Knerr S., Personnaz L., Dreyfus G. Singlelayer learning revisited: a stepwise procedure for building and training a neural network. Neurocomputing, Vol. 68, 1990, p. 4150. [Publisher]
 Platt J. C., Cristianini N., ShaweTaylor J. Large margin DAGs for multiclass classification. Advances in Neural Information Processing Systems, Vol. 12, Issue 3, 2000, p. 47553. [Search CrossRef]
 Sajan K. S., Kumar V., Tyagi B. Genetic algorithm based support vector machine for online voltage stability monitoring. International Journal of Electrical Power and Energy Systems, Vol. 75, 2015, p. 200208. [Publisher]
Cited By
Shock and Vibration
Xiwen Qin, Dingxin Xu, Xiaogang Dong, Xueteng Cui, Siqi Zhang, Arturo GarciaPerez

2021
