Jian Ma^{3} , Chen Lu^{2} , Jinwen Sun^{1}
^{3, 2, 1}School of Reliability and Systems Engineering, Beihang University, Beijing, 100191, China
^{3, 2, 1}Science and Technology on Reliability and Environmental Engineering Laboratory, Beijing, 100191, China
^{3}Corresponding author
Vibroengineering PROCEDIA, Vol. 14, 2017, p. 9196.
https://doi.org/10.21595/vp.2017.19196
Received 17 September 2017; accepted 25 September 2017; published 21 October 2017
Copyright © 2017  JVE International Ltd.
The vibration signals of a gearbox always contain the dynamic operation information, which are important for the feature extraction and further work. However, the low signaltonoise ratio and combined multimode faults make it difficult to extract discriminable features of gearboxes. In this study, a feature fusion method based on wavelet packet decomposition (WPD), singular value decomposition (SVD) and $t$Distributed stochastic neighbor embedding ($t$SNE) for gearbox fault diagnosis is proposed. First, timefrequency analysis method of WPTSVD as well as timedomain analysis methods are utilized to extract robust feature vectors of gearboxes with different conditions. As an effective method for the visualization of highdimensional datasets, $t$SNE is then introduced to realize the dimensionality reduction of feature vectors. Finally, with the fused features, a radial basis function (RBF) neural network is trained to realize the classification of gearbox fault modes. Sufficient experiments have been implemented to validate the effectiveness and superiority of the proposed method by analyzing the vibration signals of gearboxes.
Keywords: gearbox, fault diagnosis, wavelet packet decomposition, t.
As one of the most important machine components, gearboxes are extensively used in transmission design of many rotating machine. However, the severe operation conditions of heavy duty and intensive impact load may result in gear tooth damage and other fault modes, which heavily influences the working condition of the whole systems [1]. In order to reduce the operation and maintenance costs for gearboxes, numerous studies have been conducted to realize the gearbox fault recognition [2, 3]. But the low signaltonoise ratio and combined multimode faults make it still a challenge to extract discriminable features for gearboxes. This study provides a feature fusion method based on wavelet packet decomposition (WPD), singular value decomposition (SVD) and $t$Distributed stochastic neighbor embedding ($t$SNE) for gearbox fault diagnosis.
For fault diagnosis, one of challenges is to obtain reliable features of the gearbox by analyzing the monitoring vibration signals in the first step. Generally, the main feature extraction methods include timedomain methods, frequencydomain methods, and timefrequency methods. Timedomain analysis methods such as rootmeansquare (RMS) value, crest factor, form factor, kurtosis and skewness have been successfully used to realize fault diagnosis of rotating machine [4]. Frequencydomain analysis methods include Fourier transform, cepstrum analysis and so on. As for timefrequency analysis methods such as shorttime Fourier transform (STFT) and empirical mode decomposition (EMD), they have been proven effective to extract features from nonlinear and nonstationary vibration signals [5]. Among these timefrequency techniques, WPD is one of the best tools since it has particular advantages for decomposing original signals into different frequency bands. And the SVD method can be utilized to form the final feature vectors based on the results of WPD. To extract robust representatives for the gearbox, both timedomain analysis methods and timefrequency analysis method of WPTSVD are applied in this study.
Mapping the extracted highdimensional feature representatives into lowdimensional space properly is another challenge in this paper. A large number of dimensionality reduction techniques have been proposed, such as PCA, KPCA and manifold learning methods like local tangent space alignment (LTSA) [6]. However, most of these methods have the limitation to capture both the local and global structure of the highdimensional features. To realize the presence of clusters at several scales, Maaten et al. proposed the $t$SNE method which can achieve good visualization of highdimensional data [7]. In this study, $t$SNE is employed as the dimensionality reduction method to get the most discriminable features.
Inspired by the aforementioned challenges, a novel feature fusion method for gearbox fault diagnosis is proposed in this study. Our contributions are summarized as follows: Firstly, we proposed an effective feature extraction method relying on WPDSVD and timedomain analysis methods for gearboxes. The extracted robust feature vectors embody the key information of gearbox operation condition. Secondly, a $t$SNE based dimensionality reduction method is employed to obtain the discriminable features, relying on which fault diagnosis can be realized with a RBF neural network model. Moreover, sufficient experiments are conducted by comparing with the existing methods based on the operation data of gearboxes, which demonstrates the feasibility and effectiveness of our proposed approach.
The rest of this paper is organized as follows. In Section 2, we explain the overall scheme of fault diagnosis and the mathematical principles. The results of case study are provided and analyzed in Section 3, followed by our conclusion in Section 4.
The procedure of our methodology is shown in Fig. 1. This paper provides a fault diagnosis methodology which contains two main steps as below:
– In the first step, timedomain analysis methods, including the RMS value, crest factor, form factor, kurtosis and skewness, are applied to extract the timedomain features of gearboxes, while the WPDSVD method is employed to form the timefrequency feature vectors.
– The second step involves dimensionality reduction of the extracted feature vectors based on $t$SNE. Relying on the obtained lowdimensional fused features, the RBF neural network model can be trained to realize the fault mode classification.
The timedomain parameters of the original signals including the mean value, the maximum value, the RMS value, etc. In this study, the RMS value, crest factor, form factor, kurtosis and skewness are chosen to form the timedomain features of gearboxes.
If ${x}_{s}\left(t\right)$ denotes a set of sampling data ${x}_{1},{x}_{2},...,{x}_{N}$, the five chosen timedomain parameters are calculated as follows:
– The RMS value: $\alpha ={x}_{rms}=\sqrt{\frac{1}{N}{\sum}_{i=1}^{N}({x}_{i}{)}^{2}}.$
– The crest factor: $\beta =\mathrm{m}\mathrm{a}\mathrm{x}\left(x\right)/{x}_{rms}.$
– The form factor: $\gamma =\frac{1}{N}{x}_{rms}/{\sum}_{i=1}^{N}\left{x}_{i}\right.$
– The kurtosis: $\sigma =\frac{1}{N}{\sum}_{i=1}^{N}{\left({x}_{i}\mathrm{m}\mathrm{e}\mathrm{a}\mathrm{n}\left(x\right)/var\left(x\right)\right)}^{4}.$
– The skewness: $\epsilon =\frac{1}{N}{\sum}_{i=1}^{N}{\left({x}_{i}\mathrm{m}\mathrm{e}\mathrm{a}\mathrm{n}\left(x\right)/var\left(x\right)\right)}^{3}.$
Then the timedomain feature vectors of gearboxes can be expressed as $C=[\alpha ,\beta ,\gamma ,\sigma ,\epsilon ]$.
Fig. 1. The procedure of the proposed methodology
The method of WPD has the framework of multiresolution analysis based on wavelet analysis. The function of wavelet packet ${W}_{j,k}^{n}\left(t\right)$ can be expressed as:
where $n$ denotes the decomposition level, $j$ denotes the scale factor, and $k$ denotes the translation factor.
Here the original signal in time domain is defined as $f\left(t\right)$, and the sampling rate is ${f}_{s}$. If the signal $f\left(t\right)$ is decomposed by WPD with level $J$, ${2}^{J}$ group of wavelet packet coefficients can be obtained. The $i$th wavelet packet coefficients can be expressed as:
Then, the SVD method can be used to extract the prominent features from the wavelet packet coefficients. The singular values of wavelet packet coefficients can be utilized to represent timefrequency features in this study.
SNE is introduced based on the spirit of converting the highdimensional Euclidean distances between data points into conditional probabilities that represent similarities. The similarity of data point ${x}_{j}$ to data point ${x}_{i}$ is the conditional probability, which can be denoted as ${p}_{ji}$:
where ${\sigma}_{i}$ indicates the variance of the Gaussian distribution that is centered on data point ${x}_{i}$, and ${p}_{i\lefti\right.}$ is set as zero.
For the lowdimensional mapping values ${y}_{i}$ and ${y}_{j}$ corresponding to the original ${x}_{i}$ and ${x}_{j}$, the similarity ${q}_{j\lefti\right.}$ between them can be calculated by:
where $\delta =1/\sqrt{2}$ and ${q}_{i\lefti\right.}$ is set as zero.
In order to make ${p}_{i\lefti\right.}$ match ${q}_{j\lefti\right.}$ best, the sum of KullbackLeibler divergences over all data points is minimized by a gradient descent method. The cost function is expressed as:
where ${P}_{i}$ represents the distribution of ${p}_{ji}$, and ${Q}_{i}$ represents the distribution of ${q}_{j\lefti\right.}$.
As an extension of Stochastic Neighbor Embedding (SNE), $t$SNE was proposed for visualizing highdimensional data [7]. To optimize the cost function more effectively, $t$SNE was proposed with two improvements. Firstly, a symmetric version of SNE cost function is selected by minimizing a single KullbackLeibler divergence between the joint probability distribution $P$ in the highdimensional space and $Q$ in the lowdimensional space, respectively:
where ${p}_{ij}$ and ${q}_{ij}$ are expressed as:
To solve the problem that the widely separated data tend to be crowded in the lowdimensional space, $t$SNE employs a Student$t$ distribution rather than a Gaussian distribution to convert distances into probabilities in the lowdimensional space. Then ${q}_{ij}$ can be defined as:
And the gradient is modified as:
By solving the problems of SNE cost function, $t$SNE can realize better dimensionality reduction of highdimensional datasets.
The dataset in the 2009 PHM Conference Data Analysis Competition is applied in this paper.
The gearbox dataset consists of two types of gearboxes and fourteen kinds of fault modes. Data were collected at 30, 35, 40, 45 and 50 Hz shaft speed while being subjected to either high or low loading. To demonstrate the feasibility and effectiveness of the proposed method, we choose six typical conditions of spur gearboxes including one normal state and five fault states under 40 Hz as listed in Table 1.
Table 1. Description of fault modes in the experiment
Case

Normal

Fault 1

Fault 2

Fault 3

Fault 4

Fault 5


Gear

32T

Good

Chipped

Good

Good

Chipped

Good

48T

Good

Eccentric

Eccentric

Eccentric

Eccentric

Good


80T

Good

Good

Good

Broken

Broken

Broken


Bearing

IS:IS

Good

Good

Good

Ball

Inner

Inner

ID:IS

Good

Good

Good

Good

Ball

Ball


OS:IS

Good

Good

Good

Good

Outer

Outer


Shaft

Input

Good

Good

Good

Good

Good

Imbalance

For each state of gearboxes, 200 samples are generated with every 5000 data points. In the process of feature extraction, each sample is firstly used to obtain five timedomain parameters: the RMS value, crest factor, form factor, kurtosis and skewness. These timedomain feature vectors are normalized to eliminate the dimension effects. Then the sample is decomposed to acquire eight wavelet packet coefficients by WPD with the decomposition level 3. Based on the wavelet packet coefficients, the singular values which can represent the timefrequency features of gearboxes are obtained by SVD. The timedomain parameters and the WPDSVD results together constitute the robust feature vectors of gearboxes, as shown in Fig. 2.
Fig. 2. The results of feature extraction
In this section, $t$SNE is utilized to fuse the highdimensional feature vectors of gearboxes. The former 13dimensional features are reduced to be 3dimensional features, which can be seen as the key representatives of gearboxes. To evaluate the effectiveness and superiority of $t$SNE method, the traditional methods of PCA and LTSA are also applied to the same dataset. The perplexity of $t$SNE method is set as 25. By comparing the results of the different methods, we can find that the results of $t$SNE have the best visualization effects as well as the best separability in lowdimensional feature space, as shown in Fig. 3.
Based on the lowdimensional fused features given by $t$SNE, further fault diagnosis can be carried out. Here, 150 samples from normal condition and five fault modes are selected to train the RBF model, respectively. With the trained classification model, 50 samples from each state are chosen for testing as the model input. The samples of the PCA results and the samples of the LTSA results are also implemented to verify the classification performance.
As showed in Table 2, the accuracy rate of fault diagnosis based on $t$SNE can reach 100 %, while other feature fusion methods cannot cluster all the fault states of gearboxes clearly, which results in low accuracy of classification. The result of fault mode classification verifies the effectiveness and superiority of the proposed feature extraction and feature fusion method.
Fig. 3. The results of feature fusion
a) Result of PCA
b) Result of LTSA
c) Result of $t$SNE
Table 2. Accuracy of classification using different methods
Method

Total

Normal

Fault 1

Fault 2

Fault 3

Fault 4

Fault 5

tSNE

1

1

1

1

1

1

1

PCA

0.9600

1

1

0.8200

1

0.9600

0.9800

LTSA

0.8567

0.9600

1

0.8000

1

0.9600

0.4200

In this paper, a novel method of feature fusion based on WPDSVD and $t$SNE for gearbox fault diagnosis is proposed. In the first step, several timedomain parameters and singular values based on WPDSVD are both obtained by processing the vibration signals of gearboxes, which together constitute the robust feature vectors. Then, $t$SNE, as an effective method for the visualization of highdimensional datasets, is introduced to realize dimensionality reduction of the extracted feature vectors. Based on the fused features, a RBF based fault diagnosis model is applied to achieve the gearbox fault mode classification. Sufficient experiments have been implemented to demonstrate the effectiveness and superiority of the proposed method by analyzing the vibration signals of gearboxes.