An approach to fault diagnosis for gearbox based on order tracking and extreme learning machine
Hua Su^{1} , Chen Lu^{2} , Jian Ma^{3}
^{1, 2, 3}School of Reliability and Systems Engineering, Beihang University, Beijing, China
^{1, 2, 3}Science and Technology on Reliability and Environmental Engineering Laboratory, Beijing, China
^{3}Corresponding author
Vibroengineering PROCEDIA, Vol. 10, 2016, p. 210216.
Received 7 November 2016; accepted 8 November 2016; published 8 December 2016
JVE Conferences
Varying speed machinery condition detection and fault diagnosis are more difficult due to nonstationary machine dynamics and vibration. In this paper, an intelligent fault diagnosis method based on order analysis and extreme learning machine (ELM) is proposed. Order tracking, easily identifying speedrelated vibrations, is useful for machine condition monitoring, which could obtain the resampling signal of constant increment angle. Then, the power spectrum (PS) of characteristic orders, as the fault feature vectors, is extracted and normalized from the denoising signal. Last, in order to diagnose the faults of the gearbox automatically, ELM, provided better generalization performance at a much faster learning speed and with least human intervene, is applied to identify and classify the faults. From the result of experiment, the approach of this paper is effective to judge the fault type under variable speed conditions.
Keywords: order tracking, extreme learning machine, fault diagnosis.
1. Introduction
Gearboxes, widely used in machinery equipment, plays a significant role in industrial applications, which are mainly reflected on the largescale machinery, such as cars [1], aircraft [2], wind power generation system [3]. Thus, the gearbox damaged, will result in loss of production and raise the cost of machines’ maintenance [1]. So, it is very important to monitor the condition of the gearboxes to avoid run time breakdown of the machine. However, gearbox is a very complex mechanical system that can generate vibrations from its various elements such as gears, shafts, and bearings [4]. Even though several techniques have been proposed in the literature for feature extraction, such as, time analysis, frequency analysis, wavelet transform (WT), empirical mode decomposition (EMD) and Hilbert transformation method [58], it still remains a challenge in implementing a diagnostic tool for realworld monitoring applications because of nonstationary signals in the time domain under the complexity of machinery structures and operating conditions.
The order analysis, a method which can transform nonstationary signals in the time domain into the stationary signal in the angular domain, easily identify speedrelated vibrations. K. R. Fyfe et al. introduced two approaches to obtain signals in the angular domain, namely the hard order tracking and computed order tracking (COT), and the comparison of two methods in details is in the Ref. [9]. After that, Vold and Leuridan put forward an order tracking algorithm based on Kalman filtering, which solves interference during orders [10]. And the instantaneous frequency (IF) of a signal is also applied order analysis that it could achieve order analysis without tachometer [11]. In this study, COT is selected as the method of processing signal because of low computational cost and high precision of angular signals.
ELM, originally developed for the SLFNs, provides better generalization performance at a much faster learning speed and with least human intervene [12]. Compared with those traditional computational intelligence techniques, ELM shown its good performance in regression applications as well as in large dataset and multilabel classification applications.
This paper is organized as follows: Section 2 introduces COT and ELM, Section 3 describes the results performed to validate the method and Section 4 presents the conclusions and related future works.
2. Methodology
In the paper, a novel approach of the fault pattern recognition is presented, which contains the following steps: (1) Angular transformation of the vibration signal based on computing order tracking; (2) Feature extraction of the angular signals; and (3) Fault diagnosis based on ELM as illustrated in Fig. 1.
2.1. Computed order tracking
A method, obtaining the signal in the angular domain based on COT, depends on the rotational speed pulse (see Fig. 2). Compared with the hard order tracking, installing complex devices, COT can obtained rotational speed to install a tachometer to obtain resample signals. After resampling, the signal in angular domain, carried on the Fourier transform, then obtains the order domain. Sampling rate ${O}_{S}$ should meet the conditions of Eq. (1) because of the Nyquist sampling theorem:
where ${O}_{max}$, the maximum analysis order. And in order to ensure a full period sampling, the sampling number of each cycle should be ${2}^{n}$, $n$ equals 1, 2, 3….
Fig. 1. The flow chart of the article
Fig. 2. Rotational speed pulse signal
Angular resampling is the most important step in the realization of order tracking.
The steps of order tracking methods analysis [9]:
1) Function $n\left(t\right)$ fitted by keyphasor signal.
2) Calculating the angle function $\theta \left(t\right)=\frac{\pi}{30}{\int}_{0}^{t}n\left(t\right)$ of the reference axis.
3) Calculating the resampling time ${t}_{n}$, through the quadratic curve fitting. It is assumed that the shaft is undergoing constant angular acceleration. With this basis, the shaft angle, $\theta $, can be described by a quadratic equation of the following form:
The unknown coefficients ${b}_{0}$, ${b}_{1}$, ${b}_{2}$ are found by fitting three successive keyphasor arrival times $({t}_{1},{t}_{2},{t}_{3}$), which occur at known shaft angle increments $\mathrm{\Delta}\varphi =2\pi $
Then, this yields the three following conditions, $\theta \left({t}_{1}\right)=0$; $\theta \left({t}_{2}\right)=\mathrm{\Delta}\varphi $; $\theta \left({t}_{3}\right)=2\mathrm{\Delta}\varphi $:
In the formula, ${t}_{n}$ is the time when the angle of ${\theta}_{k}$ is turned around.
4) Calculating the vibration data ${x}^{\mathrm{\text{'}}}\left(\mathrm{}{t}_{n}\right)$ of the equivalent angle by interpolation of the original vibration signal $x\left(\mathrm{}{t}_{n}\right)$, and interpolating functions are as follows:
In the formulas, ${t}_{s}$ denotes the interval of sampling in the time domain, the value of $k$ makes that $k{t}_{s}$ and ${t}_{n}$ are the closest, ${h}_{s}$ is the interpolation filter, ${h}_{s}\left(t\right)$ is the interpolation function.
2.2. Extreme learning machine
Singlehidden layer feedforward neural networks (SLFNs) algorithm, the structure shown in Fig. 3, is the basis of ELM. Compared with the traditional SLFNs algorithm, ELM is a novel algorithm of SLFN, which has lots of advantages. Obviously, ELM could randomly produce weight between hidden layer and input layer and the thresholds of hidden layer nodes are also stochastically. Furthermore, while training the network, ELM can avoid to adjust weights or thresholds. Simply speaking, the algorithm only needs to set the number of hidden layer neurons to get the unique optimal solution. And learning pace of ELM is very quick and it shows good generalization performance. As follow, the Eq. (7) is the mathematical model of ELM [12, 13]:
In the Eq. (7), ${g}_{i}$ expresses the output function ${G}_{i}\left({a}_{i},\mathrm{}{b}_{i},\mathrm{}x\right)$ of the $i$th node which belongs to the hidden layer.
Fig. 3. Singlehidden layer feedforward network
For a neural network, the activation function is very important. For ELM, the activation functions of the hidden layer usually have three types:
In [12, 13], the authors made a statement of advantages and explain that three types of activation functions show good performance. And in this paper, Eq. (8) is selected as the activation function. The steps of ELM algorithm as follows:
1) Determine the number $L$ of neurons in the hidden layer and activation function $g\left(x\right)$, and randomly assign ${a}_{i}$, ${b}_{i}$, ${\beta}_{i}$.
2) Calculate the output vector of $i$th.
3) Calculate the output weight $\widehat{\beta}$.
3. Results
3.1. Experimental facilities
The source of the data in this paper comes from the 2009 PHM Conference Data Analysis Competition. The data is about gearbox made of 560 samples which are constituted two types of gearboxes and fourteen kinds of fault modes. And each of the working conditions has different shaft speeds 30, 35, 40, 45, and 50 Hz and four kinds of loads. And the author chooses four typical fault modes (see Table 1) to verify the feasibility of the new algorithm proposed [4].
Table 1. Fault modes chosen of the samples
Case

Normal

Fault 1

Fault 2

Fault 3


Gear

32T

Good

Chipped

Good

Good

96T

Good

Good

Good

Good


48T

Good

Eccentric

Eccentric

Eccentric


80T

Good

Good

Good

Broken


Bearing

IS:IS

Good

Good

Good

Ball

ID:IS

Good

Good

Good

Good


OS:IS

Good

Good

Good

Good


IS:OS

Good

Good

Good

Good


ID:OS

Good

Good

Good

Good


OS:OS

Good

Good

Good

Good


Shaft

Input

Good

Good

Good

Good

Output

Good

Good

Good

Good

3.2. Order tracking and feature extraction
1) Data resampling. In order to reduce system error of the signal, which could be caused by the sensor, signal preprocessing of eliminating trend was implemented. Then COT was used to transform the nonstationary vibration signal of time domain into the stationary signal of angular domain, this paper uses a method named computed order tracking.
Autocorrelation Coefficient (ACF) [14] is calculated to compare the stability of signals both time domain and angular domain. Autocorrelation Function, effective method to inspect the stability of time series while vibration signal is a kind of time series.
From the Fig. 4, the details of ACFs presented, we can see that the convergence rate of the ACF of signal in the angular domain are faster than the other, which verify the angular signal is more stable than the time signal.
2) Characteristic Parameters Extraction. In this section, characteristic vector was extracted. First, Fast Fourier transform is applied to the angular resampling signal, thus the order power spectrum can be obtained (see Fig. 5). There was six feature order selected as follows: ${W}_{1}$ represents the order power spectrum of order 32; ${W}_{2}$ represents the order power spectrum of order 64; ${W}_{3}$ represents the order power spectrum of order 96; ${W}_{4}$ represents the order power spectrum (PS) of order 128; ${W}_{5}$ represents the order power spectrum of order 48; ${W}_{6}$ represents the order power spectrum of order 80. Then the characteristic parameters of each characteristic order are ${W}_{j}$, $j$ equals 1, 2, 3, 4, 5, 6.
Because of different energy distributions under different operating conditions, it is feasible to use the energy percentage of feature orders as the feature vector. Then the feature parameters are normalized:
$w$, sum of the characteristic parameters ${W}_{j}$, that means $W=\sum _{1}^{6}{W}_{j}$, $j$ equals 1, 2, 3, 4, 5, 6. ${\overline{W}}_{j}$, the normalized parameters of ${W}_{j}$. $T$, the feature vector after normalization processing.
Fig. 4. The ACFs of different fault modes
a) ACFs in the time domain
b) ACFs in the angular domain
Fig. 5. Order power spectrum under different conditions
a) Normal
b) Fault 1
c) Fault 2
d) Fault 3
3) Fault diagnosis based on ELM.
There are 60 samples in each case. 50 samples are used as training samples for the ELM and the rest samples are used as testing samples for the ELM. By the way of cross validation, the classification performance is showed. The final classification accuracy is the average of the results of the 300 ELM’s classification (see Table 2).
Based on the COTPSELM approach, the fault detection results under different loads are studied. The result of fault diagnosis shows on Table 2. From the Table 2, the COTPSELM method shows the accuracies of total are 0.9908, 0.9803, 0.9147, 0.9420 and all of them are greater than 90 %. It is also found that the accuracies of low loads are higher than high loads. Thus, it can be seen COTPSELM is more suitable for low load conditions.
Table 2. Accuracy of classification based on COTPSELM
Load

Total

Normal

Fault 1

Fault 2

Fault 3

Low 1

0.9908

0.9880

0.9850

0.9967

0.9933

Low 2

0.9803

0.9927

0.9777

0.9640

0.9870

High 1

0.9147

0.9430

0.9290

0.8643

0.9227

High 2

0.9420

0.9560

0.9243

0.9417

0.9460

4. Conclusions
In this paper, it has been described a new method, COTPSELM, which could be effectiveness to faults diagnosis of gearbox. The COTPSELM scheme is shown to be not sensitive to a particular speed; that is, fault detected with COTPSELM could reduce the effect of different speed operations. Our subsequent work focuses on improving the accuracy of faults diagnosis under the high load conditions.
Acknowledgements
The authors declare that there is no any potential conflict of interests in the research. This study was supported by the Fundamental Research Funds for the Central Universities (Grant No. YWF16BJJ18) and the National Natural Science Foundation of China (Grant Nos. 51575021 and 51605014), as well as the Technology Foundation Program of National Defense (Grant No. Z132013B002).
References
 Praveenkumara T., Saimuruganb M., Krishnakumarb P. Fault diagnosis of automobile gearbox based on machine learning techniques. Procedia Engineering, Vol. 97, 2014, p. 20922098. [Search CrossRef]
 Faris E., David M., Cristobal R.C. A comparative study of adaptive filters in detecting a naturally degraded bearing within a gearbox. Case Studies in Mechanical Systems and Signal Processing, Vol. 3, 2016, p. 18. [Search CrossRef]
 Ragheb A., Ragheb M. Wind turbine gearbox technologies. 1st International on Nuclear and Renewable Energy Conference (INREC), 2010, p. 18. [Search CrossRef]
 Wu Fangji, Lee Jay Information reconstruction method for improved clustering and diagnosis of generic gearbox signals. International Journal of Prognostics and Health Management, Vol. 2, Issue 1, 2011, p. 14958. [Search CrossRef]
 Kar C., Mohanty A. R. Monitoring gear vibrations through motor current signature analysis and wavelet transform. Mechanical Systems and Signal Processing, Vol. 20, Issue 1, 2006, p. 158187. [Search CrossRef]
 Cheng J., Yu D., Tang J., Yang Y. Application of frequency family separation method based upon EMD and local Hilbert energy spectrum method to gear fault diagnosis. Mechanism and Machine Theory, Vol. 43, Issue 6, 2008, p. 712723. [Search CrossRef]
 Ricci R., Pennacchi P. Diagnostics of gear faults based on EMD and automatic selection of intrinsic mode functions. Mechanical Systems and Signal Processing, Vol. 25, Issue 3, 2011, p. 821838. [Search CrossRef]
 Yang Y., He Y., Cheng J., Yu D. A gear fault diagnosis using Hilbert spectrum based on MODWPT and a comparison with EMD approach. Measurement, Vol. 42, Issue 4, 2009, p. 542551. [Search CrossRef]
 Fyfe K. R., Munck E. D. S. Analysis of computed order tracking. Mechanical Systems and Signal Processing, Vol. 11, Issue 2, 1997, p. 187205. [Search CrossRef]
 Vold H., Leuridan J. High resolution order tracking at extreme slew rates, using Kalman tracking filters. SAE Technical Paper No. 931288, 1993. [Search CrossRef]
 Blough J. R. Development analysis of time variant discrete Fourier transform order tracking. Mechanical Systems and Signal Processing, Vol. 17, Issue 6, 2003, p. 11851199. [Search CrossRef]
 Huang G. B., Wang D. H., Lan Y. Extreme learning machines: a survey. International Journal of Machine Learning and Cybernetics, Vol. 2, Issue 2, 2011, p. 107122. [Search CrossRef]
 Tian Y., Ma J., Lu C., Wang Z. Rolling bearing fault diagnosis under variable conditions using LMDSVD and extreme learning machine. Mechanism ad Machine Theory, Vol. 90, 2015, p. 175186. [Search CrossRef]
 Ao S. I. Applied Time Series Analysis and Innovative Computing. Lecture Notes in Electrical Engineering, Springer Netherlands, Vol. 59, 2010. [Search CrossRef]