Deep convolutional neural networks for bearings failure prediction and temperature correlation

Rolling element bearings (REBs) are among the most sensitive components and the most common failure units in mechanical equipment. Bearing failure prognostics, which aims to meet the increasing requirements for higher reliability while reducing unnecessary costs, has been an area of extensive research. Accurate prediction of the bearing Remaining Useful Life (RUL) is indispensable for safe and lifetime-optimized operation. To monitor this vital component and plan repair work, a new intelligent method based on Wavelet Packet Decomposition (WPD) and deep learning networks is proposed in this paper. Firstly, features extracted by WPD are used as input data. Secondly, the selected features are fed into deep Convolutional Neural Networks (CNNs) to construct the Health Indicator (HI). This study focuses on analysing relationships, such as correlations, between the HI and temperature. We develop a solution for the Connectiomics contest dataset of bearings under different operating conditions and severity of defects. The performance of the proposed method is verified on four bearing data sets collected from the experimental setup called “PRONOSTIA”. The results show that the health indicator obtains fairly high monotonicity and correlation values and that it is beneficial to bearing life prediction. In addition, it is experimentally demonstrated that the proposed method achieves better performance than a traditional neural network based method.


Introduction
The performance degradation assessment of bearings plays an important role in fault handling for various rotating machines and in guaranteeing reliability in industrial processes [1][2][3][4]. Recently, bearing prognostics has come to be seen as a pattern recognition problem suited to artificial intelligence methods, and several studies have developed techniques for machine health monitoring [5][6][7].
Cerrada et al. [8] give a summary of PHM tools for bearing severity evaluation, covering possible failure modes and their characteristics, the data types commonly available from different sensors, and the different features and algorithms applied in prognostics and health management design.
Machine learning algorithms used in health assessment and RUL estimation depend on the features extracted. Recent developments in prognostics have focused on applying advanced signal processing techniques to extract robust features for constructing the health indicator [9]. Features extracted from different signal representations, such as temporal, spectral and time-frequency representations [10,11], in general do not have good monotonicity, which limits their usefulness in fault severity evaluation. The features extracted from sensor signals contain information about the health state of the bearing. In order to produce robust and significant features, diverse original features need to be studied to show the effectiveness of the proposed approach.
Several studies have examined the feasibility of artificial neural networks (ANNs) for health assessment and RUL estimation. However, their accuracy is highly dependent on the network structure, such as the number of hidden layers, the number of nodes and the kernel function. Ben Ali et al. [12] proposed an intelligent data-driven prognostic method that combines neural networks with the Weibull distribution. Rai et al. [13] combined the neural network approach with a wavelet-based denoising method for RUL assessment. To improve on the traditional neural network, CNNs are used in this study to learn the features. Yuan et al. [14] proposed a scheme based on Long Short-Term Memory (LSTM) neural networks for RUL estimation of aero-engines, which achieves good diagnosis and prediction performance in cases of complicated operations, hybrid faults and strong noise. However, an effective approach based on historical data, such as deep learning, still needs to be developed.
Recently, several learning methods known as deep learning have emerged that learn higher-level abstractions from raw data [15][16][17]; deep learning models automatically learn a feature representation from the raw signal. CNNs, auto-encoders and deep belief networks are the best-known deep learning models, and they have been applied in many research areas, such as speech recognition [18], image processing [19,20], and machinery condition monitoring and health assessment [21][22][23][24]. Li et al. [25] proposed a novel deep convolutional neural network-based method for remaining useful life prediction. The aim of that study is to estimate the RUL of aero-engine units accurately. Good prognostic performance is achieved with their approach using raw feature selection, data pre-processing and sample preparation with a time window.
In this paper, we propose a model for health assessment and RUL estimation which integrates a WPD algorithm for fast feature extraction, a selection of node energy features at each wavelet decomposition level, and an intelligent analysis method based on CNNs to obtain the health assessment result for the bearings. An experimental verification is given using data from the PRONOSTIA test rig [26]. The obtained model is then applied to the acceleration signals collected from the bearing degradation test experiment.
This paper starts with a description of prognostics and health management along with the proposed model in Section 2. A brief introduction to WPD for feature extraction is presented in Section 3, and deep CNNs for bearing health assessment in Section 4. The proposed method is experimentally validated using the bearing dataset in Section 5, where we demonstrate the effectiveness of the proposed approach. We close the paper with conclusions in Section 6.

The proposed model
PHM is a new paradigm of condition-based maintenance for improving system safety and reliability by monitoring the condition of the facility, with the aim of maximizing operational availability and reducing maintenance costs. In general, PHM methods can be grouped into three main categories: data-driven approaches, model-based approaches and statistical approaches [4,27,2]. PHM technologies have evolved rapidly in recent years, driven by the different conditions and requirements of the applications and by the research and development involved. Fig. 1 shows several methods and technologies that can be regarded as steps towards prognostics and the identification of maintenance needs, supporting decision making and the management of operational reliability. The lean model for performance assessment generally consists of three main aspects: health indicator construction, RUL estimation, and health management [2].
Fig. 1 illustrates the overall structure of a typical bearing PHM system. It consists of the essential modules, namely sensing, preprocessing, feature identification, and PHM. Selecting features for identifying bearing failure is a challenging and important step in PHM and has inspired great research interest. The PHM module of a bearing system in general starts with anomaly detection, followed by fault diagnosis and prognosis, and ends with decision-making.
The failure threshold is set using the international standards (ISO 13381-1, ISO 10816 and ISO 7919). The ISO standards set limits on the vibration signal energy (the root mean square RMS

Principle of WPD for feature extraction
In the proposed approach, the raw signals are first processed by a fast, memory-saving algorithm, Wavelet Packet Decomposition (WPD), and the node energy at each level is calculated. WPD is a natural extension of the Multi Resolution Analysis (MRA) technique [29]. WPD decomposes the signal into both low-frequency and high-frequency components (Fig. 3). This flexibility provides abundant information for extracting features that combine non-stationary and stationary characteristics [30]. The multi-level filtering, or decomposition, process is shown in Fig. 3.

Fig. 3. Structure of two-layer wavelet packet decomposition
The discrete signal is convolved with a low pass filter (g) and a high pass filter (h), resulting in two vectors, called the approximation coefficients and the detail coefficients.
The decomposition process can be repeated on the approximation vector and successively on every new approximation vector. This concept is represented by a wavelet tree whose number of levels is the number of iterations of the basic step. In Fig. 3 the level of decomposition is equal to 2.
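The filtering step described above, together with the node energy features used later, can be sketched in a few lines. This is only an illustration under stated assumptions: it uses the Haar filter pair instead of db6 to stay dependency-light, the random signal stands in for a vibration record, the nodes are returned in natural (not frequency) order, and the function names `haar_step` and `wpd_node_energies` are hypothetical.

```python
import numpy as np

def haar_step(x):
    """Split x into approximation and detail halves with the Haar filter pair."""
    x = x[: len(x) // 2 * 2]              # drop a trailing sample if the length is odd
    even, odd = x[0::2], x[1::2]
    approx = (even + odd) / np.sqrt(2.0)  # low-pass branch, downsampled by 2
    detail = (even - odd) / np.sqrt(2.0)  # high-pass branch, downsampled by 2
    return approx, detail

def wpd_node_energies(signal, level):
    """Return the energy of every wavelet packet node at the given level."""
    nodes = [np.asarray(signal, dtype=float)]
    for _ in range(level):
        # In a wavelet packet tree both branches are split again at each level.
        nodes = [child for node in nodes for child in haar_step(node)]
    return np.array([np.sum(node ** 2) for node in nodes])

rng = np.random.default_rng(0)
signal = rng.standard_normal(256)         # synthetic stand-in for one vibration record
energies = wpd_node_energies(signal, level=2)
```

At level 2 this yields four node energies, matching the two-layer tree of Fig. 3. Because the Haar pair is orthonormal and the length is a power of two, the node energies sum to the energy of the input signal.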
In the current contribution, the original features extracted from the raw signals related to bearing degradation are shown in Table 1. Fig. 6 shows a few points that are helpful to consider when looking at the multitude of time-frequency transforms. It is helpful to go over a couple of these together with other statistical parameters, because they are heavily used in condition monitoring. Increasing the sample rate does not add information to the frequency peaks of interest. The evolution of the node energy coefficients at each level, shown in Fig. 7, has the same length as the raw signals after decomposition using the db6 wavelet. The WPD coefficients retain more fault severity information, and hence more distinguishing statistical features can be extracted from them for RUL estimation.
Fig. 8 shows the temperature monitored on the surface of the bearing housing with the temperature probe (PT100) used in this paper. Under different operating conditions of load and speed, the temperature of a bearing is monitored for changes that can indicate a defect in the bearing elements. The experiments in this study show that temperature is a better indicator of load, speed or lubrication than of bearing condition.
Temperature monitoring techniques may be helpful in preventing machine breakdown. However, bearing defects have not been found to cause an appreciable increase in temperature until the damage has reached a severe state.
Fig. 8 also shows that temperature monitoring for bearing prognostics becomes far more complicated when the bearing is subject to changeable operating conditions. In most situations, varying operating conditions refer to variable loading conditions, since loading is the major contributor to the energy of the measured vibration signal.

CNNs are a type of feedforward neural network composed of alternating convolutional and subsampling layers [32,33]. CNNs are designed to use minimal amounts of preprocessing, which is the main difference compared to other deep architectures.

Firstly, we assume that the input sequential data is $x = [x_1, \ldots, x_T]$, where $T$ is the length of the sequence and $x_i \in \mathbb{R}^d$ at each time step.

Convolution: the dot product between a filter vector $u \in \mathbb{R}^{md}$ and a concatenation vector representation $x_{i:i+m-1}$ defines the convolution operation as follows:

$$c_i = \varphi\left(u^{T} x_{i:i+m-1} + b\right) \tag{1}$$

where $b$ and $\varphi$ denote the bias term and the non-linear activation function, respectively. $x_{i:i+m-1}$ is an $m$-length window starting from the $i$-th time step, which is described as:

$$x_{i:i+m-1} = x_i \oplus x_{i+1} \oplus \cdots \oplus x_{i+m-1} \tag{2}$$

With the window defined in Eq. (2), the output scalar $c_i$ can be regarded as the activation of the filter $u$ on the corresponding subsequence $x_{i:i+m-1}$. By sliding the filtering window from the first time step to the last, a feature map can be obtained as a vector:

$$c_j = [c_1, c_2, \ldots, c_{T-m+1}] \tag{3}$$

where the index $j$ represents the $j$-th filter. It corresponds to the multi-windows $\{x_{1:m}, x_{2:m+1}, \ldots, x_{T-m+1:T}\}$.

Max-pooling: this operation reduces the length of the feature map, which can minimize the number of model parameters. The hyper-parameter of the pooling layer is the pooling length, denoted as $s$. The MAX operation takes the maximum over $s$ consecutive values in the feature map $c_j$.
Then, the compressed feature vector can be obtained as:

$$h = [h_1, h_2, \ldots, h_{\lfloor (T-m+1)/s \rfloor}] \tag{4}$$

where $h_j = \max\left(c_{(j-1)s+1}, c_{(j-1)s+2}, \ldots, c_{js}\right)$. After the alternating convolution and max-pooling layers, fully connected layers and a softmax layer are usually added as the top layers to make predictions.
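The convolution and max-pooling operations above can be illustrated with a short NumPy sketch. It is a minimal example under stated assumptions, not the network used in the experiments: there is a single filter, the input values and filter weights are invented for illustration, indices are 0-based, and the helper names `conv1d_feature_map` and `max_pool` are hypothetical.

```python
import numpy as np

def conv1d_feature_map(x, u, b, phi=np.tanh):
    """Compute c_i = phi(u^T x_{i:i+m-1} + b) for every window of m time steps."""
    T, d = x.shape
    m = u.size // d                               # window length implied by the filter size
    # Each window concatenates m consecutive d-dimensional steps into one vector.
    windows = [x[i:i + m].reshape(-1) for i in range(T - m + 1)]
    return np.array([phi(u @ w + b) for w in windows])

def max_pool(c, s):
    """Take the maximum over non-overlapping groups of s consecutive values."""
    n = len(c) // s                               # incomplete trailing group is dropped
    return c[: n * s].reshape(n, s).max(axis=1)

x = np.arange(10, dtype=float).reshape(10, 1) / 10.0   # toy input: T = 10, d = 1
u = np.array([1.0, -1.0, 0.5])                         # toy filter: m = 3
c = conv1d_feature_map(x, u, b=0.0)                    # feature map of length T - m + 1
h = max_pool(c, s=2)                                   # compressed feature vector
```

With $T = 10$, $m = 3$ and $s = 2$, the feature map has 8 entries and the pooled vector has 4. The `tanh` activation is used as the default here because the layers of the proposed network use it.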
The diagram of the proposed method for bearing health assessment and RUL estimation is shown in Fig. 10. The method is divided into two main phases. In the first phase, the data are preprocessed and the deep CNNs are trained: the bearing dataset is prepared for training by computing the features, and the training and testing datasets are then prepared for the deep CNNs. All the layers use the tanh activation function. The second phase, which is carried out on-line, uses the generated model continuously to assess the health state of the bearing and to predict the RUL.

Experimental system
An accelerated bearing life test platform called PRONOSTIA [26] (Fig. 11) is used in this section to verify the prognostic capability of the proposed method. PRONOSTIA is a laboratory experimental platform dedicated to testing, verifying and validating methods for bearing health assessment, diagnostics and prognostics. In the remainder of the paper, a set of four experiments on four degraded bearings is used. The four data sets, for different loads and motor speeds, are shown in Table 2. The PRONOSTIA experimental setup is composed of two main parts: the first part handles speed control and the second part generates the load profiles. The speed control part is composed of an electric motor, a shaft and a set of bearings. The power developed by the motor is 1.2 kW and its speed varies between 0 and 6000 rpm. The second part of PRONOSTIA contains a hydraulic jack connected to a lever arm, used to create different loads on the tested bearing mounted on the platform.
A pair of ball bearings is mounted on one end of the shaft to serve as the guide bearings, and an NSK6307DU roller ball bearing is mounted on the other end to serve as the test bearing. The movement is transmitted between the motor and the shaft drive by a rub belt. Two accelerometers (DYTRAN3035B) are mounted horizontally and vertically on the housing of the tested bearing to pick up the horizontal and vertical accelerations (Table 3). In addition, the monitoring system includes one temperature probe and a torque sensor (Fig. 11). The sensors are connected to a data acquisition card.
The data acquisition software is programmed using a LabView interface. Each record is stored in a matrix format in which the following parameters are defined: the time, the horizontal acceleration, the vertical acceleration, the temperature, the speed and the torque. With this experimental platform, several types of profile can be created by varying the operating conditions (speed and load). The behavior of the bearing is captured during its whole degradation process by the dedicated sensors. The characteristics of the tested bearing are shown in Table 4.

Results and discussion
The proposed fault prognostics method based on CNNs and WPD is applied in order to monitor REBs under different working conditions. The initial operator  and  with  = 4 and  = 4 are calculated. In this experiment, each signal was decomposed to level 6, and eight subband coefficients were then obtained. The PRONOSTIA experimental setup is shown in Fig. 11. A working cycle of vibration data is saved for each load and motor speed, which includes all the data for each REB. The classification errors and accuracies of the REBs on the PRONOSTIA experimental setup were calculated and are reported in Table 5. Classification results were also obtained using the features extracted from the time domain (Fig. 4 and 5) and from the time-frequency domain (Fig. 6). From Table 5, we can see a considerable improvement in the classification accuracies compared to the results obtained using the WPD based fault prognostics method with different loading of the classifier. Another popular metric for performance evaluation is the Root Mean Square Error (RMSE). The RMSE of the RUL estimation is employed as a performance measure, as shown in Fig. 13.
From the above four experiments, we can see that the classification accuracies obtained using CNNs and WPD are higher than those obtained using the fault prognostics method based on WPT and statistical parameters [34].
The regression results are presented in Table 5 in terms of the coefficient of determination $R^2$ for the different training models. The $R^2$ values, which indicate the fraction of the total variance that can be explained by the model, are very high. From the results, it is seen that all the predictors perform very well. The objective is to apply the best power fit to the degradation model obtained by Eq. (6), in which the time at which the fault occurs and the inverse of the health indicator are used to obtain the current cycle or time. These results are validated in Table 5 by computing the sum of squared errors (SSE) and the RMSE. The RUL estimate is the distance between the current time and the time at which the regression model given in Eq. (5) reaches the failure threshold. The threshold, or acceptable limit of the vibration magnitude [28], for each bearing degradation corresponds to the end of each experiment. The power fit of the smoothed health indicator is shown in Fig. 12.
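The idea of fitting a power curve to the health indicator and reading the RUL off the fitted curve can be sketched as follows. The sketch assumes a two-parameter power law $HI(t) = a\,t^{b}$, which may differ from the exact model used in the paper. The data are synthetic and noise-free, the failure threshold is hypothetical, and the function names are chosen only for illustration.

```python
import numpy as np

def fit_power_law(t, hi):
    """Least-squares fit of hi = a * t**b in log-log space (t and hi must be > 0)."""
    b, log_a = np.polyfit(np.log(t), np.log(hi), 1)
    return float(np.exp(log_a)), float(b)

def time_at_level(level, a, b):
    """Invert the fitted curve: time at which the indicator reaches `level`."""
    return (level / a) ** (1.0 / b)

def rmse(y_true, y_pred):
    """Root mean square error between two sequences of equal length."""
    diff = np.asarray(y_true, dtype=float) - np.asarray(y_pred, dtype=float)
    return float(np.sqrt(np.mean(diff ** 2)))

t = np.arange(1.0, 101.0)                 # observed cycles so far (synthetic)
hi = 0.02 * t ** 1.5                      # synthetic, noise-free health indicator
a, b = fit_power_law(t, hi)

threshold = 0.02 * 150.0 ** 1.5           # hypothetical failure threshold
t_fail = time_at_level(threshold, a, b)   # time at which the fit crosses the threshold
rul = t_fail - t[-1]                      # remaining useful life at the last observation
fit_error = rmse(hi, a * t ** b)          # goodness of fit on the observed data
```

Fitting in log-log space turns the power law into a straight line, so an ordinary linear least-squares fit is enough. With noisy data, a non-linear fit on the original scale may weight the late, high-amplitude part of the curve more appropriately.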

Temperature correlation
The correlation between temperature and the health indicator is shown in Fig. 14. More advanced prognostics focuses on performance degradation assessment, so that failures can be predicted and prevented using the temperature and vibration signals, which in this study are processed by CNNs for regression. The correlation between the two indicators for accurately assessing bearing performance degradation is therefore a critical step toward realizing an online condition monitoring platform. The results of its application to performance degradation assessment show that this indicator can effectively reflect the degradation process.
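As an illustration of how the agreement between a vibration-based health indicator and a temperature trend can be quantified, the sketch below computes the Pearson correlation coefficient and a simple monotonicity score. Both series are synthetic, and the monotonicity definition used here (the absolute difference between the numbers of positive and negative increments, divided by the number of increments) is a common choice assumed for this example rather than one stated in the paper.

```python
import numpy as np

def pearson(x, y):
    """Pearson correlation coefficient between two equal-length series."""
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    xc, yc = x - x.mean(), y - y.mean()
    return float(np.sum(xc * yc) / np.sqrt(np.sum(xc ** 2) * np.sum(yc ** 2)))

def monotonicity(x):
    """|#positive increments - #negative increments| / #increments, in [0, 1]."""
    d = np.diff(np.asarray(x, dtype=float))
    return float(abs(int(np.sum(d > 0)) - int(np.sum(d < 0))) / len(d))

steps = np.linspace(0.0, 1.0, 50)
hi = steps ** 2                        # synthetic vibration-based health indicator
temperature = 40.0 + 15.0 * steps      # synthetic housing temperature trend

r = pearson(hi, temperature)           # linear agreement between the two indicators
mono_hi = monotonicity(hi)             # 1.0 for a strictly increasing series
mono_temp = monotonicity(temperature)
```

A score of 1 means the series only ever moves in one direction, while a score near 0 means increases and decreases are balanced. Smoothing the indicator before computing the score is usually advisable on measured data, since noise alone can drive the score down.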
It has been shown that the health indicator obtained from vibration measurements has the same monotonicity as the temperature measurements from the PRONOSTIA experimental setup.

Dalila Belmiloud and Tarak Benkedjouh worked out almost all of the technical details and performed the numerical implementation of the suggested experiment. Dalila Belmiloud, Tarak Benkedjouh, Mohamed Lachi and Ali Laggoun contributed to the design and implementation of the research, including the comparison with other methods, the future scope of the work and the limitations of the proposed solution in practice, to the analysis of the results and to the writing of the manuscript.

Conclusions
In this paper, a novel fault prognostics method for REBs based on WPD and CNNs has been proposed. The model provides an effective RUL prediction method that addresses multiple challenges in complex system prognostics where many parameters are unknown. In order to investigate its effectiveness for practical applications, the proposed method was tested using the PRONOSTIA experimental setup. The test results show that the proposed method can achieve a higher classification accuracy than the wavelet packet transform based prognostics method. Despite the good experimental results obtained, further architecture optimization is still necessary. Deep learning methods generally suffer from a high computing load, and this will be addressed in further research. In this study, temperature and vibration analysis are the two dominant condition monitoring techniques applied to bearing failure prognostics. The temperature HI reliably revealed the changes in the bearing condition, rendering the diagnosis procedure less complicated. To evaluate the proposed algorithm, we examined its performance on four bearing data sets. The experimental results on these data sets demonstrate the superiority of the proposed CNN model over other fault prognostics methods. The proposed approach achieves a high degree of accuracy.

Fig. 1. The flowchart of the lean model for performance assessment

Fig. 9. Illustrations of the proposed CNNs for bearing prognostics

Deep learning algorithms are machine learning techniques based on distributed representations. Using structures composed of multiple non-linear transformations, deep learning attempts to learn high-level features in data. The frequently used models are CNNs and Deep Belief Networks (DBNs).

Fig. 10. Flow chart of the proposed method

Fig. 12. Health indicator for the tested bearings

Fig. 13. Comparison between scoring function and error value