Remaining useful life (RUL) prediction of bearing by using regression model and principal component analysis (PCA) technique

Apakrita Tayade1 , Sangram Patil2 , Vikas Phalle3 , Faruk Kazi4 , Satvasheel Powar5

1, 5Indian Institute of Technology, Mandi, India

2, 3, 4Veermata Jijabai Technological Institute, Mumbai, India

1Corresponding author

Vibroengineering PROCEDIA, Vol. 23, 2019, p. 30-36.
Received 1 March 2019; accepted 12 March 2019; published 25 April 2019

Copyright © 2019 Apakrita Tayade, et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
Creative Commons License

A wind turbine works under variable load and environmental conditions because of which failure rate has been on the rise. Failure of a gearbox, an integral part of producing wind energy, contributes to 80 % of the total downtime for the wind turbine. For ensuring better utilization of the wind turbines, Fault prognosis and condition monitoring of bearings are of utmost importance as it helps to reduce the downtime by early detection of faults which further increases the power output. In this paper, vibration signals produced and machine learning approach to determine the Remaining Useful Life (RUL) for a degraded bearing is studied. The methodology includes statistical feature extraction analysis with regression models. Further the feature selection is done using Principal Component Analysis (PCA) technique which produces training and testing sets which acts as an input parameter for regression models such as Support Vector Regressor (SVR) and Random Forest (RF). Weibull Hazard Rate Function is used for calculating the RUL of the bearing. Results This study shows the potential application of regression model as an effective tool for degradation performance prediction of bearing.

Keywords: remaining useful life, bearings, prognosis, random forest regression, support vector regression.

1. Introduction

With the growing impact of climate change Renewable energy remains the only viable option to save the motherland. With India’s aim to achieve 175GW of installed capacity of renewable energy by 2020 wind energy provides one of the most potential sector. India has the 4th largest installed capacity of wind turbines in the world. We still lack to achieve to our full potentials due to unresolved failures like that of failure of a gearbox, an integral part of producing wind energy. Power output can be increased by early detection of faults by fault prognosis and condition monitoring. Prognostics forecast the performance of an element surveying the degree of deviation or degradation of a system from its typical operating conditions. The predicted time is known as the Remaining Useful Life (RUL), with accurate RUL reduction in the inspection as well as maintenance cost is observed, which further contributes in expanding the general proficiency of the plant. Predicting an approaching failure and estimating the RUL of a bearing is necessary for coming up with support and sidestepping sudden shutdowns of basic frameworks. This paper exhibits a hybrid method for prognosis of bearing which makes used of regression primarily based adaptive predictive models to gain proficiency with the advancing pattern tendency in a bearing’s health indicator. These models are then used to project forward in time and estimate the RUL of a bearing [1].

Prognostics is mainly distributed as: model-based prognostics and data-driven prognostics. Model-based prognostics attempts to incorporate physical modeling at material level while mathematical modeling at the system level with different system variables used into the estimation of RUL. Systems are complex, which arises the need for highly skilled labors, hence making it time-consuming and labor- intensive method. Whereas data-driven prognostics concentrates on the available system monitoring data. Here, failure prognosis includes prognostication of system degradation and time-to-failure supported on “state awareness” gathered from monitored information [2]. The machine learning (ML) market is rising speedily in light of the Internet insurgency and deployment of ML improves the speed and exactness of capacities performed by the framework. A ton of research work is carried out on foreseeing the RUL of bearing utilizing a machine learning approach. However, the outcome appeared by the majority of this paper isn't remarkable. Developing interest for artificial intelligence and outstanding advancements in its improvement has encouraged plenty of researchers to use this approach in the prediction of bearing RUL. Traditional methods like Support Vector Machines (SVM) [3, 4] and Principal Component Analysis (PCA) [5, 6]. Nonetheless, SVM is sensitive to feature scaling and tend to overfit for big datasets. The prediction of feature scaled dataset has low accuracy due to limited available range in our case. Researchers have also used various forms of ANN for bearing RUL prediction [7, 8]. The ensemble of BP-ANN used by Zhang et al. has also been verified through experimental data [9]. Ensembles are widely utilized in the domain of reliability in the past as they turn out to be vastly improved than alternate models. Stacking ensemble of ANN and Gradient Boosted Trees (GBT) work better than CNN, SVM, GBT, and MLP, as shown by Sandip et al. in his work on prognostics [10].

In this paper, Dataset utilized for the investigation is taken by IEEE PHM Data Challenge 2012 for FEMTO bearing informational collection [11]. The methodology contains extraction of the statistical features in X and Y direction, features ranking is done utilizing a PCA approach which is furthermore used to make distinctive datasets. The appropriate scoring function is taken to figure the score of models using error between actual RULs and values predicted for test bearings.

2. Theory

2.1. Support vector regression

The Support Vector Regression (SVR) utilizes similar principles because of the SVM for classification, with just a couple of minor contrasts. They depended on a process the loss function that overlooks errors, that settled within the certain distance of the true value. This function is known as ‘epsilon intensive’ loss function. The factors measure the expense of the errors on the input training points. The loss function is applied to correct errors which are more prominent than the threshold –ε. Corresponding loss functions leads to the distributed illustration of the decision rule, giving significant algorithmic as well as illustrative preferences [12].

In SVR, the given input X is initially mapped onto an m-dimensional feature space using some fixed (nonlinear) mapping, and then a linear model is built in this feature space. Utilizing mathematical code, the linear model fx,w is given by:

f x , w = j = 1 m w j g j x + b ,

where gjx, j= 1, …, m defines a set of nonlinear transformations, where b is the “bias” term. Often the data are assumed to have a mean value equal to zero, that the “b” is dropped.

2.2. Random forest

The Random Forest (RF) is a standout amongst the best machine learning models for predictive analytics, creating its associate degree industrial workhorse for machine learning.

The RF modeling is a kind of additive model that forecasts by joining decisions from a sequence of base models. This model is written as per the equation:

g x = f 0 x + f 1 x + f 2 x + ,

where the last model g is the summation of simple base models fi. Here, every base classifier is a straightforward decision tree [13]. In RF, basic models are built independently utilizing an alternate subsample of the data. The RF model is extremely great at taking care of tabular dataset with numerical features or categorical features with less than many classes. In contrast to linear models, RF can catch non-linear activities amongst the features and the objective. One imperative argument is that tree-based models are not intended to deal with widely sparse features.

3. Experimentation

The dataset used for the analysis is taken by IEEE PHM Data Challenge 2012 for FEMTO bearing data-set. Containing failure data of REB data obtained from a PRONOSTIA platform for 17 runs to failure. Collection of data is done by the test rig shown in the Fig. 1. Use of two accelerometers was done to gather the data. The useful life of the bearing is considered to end when the amplitude of the vibration signal reaches 20 g.

Fig. 1. Experimental set-up [11]

 Experimental set-up [11]

Table 1. Bearing data-set [11]

Condition 1
Condition 2
Condition 3
1800 rpm with 4000 N radial load
1650 rpm with 4200 N radial load
1500 rpm with 5000 N radial load
Training data (6)
Testing data (11)

3.1. Feature extraction

Raw data is pre-processed, and different time-domain features were calculated to smoothen the noisy, inconsistent and long data set.

4. Methodology

The proposed methodology in this paper is shown in Fig. 2.

STEP 1: Input data is acquired by IEEE PHM Data challenge 2012 - FEMTO bearing data set.

STEP 2: Target function – Assumption: Last cycle = Failure cycle:

%   U s e d   l i f e = 100 * x c ,       x < c , 100 ,       x = c .  

STEP3: Extraction of statistical features and doing feature ranking Fig. PCA method. Ranking of RPM and load are considered as immaterial.

STEP 4: Datasets were created using their rank which were further used to create data frames for regression analysis.

STEP 5: Similar datasets were created for the testing data and RUL is predicted for each test bearing. Error and score calculations are done for the predicted RUL.

Table 2. Statistical features

Sr. No.
Mathematical formulations
Standard deviation σa=n=1N(an-μa)2N-1
Mean (amean)=n=1NanN
Variance (σa2)=n=1N(an-μa)2N-1
Skewness (SKa)=n=1N(an-μa)3(N-1)σa3
Root mean square error (arms)=n=1N(an)2N-1
Kurtosis (Ka)=n=1N(an-μa)4(N-1)σa4

Fig. 2. Methodology used for prediction of RUL

 Methodology used for prediction of RUL

4.1. Feature ranking and feature set formation

Principal component analysis (PCA) is used to recognize the contribution of features that are most adding to the principal components. It uses a variance ratio for ranking the given features. Variance is calculated using:

σ = j λ j .

So, variance ratio (rj) for the given component j is calculated the following formula:

r j   = λ j σ .  

Using ranked features, 12 feature sets were made so that the first feature set contains the first top-ranked feature, at that point the second feature set contains the top two ranked features and so on. It enables to test the importance of features and the overall accuracy of the system.

Fig. 3. Feature ranking using PCA

 Feature ranking using PCA

5. Result

RUL is calculated for the proposed model and results obtained are discussed below. Error in the predicted and actual RUL is calculate using following formula:

%   e r r o r = 100 *   A c t u a l   R U L i   - R U L p r e d i c t e d i A c t u a l   R U L i .

The scoring function used is as follows:

A i =   e x p - ln 0.5 * E r i 5 ,       E r i 0 , e x p + ln 0.5 * E r i 20 ,       E r i > 0 .    

The overall score is calculated using:

S c o r e   = 1 11 i = 1 11 A i .    

Fig. 4. Trend of scores of feature datasets

 Trend of scores of  feature datasets

Fig. 5. Comparison between regression models for 12 features dataset

 Comparison between regression  models for 12 features dataset

In the present study SVR and RF models are employed to predict RUL of bearing. firstly, features are extracted from IEEE PHM Data Challenge 2012 bearing dataset and ranked using PCA, which are further used to form training and testing input feature sets. Fig. 4. depicts the graph of calculated scores using SVR and RF as per mathematical expression given in data sheet. For RF regression model, the score is indicating lessening pattern till 8 features dataset however it rockets up with the expansion in the features and finally both the models achieve highest score with 12th feature set and score with RF outperforms SVR. Fig. 5.

Displays comparison between actual RUL and RUL computed using the regression models of IEEE FAMTO data set. Fig. 5. demonstrates the relative investigation between the 2 models using 12 features datasets with the actual value anticipated. The score of the SVR model is 0.429582 which is most reduced while the RF model is indicating most astounding score estimation of 0.547452. Table 3 contains the values of bearing’s actual RUL and predicted RUL to calculate score. The highest score of 0.547452 is achieved with 12th feature set and RF.

Table 3. Predicted life and score calculation for a dataset with 12 features

Test bearings
Actual life
Predicted life
Percentage error
Predicted life
Percentage error

6. Conclusions

Dataset utilized for the investigation is taken by IEEE PHM Data Challenge 2012 for FEMTO bearing informational collection. The methodology contains extraction of the statistical features in X and Y direction, features are ranked using a PCA approach which is additionally used to make distinctive datasets utilizing feature ranking. The outcome demonstrates RF regression is more exact than SVR models. The best score 0.5474 is accomplished utilizing RF regression mode with 12 features dataset. The result indicates the potential use of group regression procedure for a forecast of RUL.


Authors would like to acknowledge Larsen and Toubro Infotech Ltd. funding under CSR-1step initiative.


  1. Ahmad W., Ali Khan S., Kim J. M. A hybrid prognostics technique for rolling element bearings using adaptive predictive models. IEEE Transactions on Industrial Electronics, Vol. 65, Issue 2, 2017, p. 1577-1584. [Publisher]
  2. Sankavaram C., et al. Model-based and data-driven prognosis of automotive and electronic systems. IEEE International Conference on Automation Science and Engineering, 2009. [CrossRef]
  3. Soualhi A., Medjaher K., Zerhouni N. Bearing health monitoring based on Hilbert-Huang transform, support vector machine, and regression. IEEE Transactions on Instrumentation and Measurement, Vol. 64, Issue 1, 2015, p. 52-62. [Publisher]
  4. Sloukia F., Aroussi M. E., Medromi H., Wahbi M. Bearings prognostic using mixture of Gaussians hidden Markov model and support vector machine. ACS International Conference on Computer Systems and Applications, 2013. [CrossRef]
  5. Wang F., Chen X., Liu C., Yan D., Han Q., Li H. Reliability assessment of rolling bearing based on principal component analysis and Weibull proportional hazard model. IEEE International Instrumentation and Measurement Technology Conference, 2017. [CrossRef]
  6. Zhao M., Tang B., Tan Q. Bearing remaining useful life estimation based on time-frequency representation and supervised dimensionality reduction. Measurement, Vol. 86, 2016, p. 41-55. [Publisher]
  7. Guo L., Guo H., Huang H., He X., Li S. Multifeatures fusion and nonlinear dimension reduction for intelligent bearing condition monitoring. Shock and Vibration, Vol. 2016, 2016, p. 4632562. [Publisher]
  8. Zurita D., Carino J. A., Delgado M., Ortega J. A. Distributed neuro-fuzzy feature forecasting approach for condition monitoring. IEEE Conference on Emerging Technology and Factory Automation, Barcelona, 2014. [CrossRef]
  9. Zhanga Bingsen, Zhang Lijun, Xub Jinwu Remaining useful life prediction for rolling element bearing based on ensemble learning. Chemical Engineering Transactions, Vol. 33, 2013, p. 157-162. [CrossRef]
  10. Singh Sandip Kumar, Kumar Sandeep, Dwivedi J. P. A novel soft computing method for engine RUL prediction. Multimedia Tools and Application, Vol. 78, Issue 4, 2019, p. 4065-4087. [Publisher]
  11. IEEE PHM 2012 Prognostic Challenge. Outline, Experiments, Scoring of Results, Winners, 2012. [CrossRef]
  12. Prosvirin A., Islam M. M. M., Kim C., Kim J. M. Fault prediction of rolling element bearings using one class least squares SVM. The Engineering and Arts Society in Korea, EASKO, 2017. [CrossRef]
  13. Aggarwal C. C. Data mining: The Textbook. Springer, Berlin, Germany, 2015. [Publisher]